菜单 学习猿地 - LMONKEY

VIP

开通学习猿地VIP

尊享10项VIP特权 持续新增

知识通关挑战

打卡带练!告别无效练习

接私单赚外块

VIP优先接,累计金额超百万

学习猿地私房课免费学

大厂实战课仅对VIP开放

你的一对一导师

每月可免费咨询大牛30次

领取更多软件工程师实用特权

入驻
225
0

sklearn

原创
05/13 14:22
阅读数 10726

分类任务、回归任务、聚类任务、降维任务、模型选择、数据预处理

from sklearn.cluster import KMeans
import numpy as np

def loadData(filePath):
    fr = open(filePath,'r+',encoding='UTF-8')
    lines = fr.readlines()
    retData = []
    retCityName = []
    for line in lines:
        items = line.strip().split(',')
        retCityName.append(items[0])
        retData.append([float(items[i]) for i in range(1,len(items))])

    #print(retData)
    #print(retCityName)
    
    return retData,retCityName


if __name__=='__main__':
    data,cityName = loadData('city.txt')
    
    km = KMeans(n_clusters=4)
    label = km.fit_predict(data)
    #print(label)   # data中每一条的类别
    #print(km.cluster_centers_) # 聚类中心
    expenses = np.sum(km.cluster_centers_,axis=1)

    CityCluster=[[],[],[],[]]
    for i in range(len(cityName)):
        CityCluster[label[i]].append(cityName[i])
    for i in range(len(CityCluster)):
        print('Expenses:%.2f' % expenses[i])
        print(CityCluster[i])

发表评论

0/200
225 点赞
0 评论
收藏
为你推荐 换一批