- 添加 .idea 目录和相关配置文件,设置项目忽略文件、编码、模块管理等 - 创建商务大数据分析目录和子目录,准备数据和任务笔记本 - 添加示例数据文件:中国城市人口数据.csv - 创建任务笔记本文件,进行数据处理和分析示例
27 KiB
27 KiB
None
<html lang="en">
<head>
</head>
</html>
In [1]:
import pandas as pd
In [8]:
data1 = pd.read_excel('data/healthcare-dataset-stroke.xlsx')
data1.head(3)
Out[8]:
In [10]:
data2 = pd.read_excel('data/healthcare-dataset-age_abs.xlsx')
data2.head(3)
Out[10]:
In [17]:
print(data1.size)
data2.size
Out[17]:
In [71]:
merge_data = data1.merge(data2, on=['编号'], how='left')
merge_data.head(3)
Out[71]:
In [72]:
def age_process(x):
if (x % 1 != 0 or x < 0):
return None
return int(x)
In [73]:
merge_data['年龄'] = merge_data['年龄'].apply(lambda x: age_process(x))
In [74]:
merge_data[merge_data['年龄'].isna()]
Out[74]: