解决Python中pandas读取*.csv文件出现编码问题

yipeiwu_com6年前Python基础

1、问题

在使用Python中pandas读取csv文件时,由于文件编码格式出现以下问题:

Traceback (most recent call last):
 File "pandas\_libs\parsers.pyx", line 1134, in pandas._libs.parsers.TextReader._convert_tokens
 File "pandas\_libs\parsers.pyx", line 1240, in pandas._libs.parsers.TextReader._convert_with_dtype
 File "pandas\_libs\parsers.pyx", line 1256, in pandas._libs.parsers.TextReader._string_convert
 File "pandas\_libs\parsers.pyx", line 1494, in pandas._libs.parsers._string_box_utf8
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa0 in position 19: invalid start byte
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
 File "E:\PyCharm 2017.3.4\helpers\pydev\pydevd.py", line 1668, in <module>
  main()
 File "E:\PyCharm 2017.3.4\helpers\pydev\pydevd.py", line 1662, in main
  globals = debugger.run(setup['file'], None, None, is_module)
 File "E:\PyCharm 2017.3.4\helpers\pydev\pydevd.py", line 1072, in run
  pydev_imports.execfile(file, globals, locals) # execute the script
 File "E:\PyCharm 2017.3.4\helpers\pydev\_pydev_imps\_pydev_execfile.py", line 18, in execfile
  exec(compile(contents+"\n", file, 'exec'), glob, loc)
 File "F:/OneDrive - emails.bjut.edu.cn/Program/Python/DCAE/test.py", line 18, in <module>
  load_phenotypes_ABIDE2_RfMRIMaps()
 File "F:/OneDrive - emails.bjut.edu.cn/Program/Python/DCAE\Data\load_data.py", line 109, in load_phenotypes_ABIDE2_RfMRIMaps
  pheno = pd.read_csv(pheno_path)
 File "E:\Python\Python35\lib\site-packages\pandas\io\parsers.py", line 678, in parser_f
  return _read(filepath_or_buffer, kwds)
 File "E:\Python\Python35\lib\site-packages\pandas\io\parsers.py", line 446, in _read
  data = parser.read(nrows)
 File "E:\Python\Python35\lib\site-packages\pandas\io\parsers.py", line 1036, in read
  ret = self._engine.read(nrows)
 File "E:\Python\Python35\lib\site-packages\pandas\io\parsers.py", line 1848, in read
  data = self._reader.read(nrows)
 File "pandas\_libs\parsers.pyx", line 876, in pandas._libs.parsers.TextReader.read
 File "pandas\_libs\parsers.pyx", line 891, in pandas._libs.parsers.TextReader._read_low_memory
 File "pandas\_libs\parsers.pyx", line 968, in pandas._libs.parsers.TextReader._read_rows
 File "pandas\_libs\parsers.pyx", line 1094, in pandas._libs.parsers.TextReader._convert_column_data
 File "pandas\_libs\parsers.pyx", line 1141, in pandas._libs.parsers.TextReader._convert_tokens
 File "pandas\_libs\parsers.pyx", line 1240, in pandas._libs.parsers.TextReader._convert_with_dtype
 File "pandas\_libs\parsers.pyx", line 1256, in pandas._libs.parsers.TextReader._string_convert
 File "pandas\_libs\parsers.pyx", line 1494, in pandas._libs.parsers._string_box_utf8
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa0 in position 19: invalid start byte

我认为该问题是由于文件编码格式不是'utf-8'所导致的,因此,尝试将文件格式进行转换,转换方式如下:

首先使用txt文本打开文件,然后另存为,在右下角将编码改为‘UTF-8',点击保存即可

总结

以上所述是小编给大家介绍的解决Python中pandas读取*.csv文件出现编码问题 ,希望对大家有所帮助,如果大家有任何疑问请给我留言,小编会及时回复大家的。在此也非常感谢大家对【听图阁-专注于Python设计】网站的支持!
如果你觉得本文对你有帮助,欢迎转载,烦请注明出处,谢谢!

相关文章

Django如何实现内容缓存示例详解

Django如何实现内容缓存示例详解

前言 本文主要给大家介绍了关于Django实现内容缓存的相关内容,分享出来供大家参考学习,下面话不多说了,来一起看看详细的介绍吧。 1.缓存的简介 在动态网站中,用户所有的请求,服务器都...

python 循环数据赋值实例

python在数值赋值的时候可以采用数值内循环赋值,很方便 如下 a = [x for x in range(10)] 这样 a = [0,1,2,3,4,5,6,7,8,9]...

Python数据结构之顺序表的实现代码示例

Python数据结构之顺序表的实现代码示例

顺序表即线性表的顺序存储结构。它是通过一组地址连续的存储单元对线性表中的数据进行存储的,相邻的两个元素在物理位置上也是相邻的。比如,第1个元素是存储在线性表的起始位置LOC(1),那么第...

PyCharm搭建Spark开发环境的实现步骤

PyCharm搭建Spark开发环境的实现步骤

1.安装好JDK 下载并安装好jdk-12.0.1_windows-x64_bin.exe,配置环境变量: 新建系统变量JAVA_HOME,值为Java安装路径 新建系统变量...

python 将print输出的内容保存到txt文件中

具体代码如下所示: import sys import os class Logger(object): def __init__(self, filename="Default...