Python 유니코드 디코딩 오류, 잘못된 연속 바이트, UnicodeDecodeError, invalid continuation byte

질문

아래 항목이 왜 실패하는 것인가요? "latin-1" 코덱으로는 왜 성공하는 건가요?

o = "a test of \xe9 char" #I want this to remain a string as this is what I am receiving
v = o.decode("utf-8")

이로 인해 다음과 같은 결과가 발생합니다:

 Traceback (most recent call last):  
 File "<stdin>", line 1, in <module>  
 File "C:\Python27\lib\encodings\utf_8.py",
 line 16, in decode
     return codecs.utf_8_decode(input, errors, True) UnicodeDecodeError:
 'utf8' 코덱은 위치 10의 바이트 0xe9를 디코드할 수 없습니다: 잘못된 연속 바이트

답변

CSV 파일을 pandas.read_csv 메소드로 열려고 할 때 동일한 오류가 발생했습니다.

해결책은 인코딩을 latin-1으로 변경하는 것이었습니다:

pd.read_csv('ml-100k/u.item', sep='|', names=m_cols , encoding='latin-1')

'Python > Python FAQ' 카테고리의 다른 글

Python 파이썬 인터프리터에서 업데이트된 패키지를 다시 가져오는 방법은 무엇인가요? [중복], How to re import an updated package while in Python Interpreter? [duplicate] (0)	2023.11.03
Python defaultdict의 defaultdict?, defaultdict of defaultdict? (0)	2023.11.03
Python 두 개의 문자열 사이의 유사도 측정치를 찾으세요., Find the similarity metric between two strings (0)	2023.11.03
실행 중에 Python 버전을 어떻게 감지할 수 있나요? [중복], How do I detect the Python version at runtime? [duplicate] (0)	2023.11.02
Python 파이썬에서 크론과 유사한 스케줄러를 어떻게 사용할 수 있을까요?, How do I get a Cron like scheduler in Python? (0)	2023.11.02

Python 유니코드 디코딩 오류, 잘못된 연속 바이트, UnicodeDecodeError, invalid continuation byte

질문

답변

'Python > Python FAQ' 카테고리의 다른 글

댓글

티스토리툴바

Python 유니코드 디코딩 오류, 잘못된 연속 바이트, UnicodeDecodeError, invalid continuation byte

질문

답변

'Python > Python FAQ' 카테고리의 다른 글

관련글

댓글

티스토리툴바