Issues with Python pandas: read_html and python3-lxml installation
I'm trying to run the following code, to no avail. To my knowledge, there aren't any syntax errors.
    import quandl
    import pandas as pd

    fifty_states = pd.read_html('https://simple.wikipedia.org/wiki/list_of_u.s._states')
    print(fifty_states)
I'm getting the following error when I run the code:
    Traceback (most recent call last):
      File "c:/users/dave/documents/python files/helloworld.py", line 15, in <module>
        fiddy_states = pd.read_html('http://simple.wikipedia.org/wiki/list_of_u.s._states')
      File "c:\python35\lib\site-packages\pandas\io\html.py", line 874, in read_html
        parse_dates, tupleize_cols, thousands, attrs, encoding)
      File "c:\python35\lib\site-packages\pandas\io\html.py", line 726, in _parse
        parser = _parser_dispatch(flav)
      File "c:\python35\lib\site-packages\pandas\io\html.py", line 685, in _parser_dispatch
        raise ImportError("lxml not found, please install it")
    ImportError: lxml not found, please install it
I'm not sure why this is occurring, since I (should) have the packages required to run the code. I do have problems installing lxml and python3-lxml; both packages fail to install. As a backup, I've installed the following:
python-dev libxml2-dev libxslt1-dev zlib1g-dev
in addition to 'html5lib', which I've read is a suitable replacement for lxml.
I'm not sure what else to do at this point, since the similar corrections I've found by searching (i.e. installing lxml) don't apply to me (I can't install lxml in any format via pip on the command line).
Any help is appreciated.
Edit: It appears that lxml was never installed on my computer. It's weird, because I'm unable to install it via pip install lxml. Here are the error logs from attempting the install:
Collecting lxml
  Using cached lxml-3.6.4.tar.gz
Building wheels for collected packages: lxml
  Running setup.py bdist_wheel for lxml ... error
  Complete output from command c:\python35\python.exe -u -c "import setuptools, tokenize;__file__='c:\\users\\dwang\\appdata\\local\\temp\\pip-build-738bf61u\\lxml\\setup.py';exec(compile(getattr(tokenize, 'open', open)(__file__).read().replace('\r\n', '\n'), __file__, 'exec'))" bdist_wheel -d c:\users\dwang\appdata\local\temp\tmpm9z4yol6pip-wheel- --python-tag cp35:
  Building lxml version 3.6.4.
  Building without Cython.
  ERROR: b"'xslt-config' is not recognized as an internal or external command,\r\noperable program or batch file.\r\n"
  ** make sure the development packages of libxml2 and libxslt are installed **

  Using build configuration of libxslt
  running bdist_wheel
  running build
  running build_py
  creating build
  creating build\lib.win-amd64-3.5
  creating build\lib.win-amd64-3.5\lxml
  copying src\lxml\builder.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\cssselect.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\doctestcompare.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\elementinclude.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\pyclasslookup.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\sax.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\usedoctest.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\_elementpath.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\__init__.py -> build\lib.win-amd64-3.5\lxml
  creating build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\__init__.py -> build\lib.win-amd64-3.5\lxml\includes
  creating build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\builder.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\clean.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\defs.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\diff.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\elementsoup.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\formfill.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\html5parser.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\soupparser.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\usedoctest.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\_diffcommand.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\_html5builder.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\_setmixin.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\__init__.py -> build\lib.win-amd64-3.5\lxml\html
  creating build\lib.win-amd64-3.5\lxml\isoschematron
  copying src\lxml\isoschematron\__init__.py -> build\lib.win-amd64-3.5\lxml\isoschematron
  copying src\lxml\lxml.etree.h -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\lxml.etree_api.h -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\includes\c14n.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\config.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\dtdvalid.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\etreepublic.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\htmlparser.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\relaxng.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\schematron.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\tree.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\uri.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xinclude.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xmlerror.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xmlparser.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xmlschema.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xpath.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xslt.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\etree_defs.h -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\lxml-version.h -> build\lib.win-amd64-3.5\lxml\includes
  creating build\lib.win-amd64-3.5\lxml\isoschematron\resources
  creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\rng
  copying src\lxml\isoschematron\resources\rng\iso-schematron.rng -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\rng
  creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
  copying src\lxml\isoschematron\resources\xsl\rng2schtrn.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
  copying src\lxml\isoschematron\resources\xsl\xsd2schtrn.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
  creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_abstract_expand.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_dsdl_include.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_schematron_message.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_schematron_skeleton_for_xslt1.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_svrl_for_xslt1.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\readme.txt -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  running build_ext
  building 'lxml.etree' extension
  error: Unable to find vcvarsall.bat

  ----------------------------------------
  Failed building wheel for lxml
  Running setup.py clean for lxml
Failed to build lxml
Installing collected packages: lxml
  Running setup.py install for lxml ... error
    Complete output from command c:\python35\python.exe -u -c "import setuptools, tokenize;__file__='c:\\users\\dwang\\appdata\\local\\temp\\pip-build-738bf61u\\lxml\\setup.py';exec(compile(getattr(tokenize, 'open', open)(__file__).read().replace('\r\n', '\n'), __file__, 'exec'))" install --record c:\users\dwang\appdata\local\temp\pip-4_tf2u3a-record\install-record.txt --single-version-externally-managed --compile:
    Building lxml version 3.6.4.
    Building without Cython.
    ERROR: b"'xslt-config' is not recognized as an internal or external command,\r\noperable program or batch file.\r\n"
    ** make sure the development packages of libxml2 and libxslt are installed **

    Using build configuration of libxslt
    running install
    running build
    running build_py
    creating build
    creating build\lib.win-amd64-3.5
    creating build\lib.win-amd64-3.5\lxml
    copying src\lxml\builder.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\cssselect.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\doctestcompare.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\elementinclude.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\pyclasslookup.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\sax.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\usedoctest.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\_elementpath.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\__init__.py -> build\lib.win-amd64-3.5\lxml
    creating build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\__init__.py -> build\lib.win-amd64-3.5\lxml\includes
    creating build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\builder.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\clean.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\defs.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\diff.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\elementsoup.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\formfill.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\html5parser.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\soupparser.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\usedoctest.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\_diffcommand.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\_html5builder.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\_setmixin.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\__init__.py -> build\lib.win-amd64-3.5\lxml\html
    creating build\lib.win-amd64-3.5\lxml\isoschematron
    copying src\lxml\isoschematron\__init__.py -> build\lib.win-amd64-3.5\lxml\isoschematron
    copying src\lxml\lxml.etree.h -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\lxml.etree_api.h -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\includes\c14n.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\config.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\dtdvalid.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\etreepublic.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\htmlparser.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\relaxng.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\schematron.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\tree.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\uri.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xinclude.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xmlerror.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xmlparser.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xmlschema.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xpath.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xslt.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\etree_defs.h -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\lxml-version.h -> build\lib.win-amd64-3.5\lxml\includes
    creating build\lib.win-amd64-3.5\lxml\isoschematron\resources
    creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\rng
    copying src\lxml\isoschematron\resources\rng\iso-schematron.rng -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\rng
    creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
    copying src\lxml\isoschematron\resources\xsl\rng2schtrn.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
    copying src\lxml\isoschematron\resources\xsl\xsd2schtrn.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
    creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_abstract_expand.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_dsdl_include.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_schematron_message.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_schematron_skeleton_for_xslt1.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_svrl_for_xslt1.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\readme.txt -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    running build_ext
    building 'lxml.etree' extension
    error: Unable to find vcvarsall.bat

    ----------------------------------------
Command "c:\python35\python.exe -u -c "import setuptools, tokenize;__file__='c:\\users\\dwang\\appdata\\local\\temp\\pip-build-738bf61u\\lxml\\setup.py';exec(compile(getattr(tokenize, 'open', open)(__file__).read().replace('\r\n', '\n'), __file__, 'exec'))" install --record c:\users\dwang\appdata\local\temp\pip-4_tf2u3a-record\install-record.txt --single-version-externally-managed --compile" failed with error code 1 in c:\users\dwang\appdata\local\temp\pip-build-738bf61u\lxml\
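A note on the log: pip picked up a source archive (lxml-3.6.4.tar.gz) and then stopped at the compile step, where vcvarsall.bat (part of the Visual C++ build tools) could not be found. A prebuilt binary wheel, if one exists for your interpreter, would skip compilation entirely. Here is a minimal sketch for printing the interpreter details such a wheel would have to match. The function name interpreter_summary is made up for illustration, and the "cp" prefix assumes CPython, which is what the --python-tag cp35 in your log suggests.

```python
import struct
import sys
import sysconfig


def interpreter_summary():
    """Collect the details a binary wheel must match (hypothetical helper)."""
    return {
        # e.g. "cp35" for CPython 3.5; the "cp" prefix assumes CPython
        "python_tag": "cp{}{}".format(*sys.version_info[:2]),
        # platform string as reported by the standard library
        "platform": sysconfig.get_platform(),
        # pointer size in bits: 32 or 64
        "bits": struct.calcsize("P") * 8,
    }


if __name__ == "__main__":
    for key, value in interpreter_summary().items():
        print(key, "=", value)
```

Run it with the same python.exe that pip uses, so the values describe the interpreter lxml would be installed into.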
From what I understand, according to the docs, if read_html() fails to use lxml, it should fall back to html5lib, but it looks like that does not happen in your case, and an error is thrown instead.
Try to explicitly state the flavor:
    fifty_states = pd.read_html('https://simple.wikipedia.org/wiki/list_of_u.s._states', flavor='html5lib')
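Keep in mind that the html5lib flavor needs both html5lib and BeautifulSoup 4 (the bs4 module) to be installed. If you would rather have the script choose a parser based on what is actually importable, something like the sketch below might work. The helper names installed, pick_flavor and read_tables are my own, not part of pandas; the only pandas call used is read_html with its flavor argument.

```python
import importlib.util


def installed(module_name):
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(module_name) is not None


def pick_flavor():
    """Pick a read_html flavor whose parser packages are present, else None."""
    if installed("lxml"):
        return "lxml"
    # The html5lib flavor relies on BeautifulSoup 4 as well as html5lib.
    if installed("bs4") and installed("html5lib"):
        return "html5lib"
    return None


def read_tables(url):
    """Read all tables from url using whichever parser is available."""
    flavor = pick_flavor()
    if flavor is None:
        raise ImportError("install lxml, or both beautifulsoup4 and html5lib")
    import pandas as pd  # imported here so the parser check runs first
    return pd.read_html(url, flavor=flavor)
```

With this, fifty_states = read_tables('https://simple.wikipedia.org/wiki/list_of_u.s._states') would use html5lib on a machine where lxml cannot be installed.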