Issues with Python pandas: read_html and python3-lxml installation


I'm trying to run the following code, to no avail. To my knowledge, there aren't any syntax errors.

import quandl
import pandas as pd

# read_html returns a list of DataFrames, one per table found on the page
fifty_states = pd.read_html('https://simple.wikipedia.org/wiki/List_of_U.S._states')
print(fifty_states)

I'm getting the following error when I run the code:

Traceback (most recent call last):
  File "c:/users/dave/documents/python files/helloworld.py", line 15, in <module>
    fiddy_states = pd.read_html('http://simple.wikipedia.org/wiki/List_of_U.S._states')
  File "c:\python35\lib\site-packages\pandas\io\html.py", line 874, in read_html
    parse_dates, tupleize_cols, thousands, attrs, encoding)
  File "c:\python35\lib\site-packages\pandas\io\html.py", line 726, in _parse
    parser = _parser_dispatch(flav)
  File "c:\python35\lib\site-packages\pandas\io\html.py", line 685, in _parser_dispatch
    raise ImportError("lxml not found, please install it")
ImportError: lxml not found, please install it

I'm not sure why this is occurring, since I (should) have the packages required to run the code. I do have problems installing lxml and python3-lxml, though: both packages fail to install. As a backup, I've installed the following:

python-dev libxml2-dev libxslt1-dev zlib1g-dev

in addition to 'html5lib', which I've read is a suitable replacement for lxml.

I'm not sure what else to do at this point, since the fixes I found by searching for similar problems (i.e. installing lxml) don't apply to me (I can't install lxml in any format via pip on the command line).

Any help is appreciated.

Edit: It appears lxml was never installed on my computer. It's weird, because I'm unable to install it via pip install lxml. Here are the error logs from attempting the install:

Collecting lxml
  Using cached lxml-3.6.4.tar.gz
Building wheels for collected packages: lxml
  Running setup.py bdist_wheel for lxml ... error
  Complete output from command c:\python35\python.exe -u -c "import setuptools, tokenize;__file__='c:\\users\\dwang\\appdata\\local\\temp\\pip-build-738bf61u\\lxml\\setup.py';exec(compile(getattr(tokenize, 'open', open)(__file__).read().replace('\r\n', '\n'), __file__, 'exec'))" bdist_wheel -d c:\users\dwang\appdata\local\temp\tmpm9z4yol6pip-wheel- --python-tag cp35:
  Building lxml version 3.6.4.
  Building without Cython.
  ERROR: b"'xslt-config' is not recognized as an internal or external command,\r\noperable program or batch file.\r\n"
  ** make sure the development packages of libxml2 and libxslt are installed **

  Using build configuration of libxslt
  running bdist_wheel
  running build
  running build_py
  creating build
  creating build\lib.win-amd64-3.5
  creating build\lib.win-amd64-3.5\lxml
  copying src\lxml\builder.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\cssselect.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\doctestcompare.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\elementinclude.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\pyclasslookup.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\sax.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\usedoctest.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\_elementpath.py -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\__init__.py -> build\lib.win-amd64-3.5\lxml
  creating build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\__init__.py -> build\lib.win-amd64-3.5\lxml\includes
  creating build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\builder.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\clean.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\defs.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\diff.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\elementsoup.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\formfill.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\html5parser.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\soupparser.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\usedoctest.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\_diffcommand.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\_html5builder.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\_setmixin.py -> build\lib.win-amd64-3.5\lxml\html
  copying src\lxml\html\__init__.py -> build\lib.win-amd64-3.5\lxml\html
  creating build\lib.win-amd64-3.5\lxml\isoschematron
  copying src\lxml\isoschematron\__init__.py -> build\lib.win-amd64-3.5\lxml\isoschematron
  copying src\lxml\lxml.etree.h -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\lxml.etree_api.h -> build\lib.win-amd64-3.5\lxml
  copying src\lxml\includes\c14n.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\config.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\dtdvalid.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\etreepublic.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\htmlparser.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\relaxng.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\schematron.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\tree.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\uri.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xinclude.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xmlerror.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xmlparser.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xmlschema.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xpath.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\xslt.pxd -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\etree_defs.h -> build\lib.win-amd64-3.5\lxml\includes
  copying src\lxml\includes\lxml-version.h -> build\lib.win-amd64-3.5\lxml\includes
  creating build\lib.win-amd64-3.5\lxml\isoschematron\resources
  creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\rng
  copying src\lxml\isoschematron\resources\rng\iso-schematron.rng -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\rng
  creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
  copying src\lxml\isoschematron\resources\xsl\rng2schtrn.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
  copying src\lxml\isoschematron\resources\xsl\xsd2schtrn.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
  creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_abstract_expand.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_dsdl_include.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_schematron_message.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_schematron_skeleton_for_xslt1.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_svrl_for_xslt1.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\readme.txt -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
  running build_ext
  building 'lxml.etree' extension
  error: Unable to find vcvarsall.bat

  ----------------------------------------
  Failed building wheel for lxml
  Running setup.py clean for lxml
Failed to build lxml
Installing collected packages: lxml
  Running setup.py install for lxml ... error
    Complete output from command c:\python35\python.exe -u -c "import setuptools, tokenize;__file__='c:\\users\\dwang\\appdata\\local\\temp\\pip-build-738bf61u\\lxml\\setup.py';exec(compile(getattr(tokenize, 'open', open)(__file__).read().replace('\r\n', '\n'), __file__, 'exec'))" install --record c:\users\dwang\appdata\local\temp\pip-4_tf2u3a-record\install-record.txt --single-version-externally-managed --compile:
    Building lxml version 3.6.4.
    Building without Cython.
    ERROR: b"'xslt-config' is not recognized as an internal or external command,\r\noperable program or batch file.\r\n"
    ** make sure the development packages of libxml2 and libxslt are installed **

    Using build configuration of libxslt
    running install
    running build
    running build_py
    creating build
    creating build\lib.win-amd64-3.5
    creating build\lib.win-amd64-3.5\lxml
    copying src\lxml\builder.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\cssselect.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\doctestcompare.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\elementinclude.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\pyclasslookup.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\sax.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\usedoctest.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\_elementpath.py -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\__init__.py -> build\lib.win-amd64-3.5\lxml
    creating build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\__init__.py -> build\lib.win-amd64-3.5\lxml\includes
    creating build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\builder.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\clean.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\defs.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\diff.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\elementsoup.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\formfill.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\html5parser.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\soupparser.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\usedoctest.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\_diffcommand.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\_html5builder.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\_setmixin.py -> build\lib.win-amd64-3.5\lxml\html
    copying src\lxml\html\__init__.py -> build\lib.win-amd64-3.5\lxml\html
    creating build\lib.win-amd64-3.5\lxml\isoschematron
    copying src\lxml\isoschematron\__init__.py -> build\lib.win-amd64-3.5\lxml\isoschematron
    copying src\lxml\lxml.etree.h -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\lxml.etree_api.h -> build\lib.win-amd64-3.5\lxml
    copying src\lxml\includes\c14n.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\config.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\dtdvalid.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\etreepublic.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\htmlparser.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\relaxng.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\schematron.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\tree.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\uri.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xinclude.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xmlerror.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xmlparser.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xmlschema.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xpath.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\xslt.pxd -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\etree_defs.h -> build\lib.win-amd64-3.5\lxml\includes
    copying src\lxml\includes\lxml-version.h -> build\lib.win-amd64-3.5\lxml\includes
    creating build\lib.win-amd64-3.5\lxml\isoschematron\resources
    creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\rng
    copying src\lxml\isoschematron\resources\rng\iso-schematron.rng -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\rng
    creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
    copying src\lxml\isoschematron\resources\xsl\rng2schtrn.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
    copying src\lxml\isoschematron\resources\xsl\xsd2schtrn.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl
    creating build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_abstract_expand.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_dsdl_include.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_schematron_message.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_schematron_skeleton_for_xslt1.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\iso_svrl_for_xslt1.xsl -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    copying src\lxml\isoschematron\resources\xsl\iso-schematron-xslt1\readme.txt -> build\lib.win-amd64-3.5\lxml\isoschematron\resources\xsl\iso-schematron-xslt1
    running build_ext
    building 'lxml.etree' extension
    error: Unable to find vcvarsall.bat

    ----------------------------------------
Command "c:\python35\python.exe -u -c "import setuptools, tokenize;__file__='c:\\users\\dwang\\appdata\\local\\temp\\pip-build-738bf61u\\lxml\\setup.py';exec(compile(getattr(tokenize, 'open', open)(__file__).read().replace('\r\n', '\n'), __file__, 'exec'))" install --record c:\users\dwang\appdata\local\temp\pip-4_tf2u3a-record\install-record.txt --single-version-externally-managed --compile" failed with error code 1 in c:\users\dwang\appdata\local\temp\pip-build-738bf61u\lxml\
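
The log ends with pip failing to compile lxml from source (no C compiler was found), which fits the conclusion that lxml is not installed. A quick way to confirm which parser backends this Python interpreter can actually see is sketched below. The helper name available_parsers and the list of modules checked are illustrative choices, not part of pandas; the sketch relies only on importlib from the standard library and does not import the packages themselves.

```python
import importlib.util

# Import names of the parser backends that pandas.read_html can use.
# Note: the pip package "beautifulsoup4" is imported as "bs4".
PARSER_MODULES = ("lxml", "bs4", "html5lib")


def available_parsers(modules=PARSER_MODULES):
    """Map each module name to True/False depending on whether it can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    for name, found in available_parsers().items():
        print("{}: {}".format(name, "found" if found else "missing"))
```

Running this with c:\python35\python.exe (the interpreter named in the traceback) checks that installation specifically, which matters if more than one Python is present on the machine.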

From what I understand, according to the docs, if read_html() fails to use lxml, it should fall back to html5lib, but it looks like that does not happen in this case, and the error is thrown instead.

Try explicitly stating the flavor:

fifty_states = pd.read_html('https://simple.wikipedia.org/wiki/List_of_U.S._states', flavor='html5lib')
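
If the script should keep working on machines both with and without lxml, the fallback can also be written out by hand. The following is a minimal sketch, not pandas' own behavior: read_tables_with_fallback and its reader parameter are hypothetical names introduced here, and the sketch assumes that read_html raises ImportError when the requested backend is missing, as the traceback in the question shows for lxml. Note that flavor='html5lib' needs both the html5lib and bs4 (BeautifulSoup 4) packages installed.

```python
def read_tables_with_fallback(source, flavors=("lxml", "html5lib"), reader=None):
    """Try each parser flavor in order, moving on when its backend is missing.

    `reader` defaults to pandas.read_html; it is a parameter only so the
    fallback logic can be exercised without pandas or a network connection.
    """
    if reader is None:
        import pandas as pd  # imported lazily, only when the default is needed
        reader = pd.read_html

    errors = []
    for flavor in flavors:
        try:
            return reader(source, flavor=flavor)
        except ImportError as exc:  # the backend for this flavor is not installed
            errors.append("{}: {}".format(flavor, exc))
    raise ImportError("no usable HTML parser; tried " + "; ".join(errors))


# Example call (needs pandas, a parser backend, and network access):
# fifty_states = read_tables_with_fallback(
#     'https://simple.wikipedia.org/wiki/List_of_U.S._states')
```

Only ImportError is caught, so genuine parsing problems (for example, no tables found on the page) still surface as their own exceptions instead of being hidden by the fallback.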
