Installing lxml =============== .. contents:: :depth: 1 .. 1 Where to get it 2 Requirements 3 Installation 4 Building lxml from dev sources 5 Using lxml with python-libxml2 6 Source builds on MS Windows 7 Source builds on MacOS-X Where to get it --------------- lxml is generally distributed through PyPI_. .. _PyPI: http://pypi.python.org/pypi/lxml Most **Linux** platforms come with some version of lxml readily packaged, usually named ``python-lxml`` for the Python 2.x version and ``python3-lxml`` for Python 3.x. If you can use that version, the quickest way to install lxml is to use the system package manager, e.g. ``apt-get`` on Debian/Ubuntu:: sudo apt-get install python3-lxml For **MacOS-X**, a `macport `_ of lxml is available. Try something like :: sudo port install py27-lxml To install a newer version or to install lxml on other systems, see below. Requirements ------------ You need Python 2.6 or later. Unless you are using a static binary distribution (e.g. from a Windows binary installer), lxml requires libxml2 and libxslt to be installed, in particular: * `libxml2 `_ version 2.7.0 or later. * We recommend libxml2 2.9.2 or a later version. * If you want to use the feed parser interface, especially when parsing from unicode strings, do not use libxml2 2.7.4 through 2.7.6. * `libxslt `_ version 1.1.23 or later. * We recommend libxslt 1.1.28 or later. Version 1.1.25 will not work due to a missing library symbol. Newer versions generally contain fewer bugs and are therefore recommended. XML Schema support is also still worked on in libxml2, so newer versions will give you better compliance with the W3C spec. To install the required development packages of these dependencies on Linux systems, use your distribution specific installation tool, e.g. apt-get on Debian/Ubuntu:: sudo apt-get install libxml2-dev libxslt-dev python-dev For Debian based systems, it should be enough to install the known build dependencies of the provided lxml package, e.g. :: sudo apt-get build-dep python3-lxml Installation ------------ If your system does not provide binary packages or you want to install a newer version, the best way is to get the pip_ package management tool (or use a `virtualenv `_) and run the following:: pip install lxml If you are not using pip in a virtualenv and want to install lxml globally instead, you have to run the above command as admin, e.g. on Linux:: sudo pip install lxml To install a specific version, either download the distribution manually and let pip install that, or pass the desired version to pip:: pip install lxml==3.4.2 .. _pip: http://pypi.python.org/pypi/pip To speed up the build in test environments, e.g. on a continuous integration server, disable the C compiler optimisations by setting the ``CFLAGS`` environment variable:: CFLAGS="-O0" pip install lxml (The option reads "minus Oh Zero", i.e. zero optimisations.) MS Windows .......... For MS Windows, recent lxml releases feature community donated binary distributions, although you might still want to take a look at the related `FAQ entry `_. If you fail to build lxml on your MS Windows system from the signed and tested sources that we release, consider using the binary builds from PyPI or the `unofficial Windows binaries `_ that Christoph Gohlke generously provides. Linux ..... On Linux (and most other well-behaved operating systems), ``pip`` will manage to build the source distribution as long as libxml2 and libxslt are properly installed, including development packages, i.e. header files, etc. See the requirements section above and use your system package management tool to look for packages like ``libxml2-dev`` or ``libxslt-devel``. If the build fails, make sure they are installed. Alternatively, setting ``STATIC_DEPS=true`` will download and build both libraries automatically in their latest version, e.g. ``STATIC_DEPS=true pip install lxml``. MacOS-X ....... On MacOS-X, use the following to build the source distribution, and make sure you have a working Internet connection, as this will download libxml2 and libxslt in order to build them:: STATIC_DEPS=true sudo pip install lxml Building lxml from dev sources ------------------------------ If you want to build lxml from the GitHub repository, you should read `how to build lxml from source`_ (or the file ``doc/build.txt`` in the source tree). Building from developer sources or from modified distribution sources requires Cython_ to translate the lxml sources into C code. The source distribution ships with pre-generated C source files, so you do not need Cython installed to build from release sources. .. _Cython: http://www.cython.org .. _`how to build lxml from source`: build.html If you have read these instructions and still cannot manage to install lxml, you can check the archives of the `mailing list`_ to see if your problem is known or otherwise send a mail to the list. .. _`mailing list`: http://lxml.de/mailinglist/ Using lxml with python-libxml2 ------------------------------ If you want to use lxml together with the official libxml2 Python bindings (maybe because one of your dependencies uses it), you must build lxml statically. Otherwise, the two packages will interfere in places where the libxml2 library requires global configuration, which can have any kind of effect from disappearing functionality to crashes in either of the two. To get a static build, either pass the ``--static-deps`` option to the setup.py script, or run ``pip`` with the ``STATIC_DEPS`` or ``STATICBUILD`` environment variable set to true, i.e. :: STATIC_DEPS=true pip install lxml The ``STATICBUILD`` environment variable is handled equivalently to the ``STATIC_DEPS`` variable, but is used by some other extension packages, too. Source builds on MS Windows --------------------------- Most MS Windows systems lack the necessarily tools to build software, starting with a C compiler already. Microsoft leaves it to users to install and configure them, which is usually not trivial and means that distributors cannot rely on these dependencies being available on a given system. In a way, you get what you've paid for and make others pay for it. Due to the additional lack of package management of this platform, it is best to link the library dependencies statically if you decide to build from sources, rather than using a binary installer. For that, lxml can use the `binary distribution of libxml2 and libxslt `_, which it downloads automatically during the static build. It needs both libxml2 and libxslt, as well as iconv and zlib, which are available from the same download site. Further build instructions are in the `source build documentation `_. Source builds on MacOS-X ------------------------ If you are not using macports or want to use a more recent lxml release, you have to build it yourself. While the pre-installed system libraries of libxml2 and libxslt are less outdated in recent MacOS-X versions than they used to be, so lxml should work with them out of the box, it is still recommended to use a static build with the most recent library versions. Luckily, lxml's ``setup.py`` script has built-in support for building and integrating these libraries statically during the build. Please read the `MacOS-X build instructions `_.