Altair Debugging Guide#

In this notebook we show you common debugging techniques that you can use if you run into issues with Altair.

You can jump to the following sections:

  • Installation and Setup when Altair is not installed correctly

  • Display Issues when you don’t see a chart

  • Invalid Specifications when you get an error

  • Properties are Being Ignored when you don’t see any errors or warnings

  • Asking for Help when you get stuck

  • Reporting Issues when you find a bug

In addition to this notebook, you might find the Frequently Asked Questions and Display Troubleshooting guides helpful.

This notebook is part of the data visualization curriculum.

Installation#

These instructions follow the Altair documentation but focus on some specifics for this series of notebooks.

In every notebook, we will import the Altair and Vega Datasets packages. If you are running this notebook on Colab, Altair and Vega Datasets should be preinstalled and ready to go. The notebooks in this series are designed for Colab but should also work in Jupyter Lab or the Jupyter Notebook (the notebook requires a bit more setup described below) but additional packages are required.

If you are running in Jupyter Lab or Jupyter Notebooks, you have to install the necessary packages by running the following command in your terminal.

pip install altair vega_datasets

Or if you use Conda

conda install -c conda-forge altair vega_datasets

You can run command line commands from a code cell by prefixing it with !. For example, to install Altair and Vega Datasets with Pip, you can run the following cell.

!pip install altair vega_datasets
Requirement already satisfied: altair in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (5.1.2)
Requirement already satisfied: vega_datasets in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (0.9.0)
Requirement already satisfied: jinja2 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from altair) (3.1.2)
Requirement already satisfied: jsonschema>=3.0 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from altair) (4.19.2)
Requirement already satisfied: numpy in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from altair) (1.23.5)
Requirement already satisfied: packaging in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from altair) (23.2)
Requirement already satisfied: pandas>=0.25 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from altair) (2.0.3)
Requirement already satisfied: toolz in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from altair) (0.12.0)
Requirement already satisfied: typing-extensions>=4.0.1 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from altair) (4.5.0)
Requirement already satisfied: attrs>=22.2.0 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from jsonschema>=3.0->altair) (23.1.0)
Requirement already satisfied: importlib-resources>=1.4.0 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from jsonschema>=3.0->altair) (6.1.1)
Requirement already satisfied: jsonschema-specifications>=2023.03.6 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from jsonschema>=3.0->altair) (2023.7.1)
Requirement already satisfied: pkgutil-resolve-name>=1.3.10 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from jsonschema>=3.0->altair) (1.3.10)
Requirement already satisfied: referencing>=0.28.4 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from jsonschema>=3.0->altair) (0.30.2)
Requirement already satisfied: rpds-py>=0.7.1 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from jsonschema>=3.0->altair) (0.12.0)
Requirement already satisfied: python-dateutil>=2.8.2 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from pandas>=0.25->altair) (2.8.2)
Requirement already satisfied: pytz>=2020.1 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from pandas>=0.25->altair) (2023.3.post1)
Requirement already satisfied: tzdata>=2022.1 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from pandas>=0.25->altair) (2023.3)
Requirement already satisfied: MarkupSafe>=2.0 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from jinja2->altair) (2.1.3)
Requirement already satisfied: zipp>=3.1.0 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from importlib-resources>=1.4.0->jsonschema>=3.0->altair) (3.17.0)
Requirement already satisfied: six>=1.5 in /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages (from python-dateutil>=2.8.2->pandas>=0.25->altair) (1.16.0)
[notice] A new release of pip is available: 23.0.1 -> 23.3.1
[notice] To update, run: pip install --upgrade pip
import altair as alt
from vega_datasets import data

Make sure you are Using the Latest Version of Altair#

If you are running into issues with Altair, first make sure that you are running the latest version. To check the version of Altair that you have installed, run the cell below.

alt.__version__
'5.1.2'

To check what the latest version of altair is, go to this page or run the cell below (requires Python 3).

import urllib.request, json 
with urllib.request.urlopen("https://pypi.org/pypi/altair/json") as url:
    print(json.loads(url.read().decode())['info']['version'])
5.1.2

If you are not running the latest version, you can update it with pip. You can update Altair and Vega Datasets by running this command in your terminal.

pip install -U altair vega_datasets

Try Making a Chart#

Now you can create an Altair chart.

cars = data.cars()

alt.Chart(cars).mark_point().encode(
    x='Horsepower',
    y='Displacement',
    color='Origin'
)

Special Setup for the Jupyter Notebook#

If you are running in Jupyter Lab, Jupyter Notebook, or Colab (and have a working Internet connection) you should be seeing a chart. If you are running in another environment (or offline), you will need to tell Altair to use a different renderer;

To activate a different renderer in a notebook cell:

# to run in nteract, VSCode, or offline in JupyterLab
alt.renderers.enable('mimebundle')

To run offline in Jupyter Notebook you must install an additional dependency, the vega package. Run this command in your terminal:

pip install vega

Then activate the notebook renderer:

# to run offline in Jupyter Notebook
alt.renderers.enable('notebook')

These instruction follow the instructions on the Altair website.

Display Troubleshooting#

If you are having issues with seeing a chart, make sure your setup is correct by following the debugging instruction above. If you are still having issues, follow the instruction about debugging display issues in the Altair documentation.

Non Existent Fields#

A common error is accidentally using a field that does not exit.

import pandas as pd

df = pd.DataFrame({'x': [1, 2, 3],
                     'y': [3, 1, 4]})

alt.Chart(df).mark_point().encode(
    x='x:Q',
    y='y:Q',
    color='color:Q'  # <-- this field does not exist in the data!
)

Check the spelling of your files and print the data source to confirm that the data and fields exist. For instance, here you see that color is not a vaid field.

df.head()
x y
0 1 3
1 2 1
2 3 4

Invalid Specifications#

Another common issue is creating an invalid specification and getting an error.

Invalid Properties#

Altair might show an SchemaValidationError or ValueError. Read the error message carefully. Usually it will tell you what is going wrong.

For example, if you forget the mark type, you will see this SchemaValidationError.

alt.Chart(cars).encode(
    y='Horsepower'
)
---------------------------------------------------------------------------
SchemaValidationError                     Traceback (most recent call last)
File /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/altair/vegalite/v5/api.py:2713, in Chart.to_dict(self, validate, format, ignore, context)
   2709     copy.data = core.InlineData(values=[{}])
   2710     return super(Chart, copy).to_dict(
   2711         validate=validate, format=format, ignore=ignore, context=context
   2712     )
-> 2713 return super().to_dict(
   2714     validate=validate, format=format, ignore=ignore, context=context
   2715 )

File /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/altair/vegalite/v5/api.py:921, in TopLevelMixin.to_dict(self, validate, format, ignore, context)
    916 context["top_level"] = False
    918 # TopLevelMixin instance does not necessarily have to_dict defined
    919 # but due to how Altair is set up this should hold.
    920 # Too complex to type hint right now
--> 921 vegalite_spec = super(TopLevelMixin, copy).to_dict(  # type: ignore[misc]
    922     validate=validate, ignore=ignore, context=dict(context, pre_transform=False)
    923 )
    925 # TODO: following entries are added after validation. Should they be validated?
    926 if is_top_level:
    927     # since this is top-level we add $schema if it's missing

File /opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/altair/utils/schemapi.py:983, in SchemaBase.to_dict(self, validate, ignore, context)
    976         self.validate(result)
    977     except jsonschema.ValidationError as err:
    978         # We do not raise `from err` as else the resulting
    979         # traceback is very long as it contains part
    980         # of the Vega-Lite schema. It would also first
    981         # show the less helpful ValidationError instead of
    982         # the more user friendly SchemaValidationError
--> 983         raise SchemaValidationError(self, err) from None
    984 return result

SchemaValidationError: '{'data': {'name': 'data-f02450ab61490a1363517a0190416235'}, 'encoding': {'y': {'field': 'Horsepower', 'type': 'quantitative'}}}' is an invalid value.

'mark' is a required property
alt.Chart(...)

Or if you use a non-existent channel, you get a ValueError.

alt.Chart(cars)).mark_point().encode(
    z='Horsepower'
)
  Cell In[9], line 1
    alt.Chart(cars)).mark_point().encode(
                   ^
SyntaxError: unmatched ')'

Properties are Being Ignored#

Altair might ignore a property that you specified. In the chart below, we are using a text channel, which is only compatible with mark_text. You do not see an error or a warning about this in the notebook. However, the underlying Vega-Lite library will show a warning in the browser console. Press Alt+Cmd+I on Mac or Alt+Ctrl+I on Windows and Linux to open the developer tools and click on the Console tab. When you run the example in the cell below, you will see a the following warning.

WARN text dropped as it is incompatible with "bar".
alt.Chart(cars).mark_bar().encode(
    y='mean(Horsepower)',
    text='mean(Acceleration)'
)

If you find yourself debugging issues related to Vega-Lite, you can open the chart in the Vega Editor either by clicking on the “Open in Vega Editor” link at the bottom of the chart or in the action menu (click to open) at the top right of a chart. The Vega Editor provides additional debugging but you will be writing Vega-Lite JSON instead of Altair in Python.

Note: The Vega Editor may be using a newer version of Vega-Lite and so the behavior may vary.

Asking for Help#

If you find a problem with Altair and get stuck, you can ask a question on Stack Overflow. Ask your question with the altair and vega-lite tags. You can find a list of questions people have asked before here.

Reporting Issues#

If you find a problem with Altair and believe it is a bug, please create an issue in the Altair GitHub repo with a description of your problem. If you believe the issue is related to the underlying Vega-Lite library, please create an issue in the Vega-Lite GitHub repo.