Introduction to NLP with spaCy#

Learning objectives#

  • Understand the basic concepts of probabilistic language models and how these can be applied to different NLP tasks including sentiment analysis, genre classification and named-entity recognition.

  • Understand and know how to apply n-grams and word embeddings for feature extraction in a classification pipeline.

  • Have a conceptual understanding of transformers and deep learning techniques for NLP.

  • Understand and know how to use Python spaCy for NLP and develop reproducible text processing pipelines.

spaCy#

spaCy is a free, open-source library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

Suggested study plan#