Natural Language Processing (NLP) and Machine Learning with Apache Spark

Avengers Tower (Room 1)

FRI 11:00 AM - 11:45 AM

Attendees will be presented with the popular machine learning techniques to classify, train, and predict, a structured text corpus. The code base will make use of the ecosystem from Apache Spark, and in particular, its Spark SQL (for loading data) and MLlib (for machine learning) features. In addition, the program will be written using the Scala programming language and use the IntelliJ IDEA code editor. Further real-world use cases around sentiment analysis, and the interplay with social graphs, will be illustrated.