Resolving John Snow Labs(JSL) Library Error in Databricks to seamlessly make insightful decisions driving the business objectives.

Why is it essential to resolve a JSL Library Error?

Resolving errors in the John Snow Labs (JSL) Library in Databricks is critical for seamless and efficient data analysis and processing. It helps achieve business objectives, such as making data-driven decisions, identifying trends and patterns, and ultimately making informed business decisions. To resolve errors in JSL Library in Databricks, one needs to identify the root cause and apply the appropriate solution.

In this solution the TekLink Team was working on a requirement for Text Analysis. We used the JSL Libraries as part of the solution. The solution necessitated installation of JSL Libraries in Databricks.

To install the JSL Libraries we followed the below steps:

Workspace –> Users –> Create –> Library, to download com.johnsnowlabs.nlp with spark-nlp_2.12 version 4.2.4 from the maven repository.

We encountered the following error after downloading the required resources:

super (DocumentAssembler,self).__init__(classname=”com.johnsnowlabs.nlp.DocumentAssembler”)

On studying further, we realized that the error was due to the absence of valid JSL Version and the necessary functions.

The Solution:

To resolve the issue, we then downloaded the cluster from the JSL website, which includes all the libraries and function related to JSL. While downloading JSL we also faced problems like no cluster created even after following complete steps and another problem was cluster created but there needed to be functions and libraries installed.

Steps to download JSL cluster:

  • Login to John Snow Labs website
  • Click on Install in my Databricks account.
  • Fill First name, Last Name Company email id.
  • Go to Databricks Workspace copy the URL and paste it in Databricks instance URL.
  • Go to user settings access token click on generate tokenpaste it in the Databricks access token.
  • Click on test connection.
  • After testing the connection, it will prompt to create new a cluster or to select an existing cluster. Select the Create Cluster option and click on submit.

You will get a validation mail to complete the validation process.

  • Once the cluster is created, change the size of the cluster as it, by default creates a high volume of cluster.
  • Now you will be able to use the cluster with JSL libraries.
“TekLink’s team exceeded Kellogg Latin America’s expectations with the implementation of Anaplan. Not only their diligence and technical mastery were evident, but also provided critical and out-of-the-box solutions to meet the project’s criteria and expand its scope.”
Francisco Ibarra
Francisco Ibarra

Sr. Manager

“TekLink provided worry free BEx to AO Migration by analyzing and converting our 500+ BEx workbooks to Analysis for Office while also increasing adoption by running power user workshops.”
Lakshmi Thota
Lakshmi Thota

Sr. Manager

"We partnered with TekLink based upon a previous positive experience where they did a great job for us as well as hearing positive feedback about their excellent level of service. We’ve also interviewed many of their experts and consistently found their candidates to be the most technically sound with excellent BW and HANA knowledge above and beyond that of their competitors. Teklink has certainly helped us to stabilize and improve the reliability of our BI operations"
Patrick Bachman
Patrick Bachman

IT Architect

Contact Us to know more