May 10-12, 2023
Vancouver, British Columbia, Canada + Virtual
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit North America 2023 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Pacific Daylight Time (UTC/GMT -8). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Back To Schedule
Monday, May 1 • 7:00am - 7:40am
(Virtual) Using Apache OpenNLP with OpenSearch K-NN Vector Search - Jeff Zemerick, Mountain Fog

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Apache OpenNLP is a machine learning library for Java that provides natural language processing capabilities, and OpenSearch is a distributed open-source search and analytics suite. In this talk, Jeff will provide an overview of Apache OpenNLP and its capabilities and show how it can be used to power OpenSearch’s k-NN (nearest neighbor) vector search. Jeff will introduce vector search and show how it differs from “traditional” search, followed by a demonstration of how Apache OpenNLP can generate vectors suitable for indexing into OpenSearch and for querying. Attendees will come away with knowledge of how natural language processing, ONNX Runtime, and vector search can work together to provide a powerful search capability. All software used is open-source and sample source code will be provided to get started with your own projects!

avatar for Jeff Zemerick

Jeff Zemerick

Cloud and NLP Consultant, Mountain Fog
Jeff is a consultant in the areas of cloud, NLP, and search. Based outside of Pittsburgh, PA, USA, Jeff is the current chair of the Apache OpenNLP project, an infrequent piano player, and best friends to two energetic dogs.

Monday May 1, 2023 7:00am - 7:40am PDT
  Open AI & Data Forum, Natural Language Processing