
LangExtract
API Overview
From Google's open-source project LangExtract: https://github.com/google/langextract
LangExtract is a Python library developed by Google, focused on "extracting structured information from unstructured text using large language models (LLMs)." Its core features include:
It can extract structured data such as entities, attributes, and relationships from text—like medical reports or literary works—based on your custom rules and examples.
The tool precisely locates the extracted results within the original text, making it easy to highlight, visualize, and perform checks or traceability tasks.
It’s especially well-suited for handling long texts, such as parsing an entire novel or lengthy medical report at once, leveraging chunking, parallel processing, and iterative refinement to enhance recall rates.
An interactive HTML visualization feature is built-in, allowing you to vividly display the extracted results directly on a web page.
Price: Calculated based on the cost of model calls
API Console
Log in to explore more features! Click to Log In