LangExtract

LangExtract

Extract structured information from unstructured text
2025-08-18
Data Processing
Pricing:
Calculate based on the price of model calls
Bulk order? Contact your manager for exclusive deals

API Overview

From Google's open-source project LangExtract: https://github.com/google/langextract

LangExtract is a Python library developed by Google, focused on "extracting structured information from unstructured text using large language models (LLMs)." Its core features include:

It can extract structured data such as entities, attributes, and relationships from text—like medical reports or literary works—based on your custom rules and examples.

The tool precisely locates the extracted results within the original text, making it easy to highlight, visualize, and perform checks or traceability tasks.

It’s especially well-suited for handling long texts, such as parsing an entire novel or lengthy medical report at once, leveraging chunking, parallel processing, and iterative refinement to enhance recall rates.

An interactive HTML visualization feature is built-in, allowing you to vividly display the extracted results directly on a web page.

Price: Calculated based on the cost of model calls

API Console

Log in to explore more features! Click to Log In

API Reference (2)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
reate an information extraction task
POST
Stable
View Details
View the information extraction task
GET
Stable
View Details

API Pricing

$
ModelDescription302.AI Price

reate an information extraction task

-

Calculate based on the price of model calls

View the information extraction task

-

Free