- FastAPI backend for serving GLiNER models (NER).
- Gradio frontend (optional) for interactive use.
- Prometheus metrics endpoint (`/metrics`).
- Configurable via YAML, CLI, or environment variables.
- Docker and Docker Compose support.
- ONNX inference support (including quantized models).
- API key authentication (optional).
- Custom metrics port and enable/disable option for Prometheus metrics.
For detailed documentation, see DeepWiki.
You can try a live demo of the GLiNER API container in its Huggingface Space: GLiNER API Demo.
It uses a minimally changed image so that it works in the Huggingface Space environment.
You can either build the container yourself or use a prebuilt image from GitHub Container Registry.
```shell
docker run \
  -p 8080:8080 \
  -p 9090:9090 \
  -v $(pwd)/config.yaml:/app/config.yaml \
  -v $HOME/.cache/huggingface:/app/huggingface \
  ghcr.io/freinold/gliner-api:latest
```

- `-v $(pwd)/config.yaml:/app/config.yaml` mounts your config file (edit as needed)
- `-v $HOME/.cache/huggingface:/app/huggingface` mounts your Huggingface cache for faster model loading
```shell
docker build \
  -f cpu.Dockerfile \
  --build-arg IMAGE_CREATED="$(date -u +%Y-%m-%dT%H:%M:%SZ)" \
  --build-arg IMAGE_REVISION="$(git rev-parse HEAD)" \
  --build-arg IMAGE_VERSION="$(git describe --tags --always)" \
  -t gliner-api .
```

```shell
docker run --rm \
  -p 8080:8080 \
  -p 9090:9090 \
  -v $(pwd)/config.yaml:/app/config.yaml \
  -v $HOME/.cache/huggingface:/app/huggingface \
  gliner-api
```
Edit `compose.yaml` to select the config you want (see `example_configs/`).
Then start:

```shell
docker compose up --build
```
Be sure to check the installation instructions first.
```shell
uv run main.py [OPTIONS]
```
Or with FastAPI CLI:
```shell
fastapi run main.py --host localhost
```
To list all options:

```shell
uv run main.py --help
```
| Option | Description | Default |
|---|---|---|
| `--use-case` / `--name` | Use case for the GLiNER model (application/domain) | `general` |
| `--model-id` | Huggingface model ID (browse models) | `knowledgator/gliner-x-base` |
| `--onnx-enabled` | Use ONNX for inference | `False` |
| `--onnx-model-path` | Path to ONNX model file | `model.onnx` |
| `--default-entities` | Default entities to detect | `['person', 'organization', 'location', 'date']` |
| `--default-threshold` | Default detection threshold | `0.5` |
| `--api-key` | API key for authentication (if set, required in requests) | `null` |
| `--host` | Host address | `0.0.0.0` |
| `--port` | Port | `8080` |
| `--metrics-enabled` | Enable Prometheus metrics endpoint | `True` |
| `--metrics-port` | Port for Prometheus metrics endpoint | `9090` |
| `--frontend-enabled` | Enable Gradio frontend | `True` |
| Description | Path | Demo Link |
|---|---|---|
| Gradio Frontend (if enabled) | `/` | Frontend |
| API Docs (Swagger) | `/docs` | Swagger UI |
| API Docs (ReDoc) | `/redoc` | ReDoc |
| Prometheus Metrics | `/metrics` | (no public demo link; available on metrics port if enabled) |
```shell
curl -X POST "http://localhost:8080/api/invoke" \
  -H "Content-Type: application/json" \
  -d '{"text": "Steve Jobs founded Apple in Cupertino."}'
```
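If you prefer calling the endpoint from Python, a minimal sketch of building the request body might look like this. Note that only `text` appears in the curl example above; the optional `entities` and `threshold` fields are assumptions mirroring the configurable defaults (`--default-entities`, `--default-threshold`), not confirmed API parameters:

```python
import json

def build_invoke_payload(text, entities=None, threshold=None):
    """Build a JSON body for POST /api/invoke.

    "entities" and "threshold" are hypothetical optional fields;
    check the Swagger docs at /docs for the actual request schema.
    """
    payload = {"text": text}
    if entities is not None:
        payload["entities"] = entities
    if threshold is not None:
        payload["threshold"] = threshold
    return payload

# Serialize the payload exactly as the curl example does.
body = json.dumps(build_invoke_payload("Steve Jobs founded Apple in Cupertino."))
```

Send the resulting `body` with any HTTP client (e.g. `urllib.request` or `requests`) as the POST payload to `/api/invoke`.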
Prerequisites:
- Python 3.12.11
- uv (for dependency management)
Install dependencies:
```shell
# CPU version
uv sync --extra cpu [--extra frontend]

# GPU version
uv sync --extra gpu [--extra frontend]
```
The frontend is optional but recommended for interactive use.
Install from source:
```shell
git clone https://github.com/freinold/gliner-api.git
cd gliner-api
uv sync --extra cpu  # or --extra gpu
```
You can configure the app via:

- `config.yaml` (default; see `example_configs/`)
- CLI options (see above)
- Environment variables (prefix: `GLINER_API_`)
Example configs:

- `example_configs/general.yaml` (default NER)
- `example_configs/pii.yaml` (PII detection)
- `example_configs/medical.yaml` (medical NER)
- `example_configs/general_onnx.yaml` (ONNX inference)
- `example_configs/general_onnx_quantized.yaml` (quantized ONNX)
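As an illustrative sketch, a minimal `config.yaml` could look like the following. The key names here are assumed to mirror the CLI options above; check the files in `example_configs/` for the actual schema:

```yaml
# Hypothetical config sketch; verify key names against example_configs/
use_case: general
model_id: knowledgator/gliner-x-base
default_entities:
  - person
  - organization
  - location
  - date
default_threshold: 0.5
port: 8080
metrics_enabled: true
frontend_enabled: true
```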
- FastAPI (API backend)
- Gradio (optional frontend)
- Uvicorn (ASGI server)
- Prometheus Client (metrics)
- Huggingface Hub (model loading)
- PyTorch (CPU/GPU inference)
- ONNX (optional, for ONNX models)
- uv (dependency management)
See LICENSE.