Requirements

A sementic segmentation model using PyTorch and torchvision. The API is build with FastAPI. server_main.py is the main file to run the server. client_main.py is the main file to run the client.

Requirements

The code has been tested on Ubuntu 24.04 with Python 3.10 with CPU inference.
Python 3.10+(python 3.10.6 is recommended)
python3.10 -m venv venv to create a virtual environment
source venv/bin/activate to activate the virtual environment
pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu126
pip install -r requirements.txt Install torch and torchvision outside requirements.txt for simplify Dockerfile building.

If you have any trouble installing requirements.txt, you can try to install the packages one by one. The main packages are:

torch, torchvision from https://pytorch.org/
fastapi
uvicorn
requests
huggingface, huggingface_hub
transformers pip install --no-cache-dir fastapi uvicorn requests huggingface transformers

Run without a api server

python main.py --image tests/images/cat_3.jpg

Run with a api server

Start the server

python server_main.py

In another terminal, run the client

python client_main.py --image tests/images/cat_3.jpg

Test concurrency

You can test the concurrency of the API server using the provided script produce_stats_concurrent_users.py. e.g., python tests/produce_stats_concurrent_users.py with the server running in the background.

Docker

You can build and run the Docker container using the provided Dockerfile.

docker build -t segmentation-api:cuda . docker run -p 8080:8000 --name cpu_inference segmentation-api:cuda possible arguments:

add -e MODEL=MODEL_NAME to set the model name
add -e DEVICE=cuda/cpu to set the device for the server
add -e WORKERS=NUM_WORKERS to set the number of workers for the server
add -e PORT=PORT_NUMBER to set the port number for the server
add -e HOST=HOST_ADDRESS to set the host address for the server

The GPU inference is not tested yet due to the lack of GPU resource.

or pull my docker image from docker hub

docker pull skyzhou323/segmentation-api:1.1
docker run -p 8088:8000 --name cpu_inference skyzhou323/segmentation-api:1.1
then run the client from another terminal python client_main.py --image tests/images/cat_3.jpg --port 8088

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Requirements

Run without a api server

Run with a api server

Start the server

In another terminal, run the client

Test concurrency

Docker

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
api		api
config		config
inference		inference
models		models
tests		tests
utils		utils
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
client_main.py		client_main.py
main.py		main.py
requirements.txt		requirements.txt
server_main.py		server_main.py

mxz2013/segmentation_api

Folders and files

Latest commit

History

Repository files navigation

Requirements

Run without a api server

Run with a api server

Start the server

In another terminal, run the client

Test concurrency

Docker

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages