Deployment and Optimization of DeepSeek-OCR
High-Performance GPU Inference Setup | CUDA 11.8 | vLLM 0.8.5
Minimum Requirements for DeepSeek-OCR
Component
Minimum
Recommended
Software Requirements
Verify GPU

Install Miniconda
Create Conda Environment

Install PyTorch (CUDA 11.8 build)

Clone DeepSeek-OCR and install requirements
After Cloning DeepSeek-OCR You Will See These Files

Install CUDA Toolkit 11.8 (Required for FlashAttention)


Install GCC-11 (required for CUDA 11.8 build tools)
Install FlashAttention

Make CUDA & GCC Paths Permanent in Conda Env
Install vLLM 0.8.5 (CUDA 11.8 wheel)

Prepare input / output paths and a sample image

Run DeepSeek-OCR
Last updated