Own our AI models end-to-end: dataset strategy, training/fine-tuning, evaluation, optimization, and deployment to real hardware in the field.

* Hands-on experience with audio/video AI: Hugging Face Transformers, SpeechBrain or torchaudio, and one detection/pose stack (YOLO, MediaPipe).

You'll turn messy real-world streams into robust, low-latency inference pipelines.

What you'll do

* Design, train, and fine-tune text/audio/vision models (e.g., DistilRoBERTa, wav2vec2, YOLO) for threat and aggression detection.
* Build reproducible training pipelines (HF/PyTorch/SpeechBrain), including PEFT/LoRA, adapters, and transfer learning.
* Optimize for real-time inference: quantization, pruning, ONNX/TensorRT, mixed precision, batching, caching.
* Ship models to edge and cloud with CI/CD, versioning, and rollback; instrument latency and accuracy SLAs.
* Create d