AI Operations and Inference Specialist
Sök senast
Datum: 8 september, 2025 Tid: 23:59
Placering: DESY
Mer information
About your role:
- Evaluate, test, and implement AI frameworks, tools, workflows, and infrastructures suited for operating locally hosted generative AI services (ChatBot and inference solutions)
- Design and configure inference backends, ensuring optimal deployment configurations and performance
- Integrate into existing HPC based GPU infrastructure the appropriate front-end and back-end technologies for providing robust, user-friendly, and reliable generative AI applications
- Conduct performance assessments and optimization, leveraging available local high-performance computing (HPC) and powerful GPU-based infrastructure
- Provide operational support, including troubleshooting, debugging, and enhancing service reliability across various AI models and authentication frameworks
- Collaborate with other team members, and document implemented systems and operational processes