Tata Consultancy Services | Artificial Intelligence Engineer (LLM & RAG) | São Paulo

Job Details

We are looking for an experienced AI Engineer specializing in Retrieval-Augmented Generation (RAG) to build and optimize hybrid AI solutions leveraging Large Language Models (LLMs). This role involves working with cutting-edge language models and retrieval systems to deliver highly accurate, context-aware, and responsive AI applications. You'll collaborate with cross-functional teams to develop scalable solutions that enhance information retrieval, comprehension, and generation capabilities in real-world applications.

Key Responsibilities:

- Design, develop, and deploy hybrid RAG architectures integrating LLMs with retrieval-based systems for improved relevance and contextual responses.
- Fine-tune and optimize large language models, enhancing their performance and adaptability to domain-specific requirements.
- Implement and manage RAG pipelines that effectively combine retrieval mechanisms with generative capabilities, ensuring high accuracy and efficiency.
- Develop custom plugins, adapters, or APIs to integrate retrieval systems (e.g., Elasticsearch, FAISS) with generative models, facilitating seamless information retrieval.
- Monitor and troubleshoot issues within RAG pipelines, fine-tuning retrieval parameters and model hyperparameters to optimize performance.
- Work closely with data engineers to manage and preprocess large datasets for training, ensuring high-quality and diverse data coverage.
- Evaluate and benchmark the performance of RAG solutions, using metrics such as response accuracy, latency, and user satisfaction.
- Stay up-to-date with advancements in NLP, LLMs, and RAG methodologies, continually improving existing architectures and recommending new techniques.

Qualifications:

- Bachelor's or Master's degree in Computer Science, Artificial Intelligence, or a related field, or equivalent practical experience.
- Experience in AI/NLP, with a focus on LLMs, transformer-based architectures, and retrieval systems.
- Proven experience building and deploying RAG solutions or other hybrid AI architectures.
- Strong understanding of information retrieval methods, including dense retrieval, sparse retrieval, and embeddings-based techniques.
- Proficiency in Python, TensorFlow or PyTorch, and experience with libraries and tools related to LLMs, such as Hugging Face Transformers.
- Familiarity with retrieval frameworks like Elasticsearch, FAISS, or OpenSearch.
- Knowledge of prompt engineering, fine-tuning, and deployment of language models for production environments.
- Strong analytical skills, with experience in optimizing LLM and retrieval model performance.
- English required.

Preferred Skills:

- Experience with cloud services and infrastructure (AWS, GCP, Azure) and MLOps tools for model deployment and monitoring.
- Contributions to open-source RAG projects or experience working with OpenAI, LangChain, or similar frameworks.
- Knowledge of vector databases, memory-augmented networks, and distributed systems.


Nominal Salary: To be agreed

Source: Sercanto_Ppc


