Wealth - Lead - GenAI Testing and Evaluation Framework - Vice President job opportunity at Citibank.



bot
Citibank Wealth - Lead - GenAI Testing and Evaluation Framework - Vice President
Requires: 1-year - XP
Pattern: full-time
apply Apply Now
Salary:
Status:
Copy Link Report
Technical Certificate
New York, United States Of America
New York....United States Of America

We are seeking an #innovative and #detail-oriented professional to lead the development and management of the Generative AI (GenAI) testing and evaluation framework. This role focuses on creating patterns, methodologies, and iterative structures to optimize the performance and effectiveness of GenAI models, with a particular emphasis on prompt engineering and evaluation. The ideal candidate will have a strong background in GenAI, a deep understanding of natural language processing, and a passion for refining AI solutions through rigorous testing and iteration. __ Framework Development: Design and implement a comprehensive testing and evaluation framework for GenAI model outputs. Develop standards and patterns for assessing the quality and "goodness" of prompts across diverse use cases. Create iterative processes for testing and refining prompts to optimize model outputs. __ Prompt Engineering and Evaluation: Establish criteria for evaluating prompt performance, including accuracy, completeness, relevance, coherence, and alignment with desired outcomes. Experiment with prompt structures to identify optimal configurations for various business applications. Develop and document best practices for prompt design and refinement. __ Collaboration and Integration: Work closely with tech partners, engineers, and product teams to ensure testing frameworks integrate seamlessly into the development lifecycle. Partner with stakeholders to understand business requirements and tailor testing methodologies to address specific needs. Provide actionable insights and recommendations to improve model performance based on evaluation results. __ Tooling and Automation: Identify and implement tools for automating the testing and evaluation process. Develop dashboards and reporting mechanisms to monitor prompt and model performance metrics. Stay updated on emerging tools and techniques in AI testing and integrate them into the framework. __ Continuous Improvement: Establish feedback loops to iteratively improve testing methodologies and evaluation standards. Establish process for ongoing monitoring of prompts, once productionalized. Monitor industry trends and advancements in Generative AI to ensure the framework remains cutting-edge. Advocate for a culture of experimentation and continuous learning within the organization.

Other Ai Matches

Banking, Investment Banking, Placement Analyst Internship, Frankfurt, Germany 2026 Applicants are expected to have a solid experience in handling Administration related tasks
Wealth Relationship Manager - Poway, CA Applicants are expected to have a solid experience in handling Management related tasks
Wealth Relationship Manager SAFE Act - Key Biscayne Applicants are expected to have a solid experience in handling surveillance related tasks
SUPER DAY KYC Operations Analyst Applicants are expected to have a solid experience in handling Analyst related tasks
Vice President Mergers & Acquisitions Applicants are expected to have a solid experience in handling Administrative related tasks
Stress Testing 2nd Line of Defense Lead Analyst Applicants are expected to have a solid experience in handling Analyst related tasks
Risk Reporting Senior Analyst Applicants are expected to have a solid experience in handling Analyst related tasks
Wealth - Lead - GenAI Testing and Evaluation Framework - Vice President Applicants are expected to have a solid experience in handling Lead related tasks
Associate Banker - Taiwan Offshore Team Applicants are expected to have a solid experience in handling banking related tasks
Data Quality Lead Analyst Applicants are expected to have a solid experience in handling Analyst related tasks
Banking - Investment Banking, Summer Analyst, Korea, Republic of – APAC, 2026 Applicants are expected to have a solid experience in handling Analyst related tasks
Scorecard Design and Management Analyst - AVP, C12 - Banking & International, Pune Applicants are expected to have a solid experience in handling Management Finance related tasks
Wealth Relationship Manager SAFE Act Applicants are expected to have a solid experience in handling Manager related tasks
Wholesale Credit Risk Stress Testing Lead, Senior Vice President Applicants are expected to have a solid experience in handling Management related tasks
Head of Markets Sales - Australia and New Zealand Applicants are expected to have a solid experience in handling Sales related tasks
Assistant Vice President, FX Settlement Apply Applicants are expected to have a solid experience in handling Operations related tasks
Sr Java Developer - Spring Kafka - Assistant Vice president Applicants are expected to have a solid experience in handling Developer related tasks
Assistant Vice President, FX Specialist (Treasury), Consumer Banking Applicants are expected to have a solid experience in handling Management related tasks
IRRBB Senior Lead Analyst - SVP Applicants are expected to have a solid experience in handling IRRBB Senior Lead Analyst - SVP related tasks
Underwriter - C13 - SHANGHAI Applicants are expected to have a solid experience in handling underwriter related tasks
Senior Linux System Admin Applicants are expected to have a solid experience in handling Manager related tasks
Model/ Analysis /Validation Officer Applicants are expected to have a solid experience in handling Officer related tasks
Wealth Group Executive - San Francisco, CA Applicants are expected to have a solid experience in handling Executive related tasks