Senior ML Engineer - Agentic AI
SS&C Technologies, Inc. | |
parental leave, paid time off, paid holidays, sick time, 401(k)
| |
United States, Massachusetts, Waltham | |
Apr 16, 2026 | |
|
As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology. Job Description Senior ML Engineer - Agentic AI Location: Waltham, MA(Hybrid) Get To Know the Team You'll be joining a collaborative, fast-moving team of data scientists, AI engineers, machine learning engineers, and data engineers who work together to tackle complex, high-impact problems at the intersection of AI and enterprise software. We operate with a genuinely agile mindset - shipping iteratively, challenging assumptions, and staying close to the cutting edge. The team is proactive about research, consistently evaluating and adopting state-of-the-art methodologies, and we encourage everyone to experiment, share findings, and bring new ideas to the table. If you thrive in an environment where intellectual curiosity is the norm and the work is always evolving, you'll fit right in. Why You Will Love It Here!
What You Will Get to Do: Agent Design & Implementation
Terminal-Based AI Coding & Development
Model Context Protocol (MCP) & Tool Integration
CI/CD Pipeline
LLM Experimentation & Evaluation
Memory & Retrieval Systems
What You Will Bring Strong Python Engineering- Expert-level Python skills across OOP, async programming, testing, and packaging, with the ability to write clean, modular, production-grade code. Familiarity with JavaScript/TypeScript is a plus for UI integrations or edge agent work. AI & LLM Expertise- A solid foundation in how large language models work, including transformer architecture, tokenization, context windows, and prompting strategies, paired with hands-on experience using LLM APIs from providers like OpenAI, Anthropic, Hugging Face, and Ollama. You understand embeddings, vector similarity search, and RAG pipelines and know how to apply them in production systems. Agentic AI Development- Proven experience designing and building autonomous AI agents using frameworks such as LangChain, LangGraph, LlamaIndex, CrewAI, AutoGen, and Semantic Kernel. You're fluent in agent design patterns - ReAct loops, chain-of-thought planning, hierarchical task decomposition, and multi-agent coordination - and can translate complex business workflows into reliable agentic solutions. MCP & Tool Integration- Familiarity with Model Context Protocol architecture and the ability to build and extend MCP servers using FastMCP to expose enterprise tools, databases, and APIs as structured, agent-callable capabilities with well-defined schemas and invocation patterns. Framework-Level Customization & Source Code Engineering- The ability to go beyond standard library usage and work directly at the source code level of AI frameworks. This includes forking and modifying open-source agent libraries, fusing capabilities across frameworks - such as integrating LangGraph's ReAct agent architecture with the deep planning capabilities of OpenCode-style coding agents to produce a hybrid base agent - and packaging those modifications cleanly for team-wide reuse. You know how to navigate unfamiliar codebases quickly, make targeted changes without breaking upstream compatibility, and contribute back where appropriate. Custom Agent Packaging & CI/CD Deployment- Experience containerizing heavily customized agent builds - including forked or modified third-party libraries - into reproducible Docker images and shipping them through automated CI/CD pipelines (GitHub Actions, Jenkins). You can manage dependency pinning, versioning, and environment parity across dev, staging, and production, ensuring that bespoke framework modifications are treated as first-class, production-grade software rather than one-off hacks. Memory & Retrieval Systems- Hands-on experience with vector databases (PgVector, Milvus) and the ability to design short-term, long-term, and episodic memory architectures that improve agent continuity and personalization over time. Data Engineering & Databases- Working knowledge of relational (PostgreSQL), NoSQL (MongoDB), and graph databases (Neo4j), along with experience building ETL pipelines to maintain agent knowledge bases. DevOps & Deployment- Comfort containerizing and deploying agent services using Docker and Kubernetes, building CI/CD pipelines with Jenkins, and working with cloud ML platforms such as AWS. Evaluation & Observability- Experience defining and running evaluation frameworks - using tools like LangFuse or custom harnesses - to measure retrieval accuracy, tool-selection precision, hallucination rates, and end-to-end task completion, then using those insights to drive continuous improvement. Security & Governance- Awareness of secure coding practices, IAM and token-based authentication, prompt injection mitigations, and data encryption, with a mindset of embedding compliance into every layer of agent design. Curiosity & Learning Agility- A genuine habit of staying at the cutting edge - reading research, experimenting with emerging models, and building side projects. You're comfortable with ambiguity, thrive in fast-moving environments, and bring new ideas to the table without waiting to be asked. Hands on experience in below stack: Agent Frameworks (e.g., OpenCode, Claude Code, LangChain, LangGraph, LlamaIndex, CrewAI, AutoGen) provide abstractions for prompt chaining, tool integration, planning loops, and multi-agent coordination, making them essential for building complex agent workflows. Models / LLMs - including open-source LLMs with speculative decoding, constraint decoding, router mode, and similar techniques - serve as the reasoning engine for agents. The right choice depends on cost, latency, context size, and task complexity, with fine-tuned or domain-specific models available for specialized use cases. Memory Stores such as Milvus and PgVector are vector databases that store embeddings for retrieval-augmented generation (RAG), alongside document stores like ElasticSearch and S3 for raw data persistence. Databases including PostgreSQL, Milvus, MongoDB, MySQL, and Neo4j serve as structured and graph data sources for agent lookups, status checks, and relationship-aware reasoning. Tool Access / Services - spanning MCPs, FastMCP, REST APIs, AWS Lambda, Microservices, and Internal SDKs - form the agent action layer, triggering business logic, external APIs, and microservices through unified access via Model Context Protocol (MCP) wrappers. Orchestration, handled by a Custom Orchestration Framework, manages multi-step workflows and multi-agent coordination with retry logic and reliable execution. Deployment technologies like Docker, Kubernetes, and Serverless host and scale agents, with serverless suited for stateless or bursty tasks and Kubernetes for stateful or GPU workloads. CI/CD & Infra tools - Jenkins, Helm, and Git - automate the testing and deployment of agent services while managing infrastructure and secrets as code. Monitoring solutions such as LangFuse and Custom SDKs capture logs, metrics, and traces of agent behavior, providing observability into prompt outputs, tool usage, latencies, and errors. Single-Agent Pipeline is the simplest pattern, where one agent handles an end-to-end task (input LLM tools output), often as an iterative RAG setup. It is ideal for narrow, well-defined tasks like FAQ bots or basic assistants, and is the easiest pattern to develop and debug. Multi-Agent Orchestration involves multiple specialized agents coordinating together - one retrieves data, another analyzes it, and a supervisor routes tasks - in either sequential or hierarchical arrangements. This pattern suits complex workflows such as data analysis combined with report generation, and improves modularity, scalability, and maintainability. Serverless (Function) Agents deploy lightweight agent logic in serverless functions like AWS Lambda or Cloud Run, scaling automatically with demand. They are best for stateless or event-driven tasks requiring quick API responses, offering rapid deployment with minimal infrastructure overhead. Containerized Microservices run each agent in its own container managed on Kubernetes, enabling persistent, stateful agents with custom runtimes and GPU access. This pattern is suited for heavy ML workloads, GPU acceleration, low-latency requirements, and complex dependency management. On-Premises / Edge Deployment hosts agents on private hardware or edge devices where data cannot leave the environment. This is required for sensitive industries such as healthcare and defense, and reduces cloud costs for constant, high-throughput workloads. Hybrid Model combines cloud infrastructure for scale with on-premises or edge deployment for data privacy and compliance. It is ideal when sensitive data must remain on premises while still leveraging cloud resources for heavy computation. Thank you for your interest in SS&C! To further explore this opportunity, please apply through our careers page on the corporate website at https://www.ssctech.com/careers/join-ssc #LI-PE1 #LI-HYBRID Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. SS&C Technologies offers a comprehensive total rewards package designed to support your wellbeing, growth, and future. Our benefits include medical, dental, and vision coverage; a 401(k) plan with company match; paid time off, holidays, and parental leave; and professional development reimbursement opportunity. Actual base salary will vary based on several factors, including but not limited to relevant skills, prior experience, education, demonstrated performance, and geographic location. Massachusetts: The expected base salary for the position is between 140,000 USD to 150,000 USD.Applications will be accepted on an ongoing basis until the position is filled. SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws. | |
parental leave, paid time off, paid holidays, sick time, 401(k)
Apr 16, 2026