Principal AI Research Engineer - Redis
San Francisco Bay Area, CA
About the Job
Who we are
We're Redis. We built the product that runs the fast apps our world runs on. (If you checked the weather, used your credit card, or looked at your flight status online today, you're welcome.) At Redis, you'll work with the fastest, simplest technology in the business—whether you're building it, telling its story, or selling it to our 10,000+ worldwide customers. We're creating a faster world with simpler experiences. You in?
Why would you love this job?
You will join our core AI team to research, design, and implement advancements in how companies run Large Language Models (LLMs) to optimize for relevance, latency, and privacy.
You will tackle complex challenges related to model optimization, fine-tuning, and tool development, impacting millions of users worldwide.
This is a unique opportunity to shape the future of AI in a hands-on research role.
What you'll do:
- Optimize and fine-tune LLMs or embedding models.
- Develop and enhance tools and frameworks built around model orchestration.
- Research, design, and implement state-of-the-art algorithms to optimize models at scale for latency and efficiency.
- Investigate complex NLP problems driven by customer use cases.
- Implement hardware-specific optimizations (e.g., GPUs, TPUs, CUDA).
What will you need to have?
- Experience in designing and developing machine learning models, especially in natural language processing.
- Deep understanding of transformer architecture, LLMs, and embedding models.
- Strong proficiency in Python, including experience with ML/NLP libraries (e.g., TensorFlow, PyTorch, Hugging Face Transformers).
- Experience with at least one low-level system language (c, c++, rust).
- Experience with CUDA and high-performance computing.
- Self-directed, ambitious, and eager to learn new skills and technologies.
- Master's or PhD in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
Extra great if you have:
- Publications or contributions to projects in AI/ML, NLP, or related areas.
- Experience with large-scale distributed training and optimization techniques.
- Contributions to open-source AI projects.
We give back to our employees:
Our culture is what makes Redis a fun and rewarding place to work. To support you at work and beyond, we offer all our US team members fantastic benefits and perks:
- Competitive salaries and equity grants
- Unlimited time off to promote a healthy work-life balance
- H/D/V coverage along with 401K, FSA, and Commuter Benefits
- Frequent team celebrations and recreation events
- Learning and development opportunities
- Ability to influence a high-performance company on its way to IPO
#LI-CA2 #LI Remote #LI-DNI
The estimated gross base annual salary range for this role is $179,500 – $269,825 per year in New York, California, Washington, Colorado, and Rhode Island. Actual compensation may vary and is dependent on various factors, including a candidate's work location, qualifications, experience, and competencies. Base annual salary is one component of Redis' total compensation and competitive benefits package, which may include 401(k), unlimited time off, learning and development opportunities, and comprehensive health and wellness benefits. This role may include discretionary bonuses, stock options, commuter benefits based on location, or a commission plan. Salary history is not used in compensation package decisions. Redis utilizes market pay data to determine compensation, so posted compensation ranges are subject to change as new market data becomes available.