Data - Data Scientist AI/ML Sr - Quadrant, Inc.
Woodlawn, MD 21207
About the Job
AI/ML Data Scientist (Sr)
Woodlawn, MD
MUST:
Must be able to obtain and maintain a US Public Trust clearance
Experienced Data Scientist
8+ years of experience as a Data Scientist
Experience with Natural Language Processing (NLP), Generative AI and Large Language Models (LLM) required
Fluency in Python programming, version control and collaboration with Git, standard Python packages (e.g., pandas, NumPy, Matplotlib), and ML frameworks required
Knowledge of TensorFlow, PyTorch, scikit-learn, NLTK, Azure ML (optional), Amazon Web Services EC2.
Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search
Expert knowledge in conducting data analysis and applying advanced statistical concepts and machine learning methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic models
Experience with ML model deployment and operations practices such as DevOps, MLOps, and LLMOps
Experience with NLP and Generative AI tooling, such as regular expressions, libraries like spaCy and LangChain, text annotation tools, and semantic frameworks
Experience with statistical and machine learning software such as pandas and scikit-learn
Prior experience working on applications that relate to the clinical domain
Ability to clean and process large amounts of real-world data
Experience retrieving and manipulating data from a variety of data sources, including Db2, Oracle, SQL Server, Hadoop, and flat files
Experience with SQL and database management systems, e.g., MySQL and SQLite
Either experience with, or the ability and willingness to learn, distributed processing via the Hadoop ecosystem, e.g., Spark, Impala, and Hive
Excellent analytical skills to identify potential risks and propose effective solutions
Prior experience with federal or state government IT projects preferred
Prior experience working on applications that relate to the clinical domain preferred
Experience working in an analytical research environment preferred
Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science, with industry experience in NLP, data science, and AI/ML/LLM engineering
DUTIES:
Stay current on new methods in NLP, ML, and Generative AI
Understand real-world challenges and develop automated data solutions
Develop, test, and deploy new techniques for NLP understanding
Develop and deploy ML and Generative AI approaches, such as Large Language Models (LLMs), at scale
Train and optimize NLP/LLM models and create Python-based pipelines
Determine the nature of analytic problems, evaluate options, and offer recommendations for resolution
Advise on the methods and data needed and/or available to evaluate the (intelligence or data) problem
Collaborate with data collectors and analysts to identify and close gaps on complex monitoring problems
Provide accurate, timely, complex, and sophisticated data analysis
Quadrant is an affirmative action/equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, status as a protected veteran, or status as an individual with a disability. Healthcare benefits are offered to all eligible employees according to compliance mandated by the Affordable Care Act.
Source: Quadrant, Inc.