Data Scientist - Agama Solutions
Pleasanton, CA
About the Job
Job Details: Data Scientist
Location: Pleasanton, CA
Top Skill:
Qualifications for Data Scientist Strong problem solving skills with an emphasis on product development.
Experience using statistical computer languages (R, Python, SQL, etc.) to manipulate data and draw insights from large data sets.
Experience working with and creating data architectures.
Knowledge of a variety of machine learning techniques (clustering, decision tree learning, artificial neural networks, etc.) and their real-world advantages drawbacks.
Knowledge of advanced statistical techniques and concepts (regression, properties of distributions, statistical tests anritaprikhodkod proper usage, etc.) and experience with applications.
Experience manipulating data sets and building statistical models, has a Master's or PHD in Statistics, Mathematics, Computer Science or another quantitative field, and is familiar with the following software tools: Coding knowledge and experience with several languages: C, C++, Java, JavaScript, etc.
Knowledge and experience in statistical and data mining techniques: GLM Regression, Random Forest, Boosting, Trees, text mining, social network analysis, etc
Experience querying databases and using statistical computer languages: R, Python, SLQ, etc. Experience using web services: Redshift, S3, Spark, , etc.
Experience creating and using advanced machine learning algorithms and statistics: regression, simulation, scenario analysis, modeling, clustering, decision trees, neural networks, etc
Experience analyzing data from 3rd party providers: Client Analytics, Site Catalyst, Coremetrics, Adwords, Crimson Hexagon, Client Insights, etc. Experience with distributed data/computing tools: Map/Reduce, Hadoop, Hive, Spark, Gurobi, MySQL, etc. Experience visualizing/presenting data for stakeholders using: Periscope, Business Objects, D3, ggplot, etc.
Top Daily Responsibilities:
1. Support Data-Science and other analytics as needed.
2. Develop SQL queries and data sets 3. Develop business and client facing reports
Skills a Top Candidate Should Have:
- Knowledge and experience in statistical and data mining techniques: GLM/Regression, Random Forest, Boosting, Trees, text mining, social network analysis, etc.
- Experience querying databases and using statistical computer languages: R, Python, SLQ, etc.
- Experience using web services: Redshift, S3, Spark, DigitalOcean, etc.
- Experience creating and using advanced machine learning algorithms and statistics: regression, simulation, scenario analysis, modeling, clustering, decision trees, neural networks, etc.
- Experience analyzing data from 3rd party providers: Client Analytics, Site Catalyst, Coremetrics, Adwords, Crimson Hexagon, Client Insights, etc.
- Experience with distributed data/computing tools: Map/Reduce, Hadoop, Hive, Spark, Gurobi, MySQL, etc.
- Experience visualizing/presenting data for stakeholders using: Periscope, Business Objects, D3, ggplot, etc.
Desired Skills:
- Strong problem solving skills with an emphasis on product development.
- Experience using statistical computer languages (R, Python, SQL, etc.) to manipulate data and draw insights from large data sets.
- Experience working with and creating data architectures.
- Knowledge of a variety of machine learning techniques (clustering, decision tree learning, artificial neural networks, etc.) and their real-world advantages/drawbacks.
- Knowledge of advanced statistical techniques and concepts (regression, properties of distributions, statistical tests and proper usage, etc.) and experience with applications.
- Excellent written and verbal communication skills for coordinating across teams.
- A drive to learn and master new technologies and techniques.
- We're looking for someone with experience manipulating data sets and building statistical models, has a Master's or PHD in Statistics, Mathematics, Computer Science or another quantitative field, and is familiar with software.
Skills:
1. Excellent Communication Skills.
2. Ability to work with business to gather report requirements.
3. Team player. Custom Job Description: If you have a custom job description that you would like to use. Please paste it here: Knowledge and experience with large data sets, event streams and distributed computing (Hive,Impala,Hadoop etc.) Ability to gather requirements and develop reports in tool selected by business and KPIT.