Data Scientist- Houston, TX - Georgia Tek Systems
Houston, TX
About the Job
Job Title: Data Scientist
Location: Houston, TX
Duration: Contract
Rate: DOE
Duties & Responsibilities
Knowledge, Skills, & Ability
Required:
Experience & Education
Required:
Location: Houston, TX
Duration: Contract
Rate: DOE
Duties & Responsibilities
- Design and train models for use in production environments
- Mine structured and unstructured data for patterns
- Utilize data from databases, historians, and/or data lakes
- Rigorously build, analyze and compare machine learning or statistical models; there is a strong emphasis on programming using the most popular machine learning languages such as Python
- Work with application developers to develop data-analytics products that are deployed to end-users as part of packaged solutions
- Visualize and report findings of deployed data analytics solutions to provide insights to the organization and our customers
- Deploy machine learning models and integrate model predictions in business
- Setup infrastructure for machine learning, model deployment
- Deploy CI/CD framework to frequently deliver code/features to production
Knowledge, Skills, & Ability
Required:
- Expertise in predictive modeling, machine learning and statistics
- Software development skills in one or more high level languages (Python/Java/R/Scala).
- Experience using one or more of the following common Client software packages: scikit-learn, TensorFlow, NumPy, pandas, jupyter
- Well-versed in machine learning algorithms and their suitability for solving various problems, such as Regression, Bayesian, Support Vector Machines, Decision Trees, Random Forest, Clustering, Neural Networks
- Experience working in Linux is an advantage
- Experience with building and deploying data pipelines
- Good communication skills
- Ability to work as a team member in a fast-paced environment
- Knowledge of DataOps
- Experience in using SQL/No SQL database
- Experience with Big Data technologies such as Hadoop, Hive, Spark, Cassandra
- Knowledge of data versioning tools such as git, DVC
- Knowledge of Client environments such as MLflow, databricks
- Knowledge of Client deployment tools such as Kubeflow, Kubernetes
- Experience with Agile software development processes
- Experience with Cloud service offerings from AWS, Azure or GCP
Experience & Education
Required:
- B.S. or higher in Engineering, Mathematics, Statistics or Computer Science with significant experience in data analytics; overall relative experience should be around 6-8 years range
- MS degree with 5+ years' relative experience is preferred
Source : Georgia Tek Systems