Data Scientist
About us:
We are a startup based in the Bay Area, focusing on providing data science solutions
to the entertainment industry. We help studios understand ever-changing audience
preferences and produce even more exciting and relevant movies and TV shows. Our
solutions and services help studios optimize production costs. We make a difference
in every aspect of filmed entertainment – ideation, production, marketing, distribution – using data science.
Primary Purpose:
We are seeking a talented Data Scientist, preferably proficient in both Python and R.
As a Data Scientist, you will analyze data and propose, develop, and deploy data products
and solutions that address the business needs of the entertainment industry. Your insights,
apps, and recommendations will inform crucial stakeholder decisions. Stakeholders will use
the solutions you build to understand audience engagement, optimize production costs, drive content strategy, help greenlight movies, make TV shows successful, and much more.
Responsibilities:
- Design, develop, test, and deploy ML models that drive applications and APIs.
- Work with stakeholders to understand the business requirements and develop a data science solution to address those requirements.
- Write clean, efficient, and well-documented code, and conduct code reviews.
- Containerize the models that you have developed.
- Deploy those models in the cloud.
- Use an MLOps pipeline to serve models.
- Use SQL to extract desired data from Snowflake, Redshift, etc.
- Ensure that your models and insights satisfy the business objectives – rapid execution is paramount.
Qualifications:
- Foremost, you are curious! You have a burning desire to investigate data for the what, why, when, and how.
- B.S. in Computer Science, Statistics, Data Science, or a related quantitative field.
- You are proficient in Python and R.
- You are good at statistics and linear algebra.
- Familiarity with, or experience in, building Shiny or Dash applications is desirable.
- Strong proficiency with ML frameworks such as TensorFlow, PyTorch, JAX, and scikit-learn, and with gradient boosting libraries (LightGBM, XGBoost, CatBoost, etc.).
- Production experience implementing machine learning pipelines and models at scale in Python.
- Experience or strong interest in working with cloud computing systems (AWS and Google Cloud).
- Experience with Docker and building containerized data and model workflows is desirable.
- You have demonstrated excellence in international ML/AI competitions.
- You understand the importance of time and timeliness.
- Excellent written and verbal communication skills.
- Passion for data-driven research, development, and experimentation.
- Self-motivated, growth-oriented, and driven to pursue solutions to challenging problems.
Location:
Flexible – US & India.