Description:
We are looking for a Data Scientist with more than 2 years experience, who will be mainly responsible for working on a project related to Machine Learning, Natural Language Processing and Computer Vision.
Requirements
- 2-4 years of experience in Data Science, Machine Learning, and Artificial Intelligence
- BS or preferable MS degree in CS, SE, IT, EE, DS, AI, or a related field
- Strong Python programming, problem-solving, and algorithm design skills
- Broad knowledge and experience in various areas of Data Science and AI
- Expertise in Machine Learning and Deep Learning theory and implementation (SciKit-Learn, TensorFlow, Keras, PyTorch, Huggingface-transformers)
- Proficiency in Data Wrangling using Pandas, SQL, Polars, Excel, and PySpark
- Strong skills in Data Analysis & Statistics, and Data Visualization (Matplotlib, Seaborn, Plotly, Tableau)
- Knowledge of Natural Language Processing theory and implementation (SpaCy, NLTK, Huggingface-transformers)
- Experience with Large Language Models (LLMs), Prompt Engineering, Retrieval Augmented Generation, LLM Fine-Tuning
- Expertise in Computer Vision theory and implementation (OpenCV, SkImage, DL libraries)Knowledge of Time Series Analysis & Forecasting (Sktime, Prophet, pmdarima, statsmodels)
- Familiarity with Data Engineering fundamentals, Data Modeling, and databases (PostgreSQL, MongoDB
- Experience with Cloud data storage and data warehouses (S3, Redshift, BigQuery)
Responsibilities
- Collaborate on project planning and execute data science and machine learning projects.
- Conduct data wrangling and exploratory analysis using tools like Pandas, SQL, Polars, Excel, and PySpark.
- Develop machine learning and deep learning models with libraries such as SciKit-Learn, TensorFlow, Keras, PyTorch, and Huggingface-transformers.
- Implement NLP techniques and work with language models like SpaCy, NLTK, and Huggingface-transformers.
- Apply LLM knowledge for applications, including prompt engineering and retrieval augmented generation.
- Utilize computer vision techniques with OpenCV, SkImage, and DL libraries for tasks like image classification, object detection, OCR, and image segmentation.
- Apply time series analysis and forecasting using libraries like Sktime, Prophet, pmdarima, and statsmodels.
- Conduct in-depth data analysis and visualization using Matplotlib, Seaborn, Plotly, and Tableau.
- Apply data engineering fundamentals, contribute to data modeling efforts, and work with databases like PostgreSQL and MongoDB.
- Stay updated on the latest developments in data science, machine learning, and AI
- Propose innovative solutions and document code, methodologies, and findings.
- Effectively communicate complex technical concepts to non-technical stakeholders.
- Collaborate with team members, share knowledge, and contribute to a collaborative and learning-oriented environment.