Data Scientist
106 jobs · 6 sub-roles · 16 countries
Top Skills
% of postings mentioning each skill
| # | Skill | Category | Prevalence | Jobs |
|---|---|---|---|---|
| 1 | Python | Language | 88.7% | 94 |
| 2 | SQL | Protocol / API / Standard | 60.4% | 64 |
| 3 | R | Language | 36.8% | 39 |
| 4 | AWS | Cloud / Platform | 17.0% | 18 |
| 5 | scikit-learn | AI/ML Framework | 15.1% | 16 |
| 6 | PyTorch | AI/ML Framework | 15.1% | 16 |
| 7 | Tableau | Developer Tool | 14.2% | 15 |
| 8 | Azure | Cloud / Platform | 14.2% | 15 |
| 9 | TensorFlow | AI/ML Framework | 13.2% | 14 |
| 10 | GCP | Cloud / Platform | 13.2% | 14 |
| 11 | Pandas | Framework / Library | 12.3% | 13 |
| 12 | Spark | Data Platform | 10.4% | 11 |
| 13 | Java | Language | 9.4% | 10 |
| 14 | Kubernetes | Containers / Orchestration | 7.5% | 8 |
| 15 | PySpark | Framework / Library | 6.6% | 7 |
| 16 | Hadoop | Framework / Library | 6.6% | 7 |
| 17 | Git | Developer Tool | 6.6% | 7 |
| 18 | Docker | Containers / Orchestration | 6.6% | 7 |
| 19 | Databricks | Data Platform | 6.6% | 7 |
| 20 | C++ | Language | 6.6% | 7 |
| 21 | BigQuery | Database | 6.6% | 7 |
| 22 | Snowflake | Database | 5.7% | 6 |
| 23 | NumPy | Framework / Library | 5.7% | 6 |
| 24 | Airflow | DevOps / CI/CD | 5.7% | 6 |
| 25 | prompt engineering | AI/ML Concept | 4.7% | 5 |
| 26 | Scala | Language | 4.7% | 5 |
| 27 | Power BI | Developer Tool | 4.7% | 5 |
| 28 | MLflow | Developer Tool | 4.7% | 5 |
| 29 | LLMs | AI/ML Concept | 4.7% | 5 |
| 30 | agent orchestration | Technical Capability | 3.8% | 4 |
| 31 | Redshift | Database | 3.8% | 4 |
| 32 | PHP | Language | 3.8% | 4 |
| 33 | MLOps | DevOps / CI/CD | 3.8% | 4 |
| 34 | Looker | Developer Tool | 3.8% | 4 |
| 35 | Hive | Database | 3.8% | 4 |
| 36 | Google Cloud Platform | Cloud / Platform | 3.8% | 4 |
| 37 | CI/CD | DevOps / CI/CD | 3.8% | 4 |
| 38 | statistics | Technical Capability | 2.8% | 3 |
| 39 | dbt | DevOps / CI/CD | 2.8% | 3 |
| 40 | TypeScript | Language | 2.8% | 3 |
| 41 | LangChain | Framework / Library | 2.8% | 3 |
| 42 | Hugging Face Transformers | AI/ML Framework | 2.8% | 3 |
| 43 | C | Language | 2.8% | 3 |
| 44 | spaCy | Framework / Library | 1.9% | 2 |
| 45 | machine learning | AI/ML Concept | 1.9% | 2 |
| 46 | causal inference | AI/ML Concept | 1.9% | 2 |
| 47 | XGBoost | AI/ML Framework | 1.9% | 2 |
| 48 | Terraform | DevOps / CI/CD | 1.9% | 2 |
| 49 | S3 | Cloud / Platform | 1.9% | 2 |
| 50 | PostgreSQL | Database | 1.9% | 2 |
| 51 | Linux | Operating System | 1.9% | 2 |
| 52 | LightGBM | AI/ML Framework | 1.9% | 2 |
| 53 | Kubeflow | Containers / Orchestration | 1.9% | 2 |
| 54 | JAX | AI/ML Framework | 1.9% | 2 |
| 55 | Hugging Face | AI/ML Framework | 1.9% | 2 |
| 56 | GitHub | DevOps / CI/CD | 1.9% | 2 |
| 57 | GeoLift | Framework / Library | 1.9% | 2 |
| 58 | Gemini | AI/ML Framework | 1.9% | 2 |
| 59 | Cloud Run | Cloud / Platform | 1.9% | 2 |
| 60 | C# | Language | 1.9% | 2 |
| 61 | ARIMA | AI/ML Concept | 1.9% | 2 |
| 62 | AI tools | Developer Tool | 1.9% | 2 |
| 63 | vector databases | Database | 0.9% | 1 |
| 64 | uv | Technical Capability | 0.9% | 1 |
| 65 | shell scripting | Language | 0.9% | 1 |
| 66 | retrieval-augmented generation (RAG) | AI/ML Concept | 0.9% | 1 |
| 67 | reinforcement learning | AI/ML Concept | 0.9% | 1 |
| 68 | propensity score matching | AI/ML Concept | 0.9% | 1 |
| 69 | natural language processing | AI/ML Concept | 0.9% | 1 |
| 70 | mammalian cell culture | Technical Capability | 0.9% | 1 |
| 71 | embeddings | AI/ML Concept | 0.9% | 1 |
| 72 | econometric modelling | Technical Capability | 0.9% | 1 |
| 73 | dplyr | Framework / Library | 0.9% | 1 |
| 74 | double machine learning | AI/ML Concept | 0.9% | 1 |
| 75 | data.table | Framework / Library | 0.9% | 1 |
| 76 | data mining | Technical Capability | 0.9% | 1 |
| 77 | clinical outcome assessments (COAs) | Domain Knowledge | 0.9% | 1 |
| 78 | bioreactors | Developer Tool | 0.9% | 1 |
| 79 | anomaly detection | AI/ML Concept | 0.9% | 1 |
| 80 | agent scaffold architectures | Architecture | 0.9% | 1 |
| 81 | Weaviate | Database | 0.9% | 1 |
| 82 | Vertex AI | AI/ML Framework | 0.9% | 1 |
| 83 | Unix | Operating System | 0.9% | 1 |
| 84 | UNIX Shell scripting | Language | 0.9% | 1 |
| 85 | Trello | Developer Tool | 0.9% | 1 |
| 86 | Transformers | AI/ML Framework | 0.9% | 1 |
| 87 | Time-Series DB | AI/ML Concept | 0.9% | 1 |
| 88 | TF-IDF | AI/ML Concept | 0.9% | 1 |
| 89 | TF | AI/ML Framework | 0.9% | 1 |
| 90 | Synth | Framework / Library | 0.9% | 1 |
| 91 | Stata SE | Developer Tool | 0.9% | 1 |
| 92 | Spark SQL | Framework / Library | 0.9% | 1 |
| 93 | Scrum | Methodology | 0.9% | 1 |
| 94 | SciPy | Framework / Library | 0.9% | 1 |
| 95 | SageMaker | Cloud / Platform | 0.9% | 1 |
| 96 | Rust | Language | 0.9% | 1 |
| 97 | Robyn | AI/ML Framework | 0.9% | 1 |
| 98 | Ray | Framework / Library | 0.9% | 1 |
| 99 | Random Forests | AI/ML Concept | 0.9% | 1 |
| 100 | REST APIs | Protocol / API / Standard | 0.9% | 1 |
| 101 | RAG architectures | AI/ML Concept | 0.9% | 1 |
| 102 | RAG | AI/ML Concept | 0.9% | 1 |
| 103 | Qlik Sense | Developer Tool | 0.9% | 1 |
| 104 | Qlik | Developer Tool | 0.9% | 1 |
| 105 | QGIS | Developer Tool | 0.9% | 1 |
| 106 | PyWhy | Framework / Library | 0.9% | 1 |
| 107 | PyMC | AI/ML Framework | 0.9% | 1 |
| 108 | PyArrow | Framework / Library | 0.9% | 1 |
| 109 | Pub/Sub | Message Broker | 0.9% | 1 |
| 110 | Prophet | AI/ML Framework | 0.9% | 1 |
| 111 | Presto | Data Platform | 0.9% | 1 |
| 112 | Power Platform | Data Platform | 0.9% | 1 |
| 113 | Power Apps | Framework / Library | 0.9% | 1 |
| 114 | PostGIS | Database | 0.9% | 1 |
| 115 | Polars | Framework / Library | 0.9% | 1 |
| 116 | Palantir | Data Platform | 0.9% | 1 |
| 117 | Oracle | Database | 0.9% | 1 |
| 118 | OpenAI API | Protocol / API / Standard | 0.9% | 1 |
| 119 | Omni | Data Platform | 0.9% | 1 |
| 120 | Named Entity Recognition | AI/ML Concept | 0.9% | 1 |
| 121 | NLP | AI/ML Concept | 0.9% | 1 |
| 122 | Mode | Developer Tool | 0.9% | 1 |
| 123 | Microsoft Fabric | Data Platform | 0.9% | 1 |
| 124 | Meridian | AI/ML Framework | 0.9% | 1 |
| 125 | ML.NET | AI/ML Framework | 0.9% | 1 |
| 126 | MATLAB | Language | 0.9% | 1 |
| 127 | LlamaIndex | Framework / Library | 0.9% | 1 |
| 128 | LSTM | AI/ML Concept | 0.9% | 1 |
| 129 | LLM frameworks | AI/ML Framework | 0.9% | 1 |
| 130 | LC-MS/MS | Technical Capability | 0.9% | 1 |
| 131 | Kerberos | Security / Networking | 0.9% | 1 |
| 132 | Keras | AI/ML Framework | 0.9% | 1 |
| 133 | JavaScript | Language | 0.9% | 1 |
| 134 | JMP | Developer Tool | 0.9% | 1 |
| 135 | Hex | Developer Tool | 0.9% | 1 |
| 136 | Haystack | Framework / Library | 0.9% | 1 |
| 137 | Hadoop ecosystem | Data Platform | 0.9% | 1 |
| 138 | HPLC | Technical Capability | 0.9% | 1 |
| 139 | HDFS | Data Platform | 0.9% | 1 |
| 140 | Google Sheets | Developer Tool | 0.9% | 1 |
| 141 | Google Colab | Developer Tool | 0.9% | 1 |
| 142 | Google Cloud | Cloud / Platform | 0.9% | 1 |
| 143 | Golang | Language | 0.9% | 1 |
| 144 | GitLab | DevOps / CI/CD | 0.9% | 1 |
| 145 | GeoPandas | Framework / Library | 0.9% | 1 |
| 146 | GCP Vertex | Cloud / Platform | 0.9% | 1 |
| 147 | GC-MS | Technical Capability | 0.9% | 1 |
| 148 | GC | Technical Capability | 0.9% | 1 |
| 149 | Fivetran | Developer Tool | 0.9% | 1 |
| 150 | FTIR | Technical Capability | 0.9% | 1 |
| 151 | Excel | Developer Tool | 0.9% | 1 |
| 152 | ECOA | Protocol / API / Standard | 0.9% | 1 |
| 153 | DevOps | DevOps / CI/CD | 0.9% | 1 |
| 154 | Delta Lake | Database | 0.9% | 1 |
| 155 | Dataflow | Cloud / Platform | 0.9% | 1 |
| 156 | DORA metrics | DevOps / CI/CD | 0.9% | 1 |
| 157 | CrewAI | Framework / Library | 0.9% | 1 |
| 158 | Control-M | Developer Tool | 0.9% | 1 |
| 159 | Clustering | AI/ML Concept | 0.9% | 1 |
| 160 | Cloud Build | DevOps / CI/CD | 0.9% | 1 |
| 161 | CausalImpact | Framework / Library | 0.9% | 1 |
| 162 | CatBoost | AI/ML Framework | 0.9% | 1 |
| 163 | Cassandra | Database | 0.9% | 1 |
| 164 | C/C++ | Language | 0.9% | 1 |
| 165 | BigQuery ML | Data Platform | 0.9% | 1 |
| 166 | BentoML | DevOps / CI/CD | 0.9% | 1 |
| 167 | Bash | Language | 0.9% | 1 |
| 168 | Azure ML Studio | Cloud / Platform | 0.9% | 1 |
| 169 | Azure ML | Cloud / Platform | 0.9% | 1 |
| 170 | AutoGen | Framework / Library | 0.9% | 1 |
| 171 | Athena | Database | 0.9% | 1 |
| 172 | Argo | DevOps / CI/CD | 0.9% | 1 |
| 173 | Anthropic Model Context Protocol (MCP) | Protocol / API / Standard | 0.9% | 1 |
| 174 | Amazon Quicksight | Developer Tool | 0.9% | 1 |
| 175 | Agentic AI | AI/ML Concept | 0.9% | 1 |
| 176 | Agent based simulation | AI/ML Concept | 0.9% | 1 |
| 177 | AWS SageMaker | Cloud / Platform | 0.9% | 1 |
| 178 | API Platform | AI/ML Framework | 0.9% | 1 |
| 179 | A/B testing | Methodology | 0.9% | 1 |
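The Prevalence column appears to be each skill's job count divided by the 106 total postings, rounded to one decimal place. The sketch below reproduces a few rows under that assumption; the `prevalence` helper and the `sample` dictionary are illustrative names, not part of the source data.

```python
TOTAL_JOBS = 106  # total postings, taken from the page header


def prevalence(jobs: int, total: int = TOTAL_JOBS) -> str:
    """Return the share of postings mentioning a skill, as a one-decimal percentage."""
    return f"{jobs / total * 100:.1f}%"


# A few (skill, job count) pairs copied from the table above
sample = {"Python": 94, "SQL": 64, "R": 39, "Kubernetes": 8, "A/B testing": 1}

for skill, jobs in sample.items():
    print(f"{skill}: {prevalence(jobs)}")
```

Skills listed under several spellings (for example GCP, Google Cloud Platform, and Google Cloud) are counted as separate rows, so their percentages cannot simply be summed: the same posting may mention more than one variant.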
Sub-roles
| Sub-role | Jobs |
|---|---|
| data scientist | 85 |
| machine learning engineer | 11 |
| research scientist | 6 |
| científico/a de datos (Spanish: data scientist) | 2 |
| data scientist/analyst power bi | 1 |
| data science | 1 |