Job Description

About VOXVOX is a visionary company led by a single founder, currently leading the way in flashcall and telecom carrier services, transforming the way businesses communicate, authenticate and connect. As a hyper-growth company, VOX achieved over 25% YoY revenue growth last year and is aiming to reach $100M+ revenue this year. VOX is looking for a team of growth-driven individuals to take the company to new heights.VOX's cutting-edge technology and dedicated customer service team ensure that telcos and enterprises maintain secure, fast, and reliable connections while protecting their networks. VOX's promise of a hassle-free experience and superior customer support enables telcos and enterprises to focus on success. As a company, VOX focuses on solutions that monetize the assets of mobile network operators.Joining VOX offers the opportunity to work with the industry's leading technologies and help them stay ahead and continue to innovate with a comprehensive suite of flashcall and telecom carrier services. VOX is highly committed to providing its employees with a dynamic, forward-thinking work environment, competitive compensation and benefits, vacation and time-off packages, and stock options. This is a once-in-a-lifetime opportunity for highly ambitious individuals, as VOX plans to expand its solutions portfolio and go public in the next 3-5 years.About the RoleVOX is building a multi-tenant Customer Data Platform for mobile network operators across multiple countries. Our platform ingests billions of telecom events and transforms them into actionable insights, segmentation, scoring, and campaign activation.As a Data Scientist on the VOX CDP team, you will work across Spark-based large-scale analytics, telecom event modeling, classification and clustering, scoring systems, and audience intelligence features. You will leverage Iceberg/Nessie datasets and collaborate closely with Data Engineers and Product to build models that power user segmentation, sender profiling, and activation use cases.This is a role for someone excited by massive event data, ML at scale, and advanced behavioral modelling.ResponsibilitiesExploratory & Descriptive Analytics (Spark + Dremio)Analyze high-volume telecom datasetsBuild large-scale EDA workflows using Spark (batch + distributed analytics)Use Dremio to explore, validate, and aggregate Iceberg data efficientlyProduce actionable insights on user behavior, engagement, messaging patterns, and campaign outcomes Feature Engineering & Data Modeling (Python + Spark)Develop scalable feature sets for: Relevance scoring, Sender categorization, Engagement propensity, Audience quality/quantity modelling, Cohort analysisBuild reusable transformation pipelines on Spark that integrate directly into Iceberg tablesWork with Data Engineers to deploy feature pipelines into production environmentsML Model Development (Classification, Clustering, Scoring)Build models for telecom-specific use cases, including: Category prediction for senders, RFU scoring refinement, User-level behavioral segmentation, Anomaly detection on message activity, Propensity and engagement ML modelsSelect and implement appropriate ML techniques (tree-based models, embedding, clustering, graph-based grouping, etc.)Evaluate model performance with robust offline validation strategiesCampaign & Audience IntelligenceDevelop analytical models for campaign performance: Response modelling, Lift analysis, Control vs. exposed cohort evaluation, Confidence intervals and campaign impact scoringBuild audience scoring and relevancy models used directly in VOX’s segmentation engineWork with product teams to define intelligence features that help MNOs select the strongest audiencesModel Deployment & Operationalize (CI/CD + Kubernetes)Package models and feature pipelines for deployment in multi-tenant MNO clustersVersion and manage model releases via Git-based CI/CDEnsure reliable execution of batch scoring jobs on Kubernetes/SparkMonitor model health, drift, and performance across multiple deploymentsExperimentation & ValidationDesign and evaluate experiments (A/B, multivariate, holdout cohorts)Build frameworks for causal measurement in messaging and telecom campaigns.Validate assumptions using statistical tests and robust confidence intervals.Collaboration & Product DevelopmentWork closely with Data Engineers to ensure features and models are aligned with Iceberg/Nessie patternsCollaborate with Product to define new intelligence features in the VOX CDPSupport customer-facing teams with insights, findings, and data storiesEnsure models respect all PII and compliance rules across multi-tenant deployments.Requirements3+ years of experience as a Data Scientist, ML Engineer, or similar roleStrong Python skills (must-have) for modeling, feature engineering, and data analysisExperience working with distributed analytics using SparkStrong SQL skills and comfort working with Iceberg datasets via engines like Dremio or TrinoSolid background in machine learning (classification, clustering, time-series, scoring)Experience with model deployment, versioning, and CI/CD workflowsFamiliarity with building data products on top of large event datasetsUnderstanding of PII handling, compliance requirements, and secure data processingAbility to work in multi-environment, multi-deployment contexts (dev/test/prod + multiple MNOs)Nice to HaveExperience with telecom datasetsKnowledge of audience-building, relevancy scoring, or marketing activation modelsExperience with ML observe-ability (drift monitoring, model health checks)Understanding of Nessie branching workflows and Iceberg snapshot logicJoin the team and help shape the future of the telecom industry!

Job Application Tips

  • Tailor your resume to highlight relevant experience for this position
  • Write a compelling cover letter that addresses the specific requirements
  • Research the company culture and values before applying
  • Prepare examples of your work that demonstrate your skills
  • Follow up on your application after a reasonable time period