Dashmote is a start-up focusing on the next generation of data products powered by AI technology, with offices in Amsterdam (HQ) and Shanghai. We connect the offline and online worlds by decoding the digital footprint of
locations, allowing our enterprise clients to understand the market and make smarter decisions. Dashmote has ambitious plans in the upcoming years and therefore we need to make sure that we have the right people in place to put such plans into practice. Do you want to boost your career by contributing to Dashmote’s core product, used by some of the largest Fortune 500 companies? Then we're looking for you.
Role Description
We are seeking a talented and experienced Data Scientist to join our development team. The ideal candidate will have a strong background in data science, machine learning, and large language models (LLMs). This role will be pivotal in driving innovation and developing new products to enhance our offerings.
Main Responsibilities
Develop and implement large language models to support product innovation.
Analyze large datasets to extract meaningful insights and drive product decisions.
Collaborate with product managers, engineers, and other stakeholders to integrate data science solutions into new and existing products.
Stay updated with the latest advancements in AI, ML, and data science to improve our product capabilities continuously.
Design and conduct experiments to validate model performance and product hypotheses. Communicate findings and recommendations to both technical and non-technical audiences.
岗位职责
数据质量评估与改进:
科学研究现有数据质量现状,设计并实施提升策略,为未来业务升级提供数据支撑
数据匹配与整合:
实施两个数据源之间的实体匹配(Matching),确保数据对齐与一致性
负责CRM数据与公司内部数据的精准匹配与整合
信息提取与数据清洗:
从杂乱数据中提取有效信息(Portfolio),如在菜单数据中识别特定品牌(例如可口可乐)
运用NER技术,在无预设规则情况下自动识别数据中的命名实体(如识别菜单中的软饮料或啤酒)
预测与消歧:
利用数据分析预测各销售渠道或门店的销售潜力(Prospecting)
针对数据中的歧义问题开展消歧工作,提升数据准确性
大语言模型的应用与质量管理:
深度融合大语言模型于产品中,提升问答系统的回答质量,并对模型表现进行持续监控和优化
跨部门协作:
与产品经理、工程师及其他团队成员紧密合作,将数据科学解决方案高效融入产品和业务流程
Job requirements
Your background
Master’s in Computer Science, Data Science, Machine Learning, or related field
Proven experience in data science and machine learning, with a focus on large language models
Strong programming skills in Python and familiarity with data science libraries (e.g., TensorFlow, PyTorch, Scikit-learn)
Experience with natural language processing (NLP) and large language model frameworks
Excellent problem-solving skills and the ability to work independently and as part of a team
Strong communication skills and the ability to convey complex technical concepts to a non-technical audience