Frontiers is an award-winning open science platform and leading open access scholarly publisher.
We are one of the largest and most cited publishers globally. To date, our 200,000 freely available research articles have received more than 1 billion views and downloads and 2 million citations. Our journals span science, health, humanities and social sciences, engineering, and sustainability. And we continue to expand into new academic disciplines so more researchers can publish open access.
Be part of the publishing revolution and help us transform the way research is published, evaluated, and communicated to the world.
About the opportunity:
The Frontiers catalog is growing. Every month we need to handle more journals, articles, and research topics, process more data, and enable new features for our users.
We are looking for an enthusiastic and hands-on Data Scientist to help us empower Frontiers with highly precise data, suggestions, and predictions. You’ll also help us tackle complex problems and produce solutions that will propel open science forward.
The role involves working with data at scale, built from several external and internal sources, and applying data science models to it. We are therefore looking for talented individuals with hands-on experience in designing data-intensive applications and big data solutions, and with a good knowledge of all types of data storage.
Tech Stack & Key Requirements:
Experience with Python development.
Experience working with classifiers that scale well with a large number of samples (e.g. approximate kNN).
Experience working with Natural Language Processing (text classification, word and sentence embeddings, named entity recognition).
Knowledge of NLP libraries (NLTK, spaCy).
Knowledge of clustering algorithms that scale well with large numbers of samples (e.g. minibatch K-means, OPTICS, BIRCH).
Knowledge of recommender systems (e.g. matrix factorization).
Expertise in processing large datasets (e.g. via Spark, Azure Databricks) using large-scale data storage (e.g. Parquet on Azure Blob Storage, Databricks Delta).
Ability to prioritize.
Experience working in a team environment.
Familiarity with Agile frameworks.
Good English skills.
Your main responsibilities:
Design, implement, monitor, and optimize the data platforms that store key information powering our internal recommender systems and other ML pipelines.
Understand functional requirements in order to define the data science tools needed to solve the problem at hand.
Collaborate closely with the Machine Learning and Data Science teams in order to tackle larger problems.
Challenges you will face:
Apply machine learning and NLP methods to extract tokens belonging to specific categories (such as author names, organization names, and geographical locations) from unstructured text.
Create models that can tell apart authors with similar names (author disambiguation).
Create models that can classify research papers.
Evaluate our pipelines and come up with ways to improve their performance.
With more than 50 nationalities represented in our global team, you will work regularly with teammates in other countries, and with our community of researchers, editors, and authors from around the globe.
Our mission to create solutions for healthy lives also extends to the working environment we provide for our employees.
100% remote working
Employees now have the flexibility to choose where they want to work, with remote working available on a part- or full-time basis (not applicable to some Workplace/IT jobs, because these roles require on-site presence in the office).
Learning and development
All employees have access to LinkedIn Learning (and Pluralsight for our technology team), an annual personal learning budget, and dedicated L&D time.
We offer free online yoga classes, an employee assistance plan, access to the Headspace app, and four wellbeing days on top of your annual leave allowance.
Employees can dedicate three days each year to volunteer for a personal cause or through our volunteering partner platform, Alaya.
Frontiers actively embraces diversity and is a safe and welcoming workplace. Recruitment is free from discrimination, including discrimination based on race, national or ethnic origin, age, religion, disability, sex, gender identity, or sexual orientation. With over 600 employees from more than 50 different nations, our diversity creates vibrant teams and constantly challenges us to appreciate multiple perspectives.