Snorkel: rapid training data creation with weak supervision

Volume: 11, Issue: 3, Pages: 269 - 282
Published: Nov 1, 2017
Abstract
Labeling training data is increasingly the largest bottleneck in deploying machine learning systems. We present Snorkel, a first-of-its-kind system that enables users to train state-of-the-art models without hand labeling any training data. Instead, users write labeling functions that express arbitrary heuristics, which can have unknown accuracies and correlations. Snorkel denoises their outputs without access to ground truth by incorporating...
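To make the idea of a labeling function concrete, here is a minimal sketch in plain Python. It does not use the Snorkel API. The spam-detection task, the function names, and the label constants are all hypothetical. The outputs are combined by simple majority vote, which is a naive stand-in and not the denoising approach the abstract refers to.

```python
from collections import Counter

# Hypothetical label values for an illustrative spam task.
ABSTAIN, NEGATIVE, POSITIVE = -1, 0, 1


def lf_contains_free(text):
    """Heuristic: the word 'free' suggests spam."""
    return POSITIVE if "free" in text.lower() else ABSTAIN


def lf_many_exclamations(text):
    """Heuristic: three or more exclamation marks suggest spam."""
    return POSITIVE if text.count("!") >= 3 else ABSTAIN


def lf_short_greeting(text):
    """Heuristic: a short message opening with a greeting is probably not spam."""
    is_greeting = text.lower().startswith(("hi", "hello"))
    return NEGATIVE if is_greeting and len(text) < 40 else ABSTAIN


LABELING_FUNCTIONS = [lf_contains_free, lf_many_exclamations, lf_short_greeting]


def majority_vote(text):
    """Combine labeling-function votes; abstain when there are no votes or a tie."""
    votes = [lf(text) for lf in LABELING_FUNCTIONS]
    votes = [v for v in votes if v != ABSTAIN]
    if not votes:
        return ABSTAIN
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return ABSTAIN  # tie between labels
    return ranked[0][0]


if __name__ == "__main__":
    for message in ["FREE prize!!! Click now", "Hello, see you at lunch"]:
        print(message, "->", majority_vote(message))
```

Each function encodes one noisy heuristic and may abstain. Majority vote treats every function as equally reliable and independent, which is exactly the assumption the abstract says cannot be taken for granted, since the heuristics have unknown accuracies and correlations.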