October 20, 2020

Classifying SpaCy’s ORG Named Entities with Machine Learning

Recently, we were tagging a lot of texts with spaCy in order to extract organization names. We faced a problem: many entities tagged by spaCy were not valid organization names at all. And it wasn’t actually the problem of spaCy itself: all extracted entities, at first sight, did look like organization names. The result could be better if we trained spaCy models more. However, this approach required a large corpus of properly labeled data which should also include a proper context. So we needed a simpler solution to filter out the wrong data.

you might also like…
Oct 20, 2020

Automated Document Classifier Solution For Banking

Recently, we were tagging a lot of texts with spaCy in order to extract organization names. We faced a problem: many... Read more

Oct 20, 2020

Predictive Sales Analytics Tool for Special Offers Evaluation

Recently, we were tagging a lot of texts with spaCy in order to extract organization names. We faced a problem: many... Read more

Contact Us

  • Contact Details

    +380 63 395 42 00
    team@mindcraft.ai
    Krakow, Poland

    Follow us