Dr. Benjamin Wilson

CSO & Co-founder

I have no trouble getting up every morning. I feel fortunate to work with extremely talented colleagues on truly challenging applications of machine learning that will fundamentally improve the work lives of many.

Machine Learning

The arXiv as Dataset

The arXiv is a repository of over 1 million preprints. It is truly open access, and excellent for testing language modelling / machine learning prototypes.

Machine Learning

How do machines learn meaning?

Computers consist of on/off switches and process meaningless symbols. So how is it that we can hope that machines learn meaning of words and documents?

Machine Learning

The Unknown Perils of Mining Wikipedia

If a machine is to learn about humans from Wikipedia, it must experience the corpus as a human sees it and ignore the mass of robot-generated pages.

Machine Learning

Leveraging machine learning to discover research

By ignoring citation graphs and keywords, you can discover research and researchers you never knew existed. Check it out!

By clicking “Agree”, you agree to the storing of cookies on your device to enhance site navigation, analyse site usage, and assist in our marketing efforts. View our Privacy Policy for more information.

More Options Deny Agree

Dr. Benjamin Wilson

The arXiv as Dataset

How do machines learn meaning?

The Unknown Perils of Mining Wikipedia

Leveraging machine learning to discover research

Get into flow.