Extracting Media from HTML

Over the last decade, the web has moved rapidly from static pages with little interactive content to media-rich, dynamic pages. This can be attributed to new web technologies that make embedding multimedia content very simple. For example, adding a video in HTML5 is as easy as wrapping it in <video> tags.…

A Benchmark Comparison Of Content Extraction From HTML Pages

Content extraction is the task of separating boilerplate, such as comments, navigation bars, social media links and ads, from the main body of text of an article formatted as HTML. The main content typically accounts for only a small portion of a page’s source code (highlighted in red in the image below). Extraction is…

Summarising lists, a popular web content type.

One of the biggest challenges we’re trying to tackle at Skim Technologies is how to extract the most important information from a web page, in order to vastly improve the way people consume information in a mobile-first world… This post details some of the work we’ve been doing to automatically summarise lists. We…

A benchmark comparison of extractive summarisation systems

In this post, we report the results of a comparative evaluation of our Skim API against similar commercial and open-source extractive summarisation systems. Results indicate that our summarisation system consistently outperforms the systems it was benchmarked against in terms of ROUGE-N. The Information Age we are living in, fuelled by the advent of the World Wide Web…

Why we’ve changed from Scrum to Kanban…

In the two short years we’ve been developing software at Skim.it, we’ve always taken an agile approach to our work. It hasn’t always been easy, and now more so than ever – like true Agile ninjas – we’re questioning, testing and improving our processes. There’s a misguided belief among tech startups that the only way…

A brief introduction to summarisation

Our chief data scientist shares her thoughts on summarisation by answering some of the most pressing questions about this exciting discipline of information management.

Why you should stop running your own node server

In this post, Florian – our full-stack developer – reveals why he thinks the future is serverless…

Our mission

Skim’s mission is to empower people to use data more effectively and to demystify artificial intelligence. Rather than repeating the common narrative of machines replacing humans, we focus on how machines can help people have easier lives and better businesses.

Contact

London office
27 Finsbury Circus,
London EC2M 5NT

Portugal office

+44 207 129 7497
sales@skimtechnologies.com
