Article Highlighter

Auto Highlight is a Chrome extension that automatically highlights the important content on article pages.

Here’s a link to the extension:
https://chrome.google.com/webstore/detail/highlight/dnkdpcbijfnmekbkchfjapfneigjomhh

The source code is on GitHub:
https://github.com/dstein64/highlight

After installing the extension, a highlighter icon appears in the location bar. Clicking that icon highlights important content on the page.

1

The extension is based on a separate project that I was recently working on for an adaptive information systems course. For my project, I implemented a Chrome extension that learns a user model based on the sites a user visits. Then on new sites, content was highlighted based on the similarity to the user model. I called the project extension Persightlight, for “personalized highlighting”.

The personalization component still needs more work, and it added complexity to the code. Auto Highlight is based on Persighlight, but all the user modeling and personalization code has been removed. Auto Highlight uses a heuristic that gives higher weight to words that appear often, rather than depending on a user model. I’d like to improve Auto Highlight, adding ideas from recent NLP research.

Persighlight essentially did extraction-based automatic summarization, scoring sentences relative to a user model. The model was represented by a term frequency vector, as were sentence candidates. Candidates were scored based on their cosine similarity with the user model. I’d like to also continue work on Persighlight. It would be interesting to see how personalization with a user model can contribute to highlighting, and summarization more broadly.

This entry was tagged , , . Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *