CLOSED: [2015-05-09 Sat 22:46] :PROPERTIES: :CREATED: [2015-05-09 Sat 21:23] :ID: 2015-05-09-auto-tags :END: :LOGBOOK: - State "DONE" from "NEXT" [2015-05-09 Sat 22:46] :END: Today, I introduced a feature to [[https://github.com/novoid/lazyblorg][lazyblorg]] I love to present to you: auto-tags. The first auto-tag of lazyblorg is ... tadaaa: language detection. I was looking forward to this feature because I blog in English and German language. However, I am too lazy for manually tag meta-data like defining the (main) language of a blog entry. -------- Implementing language detection was rather straight-forward and not that hard. I retrieved lists of [[https://en.wikipedia.org/wiki/Stop_words][stop words]] for German and English languages. After parsing my Org-mode files, lazyblorg determines the percentages of language-specific stop words from the blog text. If there is a dominant value within the list of percentages, lazyblorg found the dominant text language. So from now on, all of my blog entries are tagged with "language:english" or "language:deutsch". Those tags are the result of my language detection algorithm. In future, I'd like to see separate [[https://en.wikipedia.org/wiki/Atom_%28standard%29][Atom]] feeds for each language so that, e.g., German-only people are able to follow my German articles. Also planned: auto-tags for other things like the size of the blog entry. This way, I am probably able to replace micro-blogging services like Twitter. Atom feeds for auto-tags like "size:small" could provide many cool features. Do you have ideas for other auto-tags that would enhance the reading experience? Please leave a comment below!