Changes between Initial Version and Version 1 of Justext_changelog


Ignore:
Timestamp:
10/09/18 15:41:45 (6 years ago)
Author:
admin
Comment:

Created from git log

Legend:

Unmodified
Added
Removed
Modified
  • Justext_changelog

    v1 v1  
     1Apr 11 2018
     2    Norwegian joined wordlist
     3Apr 11 2018
     4    More wordlists
     5Sep 11 2017
     6    Lowercased stoplist
     7Aug 24 2017
     8    New and updated wordlists
     9Aug 24 2017
     10    Justext 1.4
     11Aug 24 2017
     12    Web demo
     13Aug 24 2017
     14    max_good_distance, a new context classification parameter
     15    Maximum distance (in paragraphs) of a short paragraph from a good
     16    paragraph to re-classify the short paragraph as good.
     17Jun 30 2017
     18    Minor package updates
     19Jun 30 2017
     20    Justext 1.3
     21Jun 29 2017
     22    Preprocess split to get_html_root and preprocess_html_root
     23    Allows using the DOM root before the head (and other possibly useful
     24    elements) are removed. Needed to get the page title from the head.
     25Apr 12 2017
     26    new README
     27Apr 12 2017
     28    filter out HTML(5) elements
     29Feb 24 2017
     30    remove words containing Latin characters from Korean stoplist
     31Jan 12 2015
     32    Move * out of trunk/
     33Nov 11 2012
     34    Temporary workaround for issue #2: Remove any text nodes that cannot be decoded.
     35Jan 26 2012
     36    Added stoplists for Kazakh, Kyrgyz, Turkmen and Uzbek.
     37Dec 6 2011
     38    Fixed inserting spaces between text nodes. Before, content such as "abc<b>efg</b>" became "abc efg" after processing. Now it correctly becomes "abcefg".
     39Aug 8 2011
     40    jusText 1.2
     41Aug 8 2011
     42    Edited wiki page Algorithm through web user interface.
     43Aug 4 2011
     44    Use character counts instead of word counts where possible (length-low, length-high, max-heading-distance and for computing link density). This is to make the algorithm work well in the language independent mode (without a stoplist) for languages where counting words is not easy (Japanese, Chinese, Thai, etc). The default thresholds have been adjusted correspondingly.
     45Aug 4 2011
     46    More robust parsing of meta tags containing the information about used charset.
     47Jun 6 2011
     48    Bug fix: Corrected decoding of HTML entities &#128; to &#159;
     49Mar 28 2011
     50    Edited wiki page Algorithm through web user interface.
     51Mar 28 2011
     52    Edited wiki page Algorithm through web user interface.
     53Mar 23 2011
     54    Edited wiki page Algorithm through web user interface.
     55Mar 17 2011
     56    Edited wiki page Algorithm through web user interface.
     57Mar 9 2011
     58    Edited wiki page Algorithm through web user interface.
     59Mar 9 2011
     60    Edited wiki page Algorithm through web user interface.
     61Mar 9 2011
     62    Edited wiki page Algorithm through web user interface.
     63Mar 9 2011
     64    Edited wiki page Algorithm through web user interface.
     65Mar 9 2011
     66    Created wiki page through web user interface.
     67Mar 9 2011
     68    jusText 1.1
     69Mar 9 2011
     70    Initial import.