Changes between Version 45 and Version 46 of WikiStart


Timestamp:
08/21/25 14:07:47
Author:
admin
Comment:

change colors according to Ondrej's suggestion

  • WikiStart

    v45 v46  
    77<table style="border-spacing: 1em"><tr>
    88
    9 <td class="app" style="background-color:#000080 ; background-image:url('/chrome/site/justext_nb.png')">
     9<td class="app" style="background-color:#DDA0DD ; background-image:url('/chrome/site/justext_nb.png')">
    1010<p><a href="/wiki/Justext">
    1111JusText is a HTML boilerplate removal tool. It can strip navigation links, headers, footers, etc. from HTML pages and leave just regular text containing full sentences.</a><p>
     
    1919</td>
    2020
    21 <td class="app" style="background-color:#800000 ; background-image:url('/chrome/site/chared_nb.png')">
     21<td class="app" style="background-color:#87CEEB ; background-image:url('/chrome/site/chared_nb.png')">
    2222<p><a href="/wiki/Chared">
    2323Chared is a tool for detecting the character encoding of a text in a known language. It contains models for a wide range of languages.</a><p>
     
    3333</tr><tr>
    3434
    35 <td class="app" style="background-color:#800080 ; background-image:url('/chrome/site/spiderling_nb.png')">
     35<td class="app" style="background-color:#20B2AA ; background-image:url('/chrome/site/spiderling_nb.png')">
    3636<p><a href="/wiki/SpiderLing">Spiderling is a web spider for linguistics. It can crawl text-rich parts of the web and collect a lot of data suitable for text corpora.
    3737</a><p>
     
    4545</td>
    4646
    47 <td class="app" style="background-color:#008000 ; background-image:url('/chrome/site/onion_nb.png')">
     47<td class="app" style="background-color:#6B8E23 ; background-image:url('/chrome/site/onion_nb.png')">
    4848<p><a href="/wiki/Onion">
    4949Onion (ONe Instance ONly) is a de-duplicator for large collections of texts. It can measure the similarity of paragraphs or whole documents and drop duplicate ones based on the threshold you set.</a></p>
     
    5959</tr><tr>
    6060
    61 <td class="app" style="background-color:#808000 ; background-image:url('/chrome/site/unitok_nb.png')">
    62 <p><a href="/wiki/Unitok">
     61<td class="app" style="background-color:#9ACD32 ; background-image:url('/chrome/site/unitok_nb.png')">
     62<p style="color:white;"><a href="/wiki/Unitok">
    6363Unitok is a universal text tokeniser with specific settings for many languages. It can turn plain text into a sequence of newline-separated tokens (“vertical” format), while preserving XML-like tags containing metadata.</a></p>
    6464<p>
     
    7171</td>
    7272
    73 <td class="app" style="background-color:#008080 ; background-image:url('/chrome/site/noske_icon_logo_only_white.png')">
    74 <p><a href="http://nlp.fi.muni.cz/trac/noske">NoSketch Engine is the open-sourced little brother of the corpus querying system Sketch Engine.
     73<td class="app" style="background-color:#9932CC ; background-image:url('/chrome/site/noske_icon_logo_only_white.png')">
     74<p style="color:white;"><a href="http://nlp.fi.muni.cz/trac/noske">NoSketch Engine is the open-sourced little brother of the corpus querying system Sketch Engine.
    7575</a><p>
    7676<p>
    77 <a class="lnk" href="https://link.springer.com/article/10.1007%2Fs40607-014-0009-9">Paper</a>
     77<a class="lnk" href="https://link.springer.com/article/10.1007%2Fs40607-014-0009-9" style="color:white;">Paper</a>
    7878|
    79 <a class="lnk" href="/wiki/noske_cite">Cite</a>
     79<a class="lnk" href="/wiki/noske_cite" style="color:white;">Cite</a>
    8080|
    81 <a class="lnk" href="http://www.gnu.org/licenses/gpl2.txt">Licence</a>
     81<a class="lnk" href="http://www.gnu.org/licenses/gpl2.txt" style="color:white;">Licence</a>
    8282</p>
    8383</td>
     
    8686<tr>
    8787
    88 <td class="app black" style="background-color:#a7d7f9; background-image: url('/chrome/site/w2c_44.png');">
    89 <p><a href="/wiki/wiki2corpus">wiki2corpus is a script which downloads Wikipedia articles (for a given language) and outputs them in the form of prevertical which can be further processed by other corpus tools.
     88<td class="app black" style="background-color:#A52A2A; background-image: url('/chrome/site/w2c_44.png');">
     89<p><a href="/wiki/wiki2corpus" style="color:white;">wiki2corpus is a script which downloads Wikipedia articles (for a given language) and outputs them in the form of prevertical which can be further processed by other corpus tools.
    9090</a><p>
    9191
    9292|
    93 <a class="lnk" href="https://choosealicense.com/licenses/mit/">Licence</a>
     93<a class="lnk" href="https://choosealicense.com/licenses/mit/" style="color:white;">Licence</a>
    9494
    9595</td>
    9696
    97 <td class="app black" style="background-color:#ff1493; background-image: url('/chrome/site/noske_nb.png');">
    98 <p><a href="/wiki/languagefilter">Language Filter is a language discriminating tool. It works with the vertical format. The language of paragraphs and documents is determined according to pre-defined lists of words with corpus frequency.
     97<td class="app black" style="background-color:#191970; background-image: url('/chrome/site/noske_nb.png');">
     98<p><a href="/wiki/languagefilter" style="color:white;">Language Filter is a language discriminating tool. It works with the vertical format. The language of paragraphs and documents is determined according to pre-defined lists of words with corpus frequency.
    9999</a><p>
    100100<p>
    101 <a class="lnk" href="https://nlp.fi.muni.cz/raslan/raslan19.pdf#page=137">Paper</a>
     101<a class="lnk" href="https://nlp.fi.muni.cz/raslan/raslan19.pdf#page=137" style="color:white;">Paper</a>
    102102|
    103 <a class="lnk" href="/wiki/languagefilter/Cite">Cite</a>
     103<a class="lnk" href="/wiki/languagefilter/Cite" style="color:white;">Cite</a>
    104104|
    105 <a class="lnk" href="http://www.gnu.org/licenses/gpl2.txt">Licence</a>
     105<a class="lnk" href="http://www.gnu.org/licenses/gpl2.txt" style="color:white;">Licence</a>
    106106</p>
    107107</td>