Changes between Initial Version and Version 1 of Onion


Timestamp:
01/12/15 18:00:01 (10 years ago)
Author:
admin
Comment:

--

= Onion =
onion (ONe Instance ONly) is a tool for removing duplicate parts from large collections of texts.

== Installation ==

=== Prerequisites ===
* 64-bit CPU architecture
* libjudy (>=1.0.5)

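The prerequisites can be checked before building. The following is a minimal sketch, not part of the onion distribution: `find_judy_header` is a hypothetical helper, and the directories it is given are common defaults assumed for illustration. It does not verify the libjudy version.

```shell
# find_judy_header DIR...
# Print the first directory among the arguments that contains Judy.h.
# Returns non-zero if none of them does.
find_judy_header() {
    for dir in "$@"; do
        if [ -f "$dir/Judy.h" ]; then
            printf '%s\n' "$dir"
            return 0
        fi
    done
    return 1
}

# getconf LONG_BIT reports the width of a C long; 64 is expected here.
bits=$(getconf LONG_BIT 2>/dev/null || echo unknown)
echo "word size: $bits"

# Assumed default include paths, not taken from the onion documentation.
if inc=$(find_judy_header /usr/include /usr/local/include); then
    echo "Judy.h found in $inc"
else
    echo "Judy.h not found in the default paths; JUDY_INC must be set by hand"
fi
```

If the header turns up somewhere else, that directory is the value to use for JUDY_INC.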
=== Configuration and installation ===
1. Download the sources:
{{{
wget -O onion-1.2.tar.gz 'https://docs.google.com/uc?authuser=0&id=0B4SxKw5O_gLHUXZhOHBzUDNwcXM&export=download'
}}}
2. Extract the downloaded file:
{{{
tar xzvf onion-1.2.tar.gz
}}}
3. Configure the package by editing onion-1.2/Makefile.config:
  * set PREFIX (or INSTALL_BIN and INSTALL_DATA) according to where you want the executables and data (docs) installed
  * if you have libjudy installed in a non-standard path you need to:
    * set JUDY_INC to where Judy.h is located
    * set JUDY_LIB to where libJudy.a is located
4. Install the package (you may need sudo or a root shell for the last command):
{{{
cd onion-1.2/
make
make install
}}}


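The configuration step can also be scripted instead of edited by hand. This is a sketch under two assumptions: that Makefile.config uses plain `NAME=value` assignments (its actual layout is not shown here), and that the example paths stand in for your own. `set_make_var` is a hypothetical helper, not something shipped with onion.

```shell
# set_make_var FILE NAME VALUE
# Replace an existing "NAME=..." line in FILE, or append one if it is absent.
# Values must not contain '|' or '&', which are special in the sed expression.
set_make_var() {
    file=$1 name=$2 value=$3
    if grep -q "^${name}[[:space:]]*=" "$file"; then
        # '|' is the sed delimiter because the values are paths containing '/'.
        sed -i.bak "s|^${name}[[:space:]]*=.*|${name}=${value}|" "$file"
        rm -f "$file.bak"
    else
        printf '%s=%s\n' "$name" "$value" >> "$file"
    fi
}

# Demonstration on a scratch file standing in for onion-1.2/Makefile.config.
cfg=$(mktemp)
printf 'PREFIX=/usr/local\n' > "$cfg"
set_make_var "$cfg" PREFIX /opt/onion            # example install prefix
set_make_var "$cfg" JUDY_INC /opt/judy/include   # directory holding Judy.h
set_make_var "$cfg" JUDY_LIB /opt/judy/lib       # directory holding libJudy.a
cat "$cfg"
rm -f "$cfg"
```

To apply it to the real file, pass `onion-1.2/Makefile.config` as the first argument and review the result before running `make`.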
== Quick start ==
{{{
onion -s <documents.vert >deduplicated_documents.vert
}}}

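For repeated runs, the quick-start command can be wrapped so that a failed run never leaves a partial output file behind. This is a sketch only: `dedup_vert` and the `ONION` override variable are hypothetical names introduced here, and the only onion invocation used is the `onion -s` form shown above.

```shell
# dedup_vert INPUT OUTPUT
# Run "onion -s" on INPUT and write the result to OUTPUT.
# The output is written to a temporary file first and moved into place
# only if the command succeeds. ONION (hypothetical) overrides the
# executable name and defaults to "onion".
dedup_vert() {
    src=$1 dst=$2
    [ -r "$src" ] || { echo "cannot read $src" >&2; return 1; }
    [ "$src" != "$dst" ] || { echo "input and output must differ" >&2; return 1; }
    tmp="$dst.tmp.$$"
    if "${ONION:-onion}" -s < "$src" > "$tmp"; then
        mv "$tmp" "$dst"
    else
        rm -f "$tmp"
        return 1
    fi
}
```

It would be called as `dedup_vert documents.vert deduplicated_documents.vert`.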
There's also a usage example on a sample input.

For usage information see:
{{{
onion -h
man onion
}}}

== Acknowledgements ==
This software has been developed at the [http://nlp.fi.muni.cz/en/nlpc Natural Language Processing Centre] of Masaryk University in Brno with financial support from PRESEMT and Lexical Computing Ltd. It also relates to the author's PhD research.