Products of arabiccorpus.sourceforge.net

Arabic Corpus 2004 arabiccorpus.sourceforge.net 

Education \ Science

The Arabic Corpus, compiled by Dr. Mourad Abbas (http://sites.google.com/site/mouradabbas9/corpora), is composed of Arabic texts for text categorization. The Khaleej-2004 corpus contains 5690 documents divided into 4 topics (categories). The Watan-2004 corpus contains 20291 documents organized into 6 topics (categories). Researchers who use... Freeware download of Arabic Corpus 2004, size 14.42 MB.