The package can strip out HTML tags and handle badly declared character sets and invalid markup. It can also extract text from the meta section, not only the page body. The HTML parser used by this package is the one used by the Xapian search engine library
Practically, these are routines for categorizing text inside PostgreSQL. Requirements: · PostgreSQL. Limitations: · The project has seen little development activity lately.
Seq for chars in C.
It was written in C and C++ for Linux and Windows workstations. Here are some key features of "S.T.E.": · Multiplatform application (runs on Windows, Linux and others) · Multi-tabbed interface · Syntax highlighting for many languages/
Cross-platform operation is supported. Here are some key features of "Text Matrix": · Converts Chinese (GBK, GB2312) and English text to a matrix. · Sparse matrix storage. · Chinese character based ngram, Chinese word based ng
The script was written using the C programming language.
Urdu (Pakistan) Text Writer allows users to type Urdu. It makes composing Urdu easy and lets users copy and paste the Urdu text into other applications such as MS Word, Excel, Adobe Photoshop, CorelDraw, Ulead Media/Video Studio and any other
Tux Writer is a word processor developed for young children. It will be based on the SDL (Simple DirectMedia Layer) library, meaning it will be portable to various operating systems: Windows, Linux, Mac OS X, BeOS, and more. Tux Writer will have a ve
TEA is a GTK2-based text editor for Linux and *BSD. Despite its very small size, TEA provides hundreds of functions. TEA depends on GTK 2.4 (or higher) and, optionally, on Aspell. TEA can also utilize the power of GtkSourceView (as the text
ASCII Shifter is a simple program to shift ASCII characters. It shifts the given string through every possible offset (so aaa would go to bbb, then ccc, etc.).
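The shifting described above can be sketched as follows. This is a minimal illustration, not ASCII Shifter's actual code: the names `shift_ascii` and `all_shifts` are hypothetical, and wrapping around within the printable ASCII range (codes 32 to 126) is an assumption, since the description does not say what happens at the end of the range.

```python
FIRST, LAST = 32, 126          # printable ASCII range (assumed wrap-around bounds)
SPAN = LAST - FIRST + 1        # 95 printable characters


def shift_ascii(text: str, offset: int) -> str:
    """Shift each printable ASCII character by `offset`, wrapping within the range."""
    shifted = []
    for ch in text:
        code = ord(ch)
        if FIRST <= code <= LAST:
            shifted.append(chr(FIRST + (code - FIRST + offset) % SPAN))
        else:
            # Leave non-printable and non-ASCII characters untouched.
            shifted.append(ch)
    return "".join(shifted)


def all_shifts(text: str) -> list[str]:
    """Return the string shifted by every non-zero offset, in order."""
    return [shift_ascii(text, offset) for offset in range(1, SPAN)]
```

With these definitions, `shift_ascii("aaa", 1)` gives `"bbb"` and `shift_ascii("aaa", 2)` gives `"ccc"`, matching the example in the description.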
The fcat script allows you to concatenate two text files.
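Concatenating two text files can be sketched in a few lines. This is only an illustration of the idea, not the fcat script itself: the function name `concat_files`, its parameters, and the UTF-8 encoding are assumptions.

```python
import shutil


def concat_files(first: str, second: str, dest: str) -> None:
    """Write the contents of `first` followed by `second` into `dest`."""
    with open(dest, "w", encoding="utf-8") as out:
        for source in (first, second):
            with open(source, "r", encoding="utf-8") as handle:
                # Stream the file in chunks instead of loading it all at once.
                shutil.copyfileobj(handle, out)
```

`dest` should be a different path from both inputs, because it is opened for writing before the inputs are read.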
This program searches for and (optionally) replaces a string in a file or in a set of files matching a pattern. The pattern may contain MS-DOS and Unix * and ? wildcards. Features: - It doesn't use regular expressions. - The search can be performed case
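A literal (non-regex) search and optional replace over files selected by a wildcard pattern might look like the sketch below. It is not the program's own code: `replace_in_files` and its parameters are hypothetical, only one directory is scanned, and files are assumed to be UTF-8 text. Note that Python's `fnmatch` also accepts `[seq]` character classes in addition to `*` and `?`.

```python
import fnmatch
import os


def replace_in_files(directory, pattern, search, replacement=None):
    """Count literal occurrences of `search` in files whose names match `pattern`.

    When `replacement` is given, matching files are rewritten in place.
    Returns a dict mapping file name to the number of occurrences found.
    """
    if not search:
        raise ValueError("search string must not be empty")
    results = {}
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        # Wildcard match on the file name only; no regular expressions involved.
        if not os.path.isfile(path) or not fnmatch.fnmatch(name, pattern):
            continue
        with open(path, "r", encoding="utf-8") as handle:
            text = handle.read()
        count = text.count(search)          # plain substring search
        if count:
            results[name] = count
            if replacement is not None:
                with open(path, "w", encoding="utf-8") as handle:
                    handle.write(text.replace(search, replacement))
    return results
```

Calling it without `replacement` only reports the matches, which corresponds to the search-only mode mentioned in the description.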
KFormula is a formula editor for KOffice. KFormula can be used to create and edit mathematical formulas that can be included in other KOffice documents. It provides simple input facilities and supports the functionality you expect from a KOffice appl
KWord is a frame-based word-processing and desktop publishing application. KWord is capable of creating demanding and professional-looking documents. Whether you are a corporate or home user, production artist or student, KWord will prove a valuable
The Hebrew Editor package is intended mostly for Hebrew-speaking users for creating and editing Hebrew/English LaTeX documents. This package provides a text (terminal) based word processor which is extremely LaTeX oriented. Requirements: · LaTeX
This script displays Arabic text from right to left and processes it to change each letter's shape according to its position in the word. You can run it in the background, and it will monitor one or more virtual consoles. If i
1337-Generator is a plaintext-to-leettext converter. It uses GTK 2.x for the graphical user interface. It is mainly written for Linux but should also compile on Windows. It is very easy to use. You just enter your text into the inputf
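A plaintext-to-leettext conversion can be illustrated with a simple character substitution. The mapping below is one common convention chosen for this sketch, not the table 1337-Generator actually uses, and `LEET_MAP` and `to_leet` are hypothetical names.

```python
# Assumed substitution table; real leetspeak tools vary widely in their mappings.
LEET_MAP = {"a": "4", "e": "3", "i": "1", "o": "0", "s": "5", "t": "7"}


def to_leet(text: str) -> str:
    """Replace mapped letters (case-insensitively) and keep everything else as-is."""
    return "".join(LEET_MAP.get(ch.lower(), ch) for ch in text)
```

For example, `to_leet("leet")` returns `"l337"`.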
This package offers to programmers, translators, and even users, a well integrated set of tools and documentation. Specifically, the GNU `gettext' utilities are a set of tools that provides a framework to help other GNU packages produce multi-lingual
Protoeditor is a small KDE text editor developed for debugging scripts interactively. The goal is to provide a simple editor supporting a variety of debuggers for different languages. Protoeditor uses katepart as the text editor, so all the functional
Vile retains the "finger-feel" of vi, while adding the multiple buffer and multiple window features of emacs and other editors. It is definitely not a vi clone, in that some substantial stuff is missing, and the screen doesn't look quite the same.