The Engineering Company for the Development of Digital Systems

Contact Info
12A Haroun, Doqi, Giza Governorate, Egypt
info@rdi-eg.com
+20 2 37 49 94 63, +20 2 37 49 55 66, +20 2 37 49 95 61

RDI Technologies

Explore innovative RDI Technologies powered by the latest AI approaches.

OCR Technologies

Sotoor (OCR)

Get the most value out of your documents

Sotoor is RDI’s all-in-one optical character recognition (OCR) software package for typewritten documents. The engine converts scanned images of documents into fully editable and searchable text files while maintaining the layout of the original document. Sotoor guarantees accurate and reliable recognition.

ID Reader

A sophisticated OCR engine for automatic extraction of ID information

Automatic Arabic OCR for personal IDs and official documents such as license cards and residency cards. The ID Reader module consists of two engines: one for edge detection and another for information extraction.

Handwritten OCR

Transform your handwritten documents into valuable digital content

Convert handwritten text found in scanned documents into editable and searchable text files with this unique OCR technology. Because handwritten documents are a much greater challenge than typewritten documents, this technology requires customization to suit the languages, fonts, and document types each client needs.

Main Features:

    • Our Handwritten OCR Engine can be adapted to fit the customer’s needs, guaranteeing higher recognition rates compared to basic OCR engines.
    • The Handwritten OCR Engine can be used as a core engine to build full custom solutions such as form readers and other handwritten document readers.

Speech Technologies

Kateb - Speech-to-Text (STT)

Get the most value out of your audio and video files

Kateb is an automatic speech-to-text (STT) software solution and one of RDI’s leading technologies. Its engine transcribes Arabic speech from recorded audio and video files into fully editable and searchable text files. Kateb supports Modern Standard Arabic (MSA) as well as the Egyptian and Saudi dialects, with fast, reliable, and accurate recognition.

Natiq - Text-to-Speech (TTS)

Transform your input text into valuable synthesized natural voice files

Natiq is an advanced text-to-speech software package provided by RDI. Natiq enables users to convert raw Arabic text into spoken words in a natural male or female voice. The technology is built on Tashkeel (the RDI Diacritizer), which converts raw text into vowelized text for correct pronunciation. This easy-to-use, robust software enables seamless audio-powered applications that enrich user experiences and engage audiences.

HAFSS – Arabic Speech Verification

Automatic verification of the correct pronunciation of Holy Qur’an verses

Hafss is an innovative technology for teaching the Holy Qur’an with the narration of Hafss as per ‘Assim, one of the most famous modes of reciting the Holy Qur’an. The software simulates a Qur’an recitation session: Hafss recites the verse, asks the user to do the same, and corrects any mistakes. Hafss is an easy-to-use technology that is both highly accurate and quick.
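
The correction step of such a session can be pictured with a small sketch. It is not Hafss itself: it assumes a speech recognizer has already turned the learner's recitation into a sequence of words, and it only compares that sequence with the expected one. The function name `find_mistakes` and the placeholder tokens (`w1`, `w2`, ...) are hypothetical, chosen for illustration.

```python
from difflib import SequenceMatcher


def find_mistakes(expected, recited):
    """Compare the expected word sequence with the recognized recitation.

    Assumption: both arguments are whitespace-separated transcripts. A real
    verifier would work on audio and pronunciation, not on plain words.
    Returns a list of (kind, expected_words, recited_words) tuples, where
    kind is "replace", "delete" (word skipped) or "insert" (extra word).
    """
    exp, rec = expected.split(), recited.split()
    matcher = SequenceMatcher(None, exp, rec, autojunk=False)
    mistakes = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            mistakes.append((tag, exp[i1:i2], rec[j1:j2]))
    return mistakes


if __name__ == "__main__":
    # Placeholder tokens stand in for the words of a verse.
    for mistake in find_mistakes("w1 w2 w3 w4", "w1 wX w3 w4"):
        print(mistake)
```

An empty result means the recitation matched the expected words; each tuple otherwise points at the span the learner would be asked to repeat.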

NLP Technologies

Arabic Diacritization System

Tashkeel – Arabic Diacritization

Accurate Arabic diacritization in a matter of seconds

Tashkeel is an advanced automatic Arabic diacritization system that can process up to hundreds of Arabic words per second. The engine was carefully built by integrating statistical, rule-based, and deep learning approaches, which makes it one of RDI’s unique technologies. In addition to being a stand-alone system, Tashkeel’s accurate outputs can be used to improve the performance of other key systems such as text-to-speech and search engines.

Araa – Sentiment Analysis & more

Valuable data insights

Araa is a ground-breaking sentiment analysis technology tailored for Arabic input and data. Araa integrates the latest automatic speech-to-text, sentiment analysis, and Arabic natural language processing (NLP) techniques to provide accurate and reliable data insights from Arabic input, whether it is text or audio/video recordings. Araa provides insights into the content, including trends, topic detection, automated sentiment analysis, and influence discovery.

Main Features:

    • Araa can be adapted to fit the customer’s needs so that it can guarantee higher accuracy compared to basic sentiment analysis.
    • Araa can be used as a core engine to build full custom solutions such as sentiment analysis platforms.

Kashef – Semantic Search

Kashef provides you with relevant search results using the latest semantic search technology, as it searches for the contextual meaning of your query rather than the literal words.

Kashef is RDI’s Arabic text search technology. Kashef provides different search alternatives so that users can search at the level of the stem, the lemma, or the word. It can also semantically expand queries by using synonyms appropriate to the context of the original search query. Kashef helps you explore large volumes of data more effectively and quickly than traditional search engines.
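
To make the three search levels and query expansion concrete, here is a minimal sketch. It is not Kashef's implementation: the `ANALYSIS` and `SYNONYMS` tables are tiny hand-written stand-ins for a real morphological analyzer and synonym resource, the function names are hypothetical, and the expansion shown ignores context, which a real semantic engine would take into account.

```python
# Toy morphological table: surface word -> (stem, lemma).
# Hand-written for illustration only; not produced by any RDI engine.
ANALYSIS = {
    "الكتاب": ("كتاب", "كتاب"),    # "the book": the definite article is stripped
    "كتب": ("كتب", "كتاب"),        # "books" (broken plural): lemma is "book"
    "كتاب": ("كتاب", "كتاب"),      # "book"
    "مركبة": ("مركبة", "مركبة"),   # "vehicle"
    "سيارة": ("سيارة", "سيارة"),   # "car"
}

# Toy synonym table used for query expansion (an assumption).
SYNONYMS = {"سيارة": {"مركبة"}}

LEVEL_INDEX = {"stem": 0, "lemma": 1}


def normalize(token, level):
    """Map a token to its form at the requested level: word, stem, or lemma."""
    if level == "word":
        return token
    return ANALYSIS.get(token, (token, token))[LEVEL_INDEX[level]]


def search(query, docs, level="word", expand=False):
    """Return the indexes of documents sharing at least one term with the query."""
    terms = {normalize(tok, level) for tok in query.split()}
    if expand:
        for term in list(terms):
            terms |= SYNONYMS.get(term, set())
    hits = []
    for index, doc in enumerate(docs):
        doc_terms = {normalize(tok, level) for tok in doc.split()}
        if terms & doc_terms:
            hits.append(index)
    return hits


if __name__ == "__main__":
    docs = ["قرأت الكتاب", "هذه كتب جديدة", "اشتريت مركبة"]
    for level in ("word", "stem", "lemma"):
        print(level, search("كتاب", docs, level=level))
    print("expanded", search("سيارة", docs, level="lemma", expand=True))
```

In this toy data, each step from word to stem to lemma matches more documents for the same query, and expansion finds a document that contains only a synonym of the query term.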

Main Features:

    • Kashef can be adapted to fit the customer’s needs so that it can guarantee more optimized search results compared to basic search engines.
    • Kashef can be used as a core engine to build full custom solutions such as special-purpose semantic search engines.

Arabic Named Entity Recognizer

Romooz – Named Entity Recognition

Categorize your raw data into valuable entities of different types

Romooz is RDI’s Arabic named entity recognizer. Romooz finds mentions of specific entities in text. It focuses on entities in the news domain, mainly persons, locations, and organizations. Given sufficient data, Romooz can easily be applied to texts from other domains, making it a powerful, reusable, and comprehensive tool.

RDI Summarizer

Transform your raw data into valuable summarized content

RDI Summarizer is a professional Arabic text summarization technology based on natural language processing and machine learning. It is an extractive summarizer that automatically analyzes text, identifies the most significant sentences that keep important details, and ignores sentences that do not add new information. The RDI Summarizer has a powerful sentence ranker which guarantees highly accurate results, helping you get a clearer picture in less time.

Main Features:

    • RDI Summarizer can be adapted to fit the customer’s needs so that it can guarantee more optimized summaries compared to basic summary engines.
    • Using advanced deep learning approaches, the RDI Summarizer can be used with any Arabic text from any domain with good results.
    • RDI Summarizer is a dynamic summarizer that can be easily customized to return the number or percentage of sentences that the customer needs.
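
The idea of extractive summarization with a configurable number or percentage of sentences can be sketched as follows. This is an illustration only, not the RDI Summarizer: it swaps in a plain word-frequency score for the sentence ranker, does not filter redundant sentences, and the function name `summarize` and its parameters `count` and `ratio` are hypothetical.

```python
import math
import re
from collections import Counter


def summarize(text, count=None, ratio=None):
    """Return the highest-scoring sentences in their original order.

    Pass `count` for a fixed number of sentences or `ratio` for a fraction
    of the document (0.3 is used when neither is given). The score is the
    average corpus frequency of a sentence's words, a simple stand-in for
    a trained sentence ranker.
    """
    # Split after Latin or Arabic sentence-ending punctuation.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?؟])\s+", text) if s.strip()]
    if not sentences:
        return []
    if count is None:
        count = max(1, math.ceil(len(sentences) * (0.3 if ratio is None else ratio)))

    tokenized = [re.findall(r"\w+", s.lower()) for s in sentences]
    freq = Counter(word for words in tokenized for word in words)

    def score(words):
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)), key=lambda i: score(tokenized[i]), reverse=True)
    keep = sorted(ranked[:count])  # restore document order
    return [sentences[i] for i in keep]


if __name__ == "__main__":
    sample = "Cats sleep. Cats eat fish. Dogs bark."
    print(summarize(sample, count=1))
    print(summarize(sample, ratio=0.5))
```

Selected sentences are returned in document order rather than score order, so the summary reads like the original text.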

RDI Topic Classifier

Automatically classify your documents based on their topics

RDI Topic Classifier is an effective technology that helps users classify Arabic documents into the right class based on a predefined set of topics. Save time and effort with RDI’s highly accurate Topic Classifier, which is proven to classify data with a 93-97% efficiency rate.

Main Features:

    • RDI Topic Classifier can be applied to articles from any domain. It only needs a sufficient number of pre-categorized documents to learn from.
    • The RDI Topic Classifier engine can be customized to arrange documents through hierarchical classification mapping.
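
Learning from pre-categorized documents and mapping the result onto a topic hierarchy can be sketched as below. This is not RDI's classifier: it uses a small multinomial Naive Bayes model as a stand-in, and the class name, the `HIERARCHY` table, and the training texts are all invented for illustration.

```python
import math
from collections import Counter, defaultdict


class TopicClassifier:
    """Minimal multinomial Naive Bayes with add-one smoothing (a stand-in)."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # topic -> word frequencies
        self.doc_counts = Counter()              # topic -> number of documents
        self.vocab = set()

    def fit(self, labeled_docs):
        """Learn from (text, topic) pairs, i.e. pre-categorized documents."""
        for text, topic in labeled_docs:
            words = text.split()
            self.word_counts[topic].update(words)
            self.doc_counts[topic] += 1
            self.vocab.update(words)
        return self

    def predict(self, text):
        """Return the topic with the highest log-probability for the text."""
        total_docs = sum(self.doc_counts.values())
        best_topic, best_score = None, -math.inf
        for topic, n_docs in self.doc_counts.items():
            counts = self.word_counts[topic]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)
            for word in text.split():
                score += math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_topic, best_score = topic, score
        return best_topic


# Hypothetical two-level hierarchy: leaf topic -> parent topic.
HIERARCHY = {"football": "sports", "tennis": "sports", "stocks": "economy"}


def predict_path(clf, text):
    """Return (parent, leaf) for the predicted leaf topic."""
    leaf = clf.predict(text)
    return (HIERARCHY.get(leaf, leaf), leaf)


if __name__ == "__main__":
    clf = TopicClassifier().fit([
        ("goal match team", "football"),
        ("racket serve match", "tennis"),
        ("market shares price", "stocks"),
    ])
    print(predict_path(clf, "team goal"))
```

Here the hierarchy is applied after a flat prediction; another option would be to classify top-down, one level at a time.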

RDI Key-phrases Extractor

Transform your raw data into valuable key-phrases

RDI has a model for extracting key-phrases from Arabic texts using a set of techniques designed specifically for this purpose. Key-phrases provide a brief description of the content of a document; they can therefore be used for several purposes, such as document classification, indexing, search, and summarization, as well as in semantic search.

Main Features:

    • RDI Key-phrases Extractor follows a hybrid model that combines statistical and morphological approaches, aiming to get the maximum benefit from both while avoiding the disadvantages of using either approach on its own.
    • RDI Key-phrases Extractor can be adapted to fit new domains and new terminologies based on the customer’s needs.
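
A hybrid of morphological normalization and statistical scoring can be sketched in a few lines. This is an illustration under stated assumptions, not the RDI model: the `LEMMAS` and `STOPWORDS` tables are toy stand-ins for a morphological analyzer, the example uses English text for readability, and the score (phrase frequency times phrase length) is one simple choice among many.

```python
import re
from collections import Counter

# Toy resources (assumptions): a real system would use a morphological
# analyzer and a full stopword list for the target language.
STOPWORDS = {"the", "a", "of", "and", "is", "are", "in", "for", "to"}
LEMMAS = {"engines": "engine", "searches": "search"}


def extract_keyphrases(text, top_k=3, max_len=3):
    """Return up to top_k key-phrases, highest score first.

    Morphological step: tokens are mapped to lemmas so that inflected
    variants are counted together. Statistical step: candidate phrases
    (runs of non-stopword tokens) are scored by frequency times length.
    """
    counts = Counter()
    for chunk in re.split(r"[.,;:!?]", text.lower()):
        run = []
        # The trailing None flushes the last run of each chunk.
        for tok in chunk.split() + [None]:
            if tok is not None and tok not in STOPWORDS:
                run.append(LEMMAS.get(tok, tok))
                continue
            if 0 < len(run) <= max_len:
                counts[" ".join(run)] += 1
            run = []
    scored = {phrase: n * len(phrase.split()) for phrase, n in counts.items()}
    # Sort by descending score, then alphabetically for a stable order.
    return sorted(scored, key=lambda p: (-scored[p], p))[:top_k]


if __name__ == "__main__":
    sample = "The search engine is fast. Search engines are useful for the web."
    print(extract_keyphrases(sample, top_k=1))
```

Because "engines" is reduced to "engine" before counting, both sentences contribute to the same candidate phrase, which is the kind of benefit a purely statistical approach would miss.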