RDI Technologies
Explore innovative RDI Technologies powered by the latest AI approaches.
Sotoor (OCR)
Get the most value out of your documents
Sotoor is one of RDI technologies with all-in-one typewritten optical character recognition (OCR) software package. The engine converts scanned images of documents, into fully editable and searchable text files, while maintaining the layout of the original document. Sotoor guarantees accurate and reliable recognition.
ID Reader
A sophisticated OCR engine for automatic extraction of IDs information
Automatic Arabic OCR for personal IDs and official documents like license cards, residency cards, etc. The ID Reader module consists of two engines, one for edge detection and another for information extraction.
Handwritten OCR
Transform your handwritten documents into valuable digital content
Convert handwritten texts found in scanned documents into editable and searchable text files with this unique OCR technology. Since handwritten documents represent a major challenge compared to typewritten documents, this technology requires customization to suit the various languages, fonts, and document types based on the client’s needs.
Main Features:
- Our Handwritten OCR Engine can be adapted to fit customer’s needs, guaranteeing higher recognition rates compared to basic OCR engines.
- The Handwritten OCR Engine can be used as a core engine to build full custom solutions such as form readers, and other handwritten document readers.
Kateb - Speech-to-Text (STT)
Get the most value out of your audio and video files
Kateb is an Automatic Speech to Text (STT) software solution. It is one of best RDI technologies, an innovative engine transcribes Arabic voice from recorded audio/video files into fully editable and searchable text files. Kateb supports Modern Standard Arabic (MSA), as well as the Egyptian dialect and the Saudi dialect with high, fast, and reliable recognition rates.
Natiq - Text-to-Speech (TTS)
Transform your input text into valuable synthesized natural voice files
Natiq is an advanced text-to-speech software provided by RDI. Natiq enables users to convert Arabic raw text into spoken words in a male or female natural voice. This technology is built on Tashkeel (RDI Diacretizer) that converts the raw text into vowelized text for correct pronunciation. This easy-to-use robust software, enabling seamless audio-powered applications that enrich user experiences, and engage audiences.
HAFSS – Arabic Speech Verification
Automatic verification of the correct pronunciation of the Holy Quran verses
Hafss is an innovative technology for teaching the Holy Qur’an with the narration of Hafss as per ‘Assim. This is one of the most famous modes of reciting the Holy Qur’an. The main idea of the software depends on simulating the (session) of narrating the Qur’an. Hafss recites the verse, asks the user to do the same, and corrects the mistakes if any. Hafss is an easy-to-use technology that is both highly accurate and quick.
Tashkeel – Arabic Diacritization
Accurate Arabic diacritization in the matter of seconds
Tashkeel is an advanced automatic Arabic diacritization system that is able to process up to hundreds of Arabic words per second. The engine was carefully built on integrating statistical, rule-based, and deep learning approaches, which makes it one of unique RDI technologies. In addition to being a stand-alone system, Tashkeel’s accurate outputs can be utilized to improve the performance of other key systems such as Text to Speech, and Search Engines.
Araa – Sentiment Analysis & more
Valuable data insights
Araa is a ground-breaking sentiment analysis technology tailored for Arabic input and data. Araa integrates the latest Automatic Speech-to-Text, Sentiment Analysis technologies, and Arabic Natural Language Processing (NLP) techniques to provide accurate and reliable data insights from Arabic input, whether its text, or audio/video recordings. Araa provides insight about the content including trends, topic detection, automated sentiment analysis, and influence discovery.
Main Features:
- Araa can be adapted to fit customer’s needs so that it can guarantee higher accuracies compared to the basic sentiment analysis
- Araa can be used as a core engine to build full custom solutions such as Sentiment Analysis Platforms.
Kashef – Semantic Search
Kashef provides you with relevant search results using the latest semantic search technology, as its searches for contextual meaning of your query rather than a literal word.
Kashef is RDI’s Arabic text search technology. Kashef provides different search alternatives so that users can search at the level of the stem, lemma, or the word. It is also able to semantically expand queries by using appropriate synonyms for the context used in the original search query. Kashef helps you to explore your enormous data effectively and quickly compared to traditional search engines.
Main Features:
- Kashef can be adapted to fit customer’s needs so that it can guarantee more optimized search results compared to basic search engines.
- Kashef can be used as a core engine to build full custom solutions such as specific purpose Semantic Search engines.
Romooz – Named Entities Recognition
Categorize your raw data into valuable entities of different types
Romooz is the RDI’s Arabic named entity recognizer. Romooz can find mentions of specific entities in text. Romooz focuses on the entities of the news domain, mainly persons, locations, and organizations. Given sufficient data, Romooz can be easily applied to other domain texts, making it a powerful, reusable, and comprehensive tool.
RDI Summarizer
Transform your raw data into valuable summarized content
RDI Summarizer is a professional Arabic text summarization technology based on Natural Language Processing and Machine Learning technologies. It’s an extractive summarizer that automatically analyzes text and tries to identify the most significant sentences that keep important details and ignore sentences that do not add new information. The RDI Summarizer has a powerful sentence ranker which guarantees highly accurate results, helping you get a clearer picture in less time.
Main Features:
- RDI summarizer can be adapted to fit customer’s needs so that it can guarantee more optimized summaries compared to the basic summary engines.
- Using advanced deep learning approaches, the RDI summarizer can be used with any Arabic text from any domain with good results.
- RDI summarizer is a dynamic summarizer that can be easily customized to get the number or percentage of sentences that the customer’s needs.
RDI Topics Classifier
Automatically classify your documents based on their topics
RDI Topic Classifier is an effective technology that helps users classify different Arabic documents, into their right class based on a predefined set of topics. Save your time and effort and use RDI’s highly accurate Topics Classifier that’s proven to classify data with 93-97% efficiency rate.
Main Features:
- RDI Topic Classifier can be applied to articles of any domain. It only needs to have a sufficient number of pre-categorized documents to learn from them.
- RDI Topic Classifier Engine can be customized to arrange documents through hierarchical classification mapping.
RDI Key-phrases Extractor
Transform your raw data into valuable key-phrases
RDI has a model for extracting key-phrases from Arabic texts using a set of techniques designed specifically for this purpose. Key-phrases provide a brief description of the content of the document; they can therefore be used for several aspects such as document classification, indexing, search, and summarization, as well as the possibility of use in semantic search.
Main Features:
- RDI Key-phrases Extractor follows a hybrid model that combines statistical and morphological approaches and aims to achieve their maximum benefit, while avoiding the disadvantages of each approach if used separately.
- RDI Key-phrases extractor can be adapted to fit new domains and new terminologies based on customer’s needs.