This is the presentation of the SSNT service through a detailed description of his features. During this description you can discover several relevant aspects for a complete perception of this new service that we are offering. If you are not interested in such a detailed description and want just a quick view of the service we propose two different alternatives:
If you are interested in a detailed description of the service we propose the following set of points:
Nowadays there is a significant need to deal with large amounts of multimedia information. With this service we want to develop a selective dissemination of multimedia contents, mainly of TV broadcast news. The use of advanced techniques for the processing of BN programs, through a segmentation and categorization process, made possible the access to the contents of the programs based on an individual definition of the user profiles.
Through this new service we made available the 8 o'clock news program of the main channel of RTP (Telejornal). The users are able to define which thematic areas they are interested and after the automatic processing of the program they receive an email with the news that fit to the requested domains. Be one of them and start using now this new service.
This system was initially developed in consortium in the scope of the European project ALERT between INESC ID Lisboa, 4VDO and RTP. The developments of the large vocabulary continuous speech recognition system have been supported by the project POSI/33846/PLP/2000 financed by FCT.
In the next figure a functional diagram of the service is presented.
As we can observe from the functional diagram, the system analyses a generic multimedia document and based on the contents segment it in coherent blocks, through a video and/or audio segmentation.
When the document contains audio an automatic transcription is performed through a large vocabulary continuous speech recognition system. Based on the block segmentation and on the text inside each block, resulting from the transcription or because is only a text document, an automatic detection of topics is performed in each block, with the possibility of clustering together several blocks in homogeneous segments according to the topics contents.
With the multimedia document divided into segments, and a set of topics assigned to each segment, a search is performed on the user profiles requiring the topics from that segments and an alert message is generated for that users.
At the end of the process the multimedia document is loaded into a database where we keep the document segmentation and the appropriate categorization in topics.
The development of the system was based on a three main blocks structure: the CAPTURE block, responsible for the capture of the monitoring defined programs, the PROCESSING block, responsible to generate the relevant markup information associated to each program, and the SERVICE block, responsible for the user interface and database management. The control of the overall process is based on a simple semaphore scheme.
In the CAPTURE block we have access to the list of programs to monitorize and the information about the beginning and ending time of the programs. This information is the input to a capture program that, through a direct access to a cable TV network, starts the program recording at the specified time. This capture program generates a file with MPEG-1 codified video and audio. When the recording process is finished, an MPEG-1 file was generated together with the signalling to start the next block.
In the PROCESSING block the audio stream, extracted from the MPEG file, is processed through successive stages for segmenting, transcribing and indexing. The resulting information is compiled in an XML file.
In the SERVICE block we deal with the user interface, implemented through a set of web pages, and databases for user profiles and programs. Each time a program is processed an XML is generated and the database is updated. The matching between the program information and the user profiles generates a list of alerts sent to the users through an email service.
This system presents a set of innovative features based on speech processing techniques and topic detection. However due to the development conditions we know that the system still have a set of limitations. Among them we highlight the following ones:
After this detailed description you are in conditions to access to the system. We hope that you find in SSNT the necessary features to starting using this service. If you need any additional information or if you wish to draw any comment we are available here.