|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Objectives Background Contact details Consortium members Work Areas Feedback Links ![]() |
AREA 4: PERSONAL TRAVEL SERVICES
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
As it is unlikely that a fully integrated, purpose-designed, harmonised system incorporating all data and services needed for travel will become a reality, at least in the near future, the task of providing high quality traveller services from door-to-door will tend to be broken down into the four following subtasks:
All subtasks can already be performed to a greater or lesser degree of sophistication. However, high-quality services cannot be built by collecting data with unstructured search methods. It requires well-standardised and harmonised systems at a high level with clearly defined interfaces. Furthermore, an intermodal seamless routing service is only possible if the contributing local data sources and information systems offer a high level of accuracy in terms of time, location and reliability, etc.
For these reasons, as already stated, the achievement of this scenario should be the long term aim, and requires a considerable amount of hard work. But in the more immediate future a new style of Value Added Service Provider (or Transport Information Broker) offering one-to-one services will probably play a central role. It may also retain certain importance in the longer term as there are always new offers created in the web-economy which do not fulfil standardisation requirements from the very beginning and therefore require unstructured searches in an unstructured environment.
The following is a status report on data retrieval methods in a distributed environment and identifies possible software developments and adaptations needed to efficiently operate a broker service in the travel field. As good progress is already being made in the deployment of seamless intermodal systems for traveller services, the paper also reflects on more sophisticated methods of harmonisation, co-operation and integration of systems. In addition, a brief description of some mathematical methods which help to establish users needs is given.
Information is provided in the Internet, digital libraries and (distributed) databases. In general, we distinguish between four methods for retrieving information, which are presented below:
Search engines are programs that search documents for specified keywords and return a list of the documents where the keywords were found. Although the search engine is really a general class of programs, the term is often used to specifically describe systems like Alta Vista and Excite that enable users to search for documents on the World Wide Web. Typically, a search engine works by sending out a spider to fetch as many documents as possible. It's called a spider because it crawls over the Web compiling the enormous lists of URLs that are the heart of any search engine. Another term for these programs is webcrawlers. Spiders are subsets of robots/agents (see below). Because most Web pages contain links to other pages, a spider can start almost anywhere. As soon as it sees a link to another page, it goes off and fetches it. Large search engines have many spiders working in parallel. Another program, called an indexer, then reads these documents and creates an index based on the words contained in each document. Each search engine uses a proprietary algorithm to create its indices such that, ideally, only meaningful results are returned for each query.
There are numerous search engines, total number exceeds 1000. The following table provides a short list of some of well-known ones. In order to cope with the situation, so called meta search engines (e.g. webcrawler.de) have been installed which make use of many search services.
Table 1: Examples of search engines
The main obstacle for retrieving relevant information from the web is the absence of a well defined underlying data model. Search engines use more or less unstructured methods to conduct searches on poorly structured databases. If we consider the search engines on the web today, we conclude that they continue to use indexes in a very similar to those used in libraries one century ago [2]. This is why most search engines provide too many matches not relevant for the user.
These difficulties have attracted new interest in information retrieval solutions. The recent developments of modern and inexpensive graphical user interfaces (GUI) and mass storage devices have also had impacts on the science of information retrieval itself.
The main problems generating research needs are to ensure
Search engines are undergoing a more profound evolution at the moment, the refinement of their special tools. It is expected that very soon the Web will evolve standards, such as standard categories, ways of automatically classifying information into these categories, and the search tools to take advantage of them, that will really improve the search process.
A robot (bot is short for robot) is a software tool for digging through data. The user gives a robot directions and it brings back answers. These computer programs run automatically without human intervention and have an unusual degree of autonomy. Typically, a robot is endowed with some artificial intelligence so that it can react to different situations it may encounter.
The term (ro)bot has become interchangeable with agent, to indicate that the software can be sent out on a mission delegated to it on behalf of somebody, usually to find information and report back. (Some robots operate in a specific place, for example, a robot in Microsoft Front Page automates work on a Web page).
Table 2: Characteristics of agents (source: [3])
The following table lists up all different categories of robots for all purposes:
| Newsgroup Bots | Chatter Bots |
| News Bots | Commerce Bots |
| Research On Bots | Data Mining Bots |
| Search Bots | E-Mail Bots |
| Shopping Bots | Fun Bots |
| Software Bots | Game Bots |
| Stock Bots | Government Bots |
| Update Bots | Knowledge Bots |
| Bot Design Misc. Bots |
Table 3: Categories of robot (source: www.botspot.com)
The wide-spread distribution and intelligent self-generation of the search mechanisms of agent technology makes the difference to other methods. That is why agents can search passive documents in the web or digital libraries and also adapt to active information systems front ends. However, it has to be stated, that agent technology up to now has not fulfilled the original promise, but is a powerful means with a huge potential.
Additionally, the application of multi-agent systems (system where agents communicate with other agents) requires a great deal of research work as there are not yet answers to the question of how they behave if their number increases.
Unlike the Internet which represents a practically unstructured data base, which means that search procedures are often rather similar to browsing, the contents of databanks are much better known, well-structured and, in addition, limited to certain subjects. This applies to central and distributed data bases architectures as well as the hierarchical and relational organisation of the data.
For this reason, the search procedure differs considerably from that used for browsing the web, as with robots and search engines. Search in distributed databases requires access to the data bank by an interface. Then a query has to be constructed using the methods provided by the database. Often there is the need to use a proprietary query language and also meta knowledge like a semantic model. In addition, a technical model of the data contents is needed for efficient exploration.
Information systems rely on underlying databases. Unlike passive data bases, to which users are simply given access, but then are left to generate queries on their own, using meta knowledge, information systems support the user by providing intelligent calculation algorithms to search within their own data and compute optimal results on that basis. These are proprietary search mechanisms and calculation algorithms according to the structure of the database. The user must formulate a query by filling out the front end or interface completely, and with correct input. On the other hand, there is no possibility for users to create their own queries other than the ones offered.
In both databases and information systems, a well designed interface helps save time in the information retrieval process in contrast to browsing the web with search engines or robots.
The ability to provide individual services represents one of the key issues for the success of a service. Here, we are not concerned with market research procedures which try to find out what customers want to know, but with the mathematical and technical methods needed to identify customers information requirements when initiating a query, as this has impacts on the user interaction with the system.
Unlike data retrieval, which aims to collect all objects satisfying clearly defined conditions, the characterisation and identification of users' (information) needs is not a simple problem. Information retrieval usually deals with natural language, which is not always well structured and could be semantically ambiguous [2]. Additionally, there is often uncertainty in users' wishes themselves. To be effective in its attempt, the identification tool must somehow interpret the content of the information items in a collection, and rank them according to a degree of relevance to the user query. The difficulty is not only to extract the information, but to perform filtering functions and decide its relevance.
The following are the most appropriate methods for solving this task:
These general remarks are also valid for the task of information retrieval in the area of travel and tourism.
Value Added Service Providers (VASPs) have to provide exhaustive and detailed traffic and travel information, covering the users needs in the geographic area of interest.
Internet is the most popular channel since it is low-cost and easy to access, but as the information is not organised, it has to be retrieved using specific methods and involved accessing huge amounts of data, as stated above.
A service provider (VASP) considering how to provide traveller information in a conurbation area faces many problems. First of all, the VASP has to be authorised to access the data of each of the local data providers. This involves keeping an updated list of local transport service providers, as well as acquiring knowledge of the data structure adopted and access methods available. Even when a VASP possesses the full set of data required to meet the user's request, a good deal of further elaboration and information retrieval still has to be done. In fact, the available information is usually raw data (e.g. position of the bus in the network, flow status in the network) and specific knowledge and elaboration are required to work out the information desired (e.g. the expected bus-stop arrival time, congestion warnings, etc.). When multimodal trip planning is required this involves further complexity, since it requires additional information (such as interchange nodes), usually not available and difficult to retrieve. The difficulty of the problems encountered increases with the number of cities involved.
The organisational and technical problems involved are summarised below:
These difficulties represent a serious obstacle to the VASP in achieving its objective, i.e. supplying the user effectively with information of the level of detail required.
To overcome the problems listed above, the following are important requirements:
Figure 3 - The current situation
One possible solution consists of setting up of open telematics platforms. These are systems (both hardware and software components) capable of providing a structured data pool accessible through a set of standard or de-facto interfaces with multiple physical connections. The platform is able to meet the requirements listed above, since it has the following features:
A platform client is an organisation/company interacting directly with the platform obtaining and/or providing data and information. Two main types of client can be identified:
1. The Data Provider who deals with tasks like:
2. The Value Added Service Provider who provides value-added services based on the data retrieved from the platform, dealing with further tasks such as:
The platform should be promoted and funded by the following actors:
The platform owner represents a local content provider who uses the platform to provide the information to interested VASPs, with agreements covering the economic, organisational and technical specifications.
The platform owner can also decide to act as a local VASP, with the provision of services based on the available data. In this case, other VASPs using the platform would run the risk of being bypassed. In fact, the traveller could decide to directly access the local travel information through the local VASP.
The VASPs themselves could be organised according to a hierarchical structure - local VASP, regional VASP, national VASP etc. - depending on the extent of the area of interest.
As the more important real-time features become, the more essential local and regional providers or platforms will become.
Mobility VASPs could also act as part of a portfolio of general service providers coming under the umbrella of a wider portal. Customers would possibly pay more willingly for travel services within the context of a complete service offer.
Transport operators are also increasingly likely to play the role of a VASP, with the tendency to associate with recognised travel brokers, as the importance of well-known brands cannot be underestimated in the web economy.
Figure 2 - A VASP with telematics platforms
A real-life example of implementation of this solution is TITOS (Torino ITS 2000 Open Showcase), the open platform which was set up in Turin in occasion of the 7th ITS World Congress. It is currently in operation and is being used to develop new ITS services for the city of Turin and give a strong impetus to the commercial deployment of ITS technology in Europe.
The platform provides its clients free of charge with a set of resources (data, communication channels and user terminals) used by organisations or companies wishing to promote their equipment and services by simply plugging into the platform.
The TITOS platform serves as a neutral demonstration environment for ITS operators. An important objective is to promote the integration and interoperability of ITS technologies, and to reinforce norms and standards at the European level. TITOS is supported by EC funding as part of the SMITH project and will continue to be operational as a "test bed" until the end of 2001. The platform provides the opportunity for testing and upgrading products or services. The necessary technical assistance for the Clients is provided free of charge.
The TITOS platform experience could be replicated in any city where traffic and travel information services are to be developed. To proceed, the key features of the platform have to be considered and analysed as reported in the following sections.
Figure 3 - Real life plug and play environment
As shown in Figure 4, the two main interfacing elements of the platform physical architecture are the Resources and the Clients.
TITOS Resources consist of data, communication channels or user terminals available through the platform. The Clients access the Resources using the TITOS Interfaces. The Interfaces allow the Clients to access the Resources using defined, emerging or de-facto standards. As a result, Clients can obtain data/information, provide data/information, provide services, demonstrate their equipment, exhibit and test their applications.
The TITOS platform is supported by a number of 'Actors' who make available their infrastructures to form the TITOS environment. The role of main Actor is played by 5T, which is committed to providing the basic traffic and transport real-time data for the whole transport network of Turin, and providing most of the Interfaces to access this data. Other Actors have already made available other important Resources (e.g. RAI for the FM and DAB transmitting infrastructure, ATM for public transport information and data, MIZAR Automazione for the WAP gateway). Other Actors include private companies contributing by enlarging the range of TITOS Resources available. This mechanism allows a constant evolution and updating of the platform features.
Figure 4 - Elements of the TITOS architecture
Resources are classified according to the following categories:
Data
The "background scenario" consists of the traffic and transport network of Turin. Detailed real time data is provided by 5T, the source of core data. This data relates to private and public transport over the whole Torino city transport network, including traffic flows, travel time, vehicle density, queue length, and arrival times (for public transport), etc. historical profiles, forecasts and targets are also available. There is, in addition, real time data on pollution levels and free spaces in city centre car parks. The platform is also fed with traffic and travel information from other data providers. This includes traffic flows on Italian motorways and the surrounding Trans European Network, as well as tourist information, flight and bus-link timetables for Turin airport. The TITOS data can then be split down into the following:
Another classification of data is based on the update frequency of the information provided, so from this point of view data can be defined as static or dynamic.
Data can be provided as a consequence of a customised request i.e. a Client may require route guidance information which concerns specific city intersections. In this case, the answer is based on the results of dedicated processing engines.
The platform uses specific engines to provide information involving complex data processing. The TITOS engines are described in the following:
Communication channels
These are the telecommunication chains needed to reach the final users via different media channels. The communication channels available through TITOS include SMS and WAP, available as information channels on the GSM network, a multimedia channel (DAB/ DMB) for data broadcasts and the TMC (Traffic Message Channel) for transmitting information via FM/RDS radio. Information is also accessible on TV channels (Televideo and Teletext) and, of course, the Internet, expected to be the channel most used by TITOS final users.
User Terminals
This is the equipment needed by the traveller, who is the final user of TITOS. The services available through the TITOS platform can be accessed through a wide range of user terminals, available either as commercial products or demonstrated by equipment suppliers as prototypes. Users will be able to access most services using mobile phones, home PCs, PTA, car navigation systems, car radio or TV. A number of prototypes are being demonstrated. These include new generation in-vehicle systems (currently known as on-board "infotainment" devices), which integrate the navigation function with mobile phones and displays with DAB receivers, offering interactive information services.
The TITOS platform is designed to be used by:
The basic traveller assistance services of the platform are freely accessible to the general public in Turin. A focus group including a sample of average people was set up to obtain feedback and user reactions.
For users registered with TITOS, it is possible to have access to both on-event and on-demand information Services. These Services include:
In some cases, similar services are provided by different providers, allowing users to test and compare the various features. The strength of TITOS resides in the fact that it is not simply an academic exercise, but a real life environment where solutions can be tuned and finalised after confronting practical implementation problems.
The TITOS platform has undergone an evaluation process. As well as tests of the physical functioning of the system through built-in evaluation software modules, questionnaires were distributed to evaluate the user acceptance of the platform. The first impressions which emerged from the TITOS Forum held at the 7th World Congress in November 2000 identified the following:
|
Physical connections
|
||||
| Interfaces I. |
LAN
|
WAN
|
ISDN
|
INTERNET
|
| DATEX |
FTP
|
FTP
|
FTP
|
|
| TMCGWAY |
FTP
|
FTP
|
FTP
|
|
| MOTGWAY |
|
|
|
FTP
|
| WAPGWAY |
WDP
|
WDP
|
WDP
|
WDP
|
| SMSGWAY |
HTTP FTP
|
HTTP FTP
|
HTTP FTP
|
HTTP FTP
|
| INETGWAY |
TCP-IP
|
TCP-IP
|
TCP-IP
|
TCP-IP
|
| TVDGWAY |
FTP
|
FTP
|
FTP
|
|
| SQLSRV |
|
|
|
TCP-IP
|
[1] Traveller Information - Functional specifications, system architecture issues, 2nd Intermodality Area Workshop, CODE project, Aix-en-Provence, 1999.
[2] Ricardo Baeza-Yates and Berthier Ribeiro-Neto: Modern Information Retrieval, Addison Wesley, ACM press, 1999.
[3] Josef Withalm: Agents solving strategic problems in tourism ENTER 2000, Barcelona, April 2000
[4] T.Kämpke, F.J.Radermacher, P.Wolf: Supporting preference elicitation - The FAW preference elicitation tool. Decision Support Systems,Vol. 9, No.4, 1993.
[5] TITOS Technical Support: http://www.torino2000.itscongress.org/showcase/technica.htm.
[6] TITOS Services home page: www.5t-titos.it.
[7] eEurope Initiative home page: http://europa.eu.int/information_society/eeurope/index_en.htm