Computational intelligence: imperfect
information representation, integration and optimal querying, and rule based reasoning under uncertainty,
incompleteness and inconsistency with multivalued
logics
This theme addresses the problem of representing and reasoning with imperfect
information using a logical approach based on multivalued
logics, as well as the integration and querying of information coming from
different sources, in a distributed environment.
Motivation comes from the area of knowledge acquisition, representation, and
reasoning based on imperfect knowledge. Indeed, in the real world information
may be incomplete or may have a bounded level of certainty and on the other
hand contradictions may occur during the process of integrating information
coming from various sources as it is the case of collecting knowledge from
different experts. In multi-agent systems, different agents may give different
answers to the same query. It is then important to be able to process the
answers so as to extract the maximum of information on which the various agents
agree, or to detect the items on which the agents give conflicting answers.
Incompleteness, uncertainty and inconsistency of the information may be treated
by using ready to employ hypotheses when information is completely missing, and
multivalued logics based on the algebraic concept of bilattice, when information is incomplete, uncertain or
inconsistent.
In our framework the information concerns the truth values of information
items, and is obtained through queries to the relevant sources. The answers of
such queries are combined or integrated using a set of rules. In such a
setting, imperfect information i.e. incomplete, uncertain information from a
source, or contradictory information coming from different sources, can
elegantly be expressed and dealt with using bilattices
and an approach of reasoning based on rules whose semantics is natural in these
bilattice based multivalued
logics.
A connected research direction we tackle concerns the problem of optimal querying
of imperfect information. Conventional techniques based on the concept of homomorphism
have traditionally been used in database research to study the containment of
queries evaluated against conventional data. We have extended and generalized these
techniques such that the problem of query containment and equivalence
(essential in optimal query evaluation) can be successfully studied in the
context of sources containing imperfect information.
Applications in knowledge acquisition and representation, intelligent systems,
logic based fuzzy agent systems and imperfect information integration and
querying.
Data mining & Machine learning: dimensionality
reduction, data sampling and design of new clustering/segmentation techniques
We have introduced an approach to reducing dimensions in data and data clustering inspired from computational models used to evaluate economical parameters. In particular the techniques of dimensionality reduction are based on calculating the highest risk or in other words the lowest return associated with each attribute in the dataset.
Research under development concerns the crystallization, formulation and
implementation of new segmentation techniques for large datasets based on three
phases that make the process more tractable and insure a good clustering
quality. In particular the initial and the final phases make the process
scalable since they are optimised for processing
large data fragmentally, while the middle phase, whose role is to enhance
clustering quality of the result passed from the initial phase, is performed
only on data which are wholly resident in main memory, since the middle phase
is computationally intensive. These techniques, currently designed for
numerical data, need to be extended to nominal and mixed data.
Applications in data pre-processing and segmentation,
knowledge discovery in databases.
Back to Daniel Stamate's homepage