Project (National)
Research on
Key Techniques of User-centered Emotion Elicitation and Regulation Dialogue
Generation. Natural Science
Foundation of China, 62272092, 2023.1~2026.12
The emotional interaction between humans
and computers via dialogue system is a research hotspot in artificial
intelligence. Most of the existing emotional dialogue generation techniques are
system-centered, and fail to leverage the user's personality and dialogue
situation to actively affect the user's emotional state. This proposal
introduces a new research problem called User-centered Emotion Elicitation and
Regulation Dialogue Generation and conducts the following research: (1) We
study on dataset construction and external knowledge reconstruction techniques
to prepare the task-specific corpus and knowledge base; (2) We study on user
persona and dialogue situation modeling techniques to detect persona and
situation in human-computer interaction; (3) We study on human-computer
emotional interaction reasoning technique to predict user' elicited emotional
state; (4) We study on emotion elicitation and regulation dialogue generation
technique to generate the responses in single-turn, multi-turn or multi-modal
scenarios; (5) We study on content safety detection and dialogue quality
evaluation techniques to ensure the safety and quality of training data; (6) We
study on the dialogue generation techniques after model development for continuous
learning and updating of the model. This proposal creatively proposes to endow
the dialogue system with the ability to elicit and regulate users' emotional
states. The research results have broad application prospects in the emotional
companionship and psychological counseling in chatbots.
Research on
Interpretable Emotion-Aware Dialogue Generation. Natural
Science Foundation of China, 62172086, 2022.1~2025.12
The "sentiment interaction"
between human and computer in the dialogue system plays an important role in
the emotion companionship and psychological counseling of robots. Due to the
lack of explainability in the process of dialogue generation, the computer is
difficult to convince itself and the user, the user is difficult to know, and
know why, so we study the interpretable dialogue generation technology of
emotion awareness. It includes: (1) Interpretable model-oriented conversational
knowledge construction technology to solve the problem of training corpus
required for modeling; (2) Interpretable user utterance emotion awareness
technology, to solve the problem of computer emotion awareness for users; (3)
Interpretable dialogue generated emotion prediction technology to solve the
problem of empathy between computer and users; (4) Interpretable dialogue
generation technology based on emotion prediction to solve the problem of
computer response to users; (5) Criterion for the relationship between emotion
awareness, dialogue generation results and their interpretation and model
optimization technology, to solve the evaluation and optimization problems of
models and processes. Finally, a prototype system of interpretable dialogue
generation for human-computer sentiment interaction with the functions of
knowledge construction, dialogue understanding, dialogue prediction, dialogue
generation and dialogue management will be implemented. Achieve: (1) users get
better experience through explanations. (2) The results generated by models and
their corresponding explanations are mutually promoted and co-enhanced.
Study on
Human-Machine Conversation Text Generation Technique for Emotional Interaction. Natural Science Foundation of China, 61872074,
2019.1~2022.12
Implementing human-machine dialogue using
natural language is the hot research topic in artificial intelligence. At the
early stage, the dialogue system has no emotion and no personality. Therefore,
embedding the machine with human emotions has become one of the long-term goals
for intelligent dialogue system. In this proposal, we focus on the realization
of communications between human beings and machines using text with emotions
and opinions. We attempt to improve the human-machine dialogue system with a series
of new features, based on which the new text generation techniques are studied
for emotional interaction. 1) Considering different personalities of users and
different traits of machines, we study on modeling techniques for user
personality and machine trait in conversation context. 2) We study on emotion
and stance perception techniques for tracking users' dynamic emotion states and
detecting users' opinion stances. 3) We study on controllable emotion and
opinion text generation techniques for generating the text with specific
emotions or opinions. 4) Based on above conditions, we study on personalized
emotional response generation as well as ingratiating and refutation
opinionated response generation. This proposal innovatively injects user
personalities and machine traits into emotional conversations. Moreover, we
propose a novel research problem called opinionated conversation generation.
The research achievement of this proposal has a broad application prospect for
emotional companionship and psychological counseling using chatbot system.
Real-Time
and High Precision Dynamic Scholars Profile Construction. National Key Research and Development Projects
(Sub-Projects), 2018YFB1004702, 2018.7~2021.6
The subject is
oriented to the high-precision evolutionary scholar profile of the knowledge
graph. Aiming at the characteristics of scholars with diverse fields and great
differences, the modeling method of accurately describing the scholar profile
is studied. Through the real-time collection of multidimensional research
behavior data, the NSF classification, Chinese library method and knowledge
graph analysis of academic data are used to accurately construct the academic
profile. In view of the complexity of expert evaluation and ranking in the
profile of scholars, an evaluation system based on distributed spectrum
analysis algorithm is studied, so as to efficiently extract high-level experts
in the field and complete comprehensive evaluation and multidimensional ranking
of experts. Aiming at the real-time evolution of scholar's profile, the dynamic
updating rules based on the profile activation model are studied, including the
self-updating of the portrait library and the dynamic evolution of the relational
graph. Aiming at the complexity of scholars' profile modeling, this paper
studies the three-dimensional, accurate and credible methods of academic
portrait, including the methods of peer evaluation and expert recommendation.
Diversified
Social Relation Active Learning Based on Social Media Mining. Natural Science Foundation of China, 61772122,
2018.1~2021.12
Social relation learning on social media
plays significant role for many applications, such as information retrieval,
personalized recommendation, and community discovery. In recent studies, there
are more applications based on specialized social relations, but less work of
actively discovering potential social relations. There are more applications
about direct, explicit, and static social relations, but less work about
indirect, implicit, and dynamic social relations. There are more efforts on
detecting the existence of social relations, but less work of learning
semantics of social relations. Using the resources and users in social media as
data sources, this proposal defines diversified
social relation from
the perspective of form, semantic, and operation for social relations, and
analyzes the challenges for discovering the diversified social relations. This
proposal applies the social media mining, deep learning, and especially active
learning techniques to address the aforementioned challenges, and the major
research contents include: (1) the active learning of direct social relation
based on relation definition and annotation; (2) the active learning of static
social relation for recognizing the semantic of social relation; (3) the active
learning of dynamic social relation for discovering evolving patterns and (4)
the active learning of the correlation between relations for multi-relation
fusion and refinement. As result, this proposal will construct a multigraph
model for effectively demonstrating the diversified social relations in social
media. For this purpose, this proposal also studies the relation storage,
update and visualization based on multigraph structure. The results of the
proposed research subjects can better support many applications such as social
relation search.
Study on
Multi-Channel Feature Learning for the Sentiment Analysis on Micro Video. Natural Science Foundation of China, 61402091,
2015.1~2017.12
Micro video, which is becoming a pervasive
media type on the web, provides a rich repository of people's opinion and
sentiment about a vast spectrum of topics. The analysis of such information
either in the area of brand monitoring, market prediction or personalized
recommending plays an important role in behavior sciences. This problem poses a
set of unique challenges as micro video data present their intrinsic
characteristics as shortness in content, uniqueness in scenario, diversity in
transmitting modals, and relativity in data. The existed algorithms on
sentiment analysis can not satisfy the requirement
from micro videos. For that, study on multi-channel feature learning for
sentiment analysis of micro videos is proposed in this task. aiming at the
above characteristics on micro video data, we study sentiment collecting on
real-time Web data, sentiment semantics learning in the micro video space, the
multi-channel feature expressing model based on the video sentiment ontology,
and multi-channel feature fusing for sentiment analysis. Finally, we will
implement a prototype system combining the above studies. This task will
provide the relative technology supports on the applications of micro videos,
such as affective auto-analyzing, collecting, real-time monitoring or
personalized recommending.
Modeling
Multi-Level Resource Recommendation of Social Media Based on Broad-Sense
Content Analysis. Natural
Science Foundation of China, 61370074, 2014.1~2017.12
A recommender
system concerns with two main components which are users’ interest analysis and
recommended information selection. The plenty of information resources and user
communities with various topics in social media provide broader space for the
analysis and selection. In social media based
recommendation, the analysis and selection require analyzing all contents
including visual and aural features, semantic information,
and meta-data. In this project, we define above
contents as broad-sense content, classify the social media resources into
single, compositive, integrated, and sub-resource
categories, and study on social media multi-level resource
recommendation based on broad-sense content analysis.
For foundation, broad-sense content analysis
approach is explored. For users’ interest analysis, the studies include constructing user-resource heterogeneous information network,
discovering user community, and mapping user community topics into users’ interest
profile. For resource selection, the studies include mining other categories of
resources based on single resources, modeling multi-level resource with
conditional correlation, and identifying quality of the resources. For the
model application, we study on the resource match and recommendation approach
based on above models. The work can both reflect the characteristics in
recommending multi-level resource proposed in our project and emphasize the
user social relationship in the current social media researches. As a result,
we will implement prototype systems based on current main stream social media
for showing above characteristics.
Research on Key Techniques of Opinion Mining for Real
Time Public Opinion Monitoring in Microblogging Sphere. Natural
Science Foundation of China, 61100026, 2012.1~2014. 12
As a new
information carrier and transmission way, micro-blog is playing an increasingly
important role in the initiation and dissemination of network public opinion
information. Analyzing the network public opinion among them is of great
significance for understanding social public opinion. However, due to the
real-time, brief, community, especially multi-modal characteristics of
micro-blog data, the existing view mining algorithm is not good enough to meet
the needs of micro-blog public opinion analysis. Based on this, this research
focuses on the key technologies of opinion mining for real-time public opinion
monitoring in micro-blog space. In view of the above characteristics of
micro-blog data, this work researches the real-time Web data viewpoint
collection and tracking technology, the fine-grained sentiment analysis
technology of short texts, the opinion leader discovery technology of
micro-blog public opinion events, and the sentiment analysis technology for
multi-modal micro-blog data. Combined with the above technologies, this study
realizes a prototype system for monitoring public opinions in the micro-blog
space. The research results will provide relevant technical support for
automatic analysis, summary and tracking of real-time Web public opinion
information such as microblog.
Effective
Search and Service for Web Visual Media.
National Basic Research 973 Program of China under Grant (Sub-Projects),
2011CB302206-G, 2011.1~2015.8
This project
research the key technology of data mining: multi-modal visual media for super
high dimension characteristic of multimodal information, different modal data
of heterogeneous and features of the complexity of the type, the different
modal contribution to describe objects of uncertainty, and the different
characteristics of the same modal contribution to describe objects of
uncertainty, the modal characteristics of the fusion model. Aiming at the
inapplicability of traditional data mining algorithms in multi-modal data
sources, the efficient data mining algorithm, algorithm fusion and mining
result fusion technology based on multi-modal feature fusion model are studied.
Study on
Key Techniques of Opinion Mining and Opinion-Based Community Discovery for Web
Public Opinion Analysis. Natural
Science Foundation of China, 60973019, 2010.1~2012.12
As
the Internet become a cradleland of social opinions that reflect social hot
topics, “Web public opinion analysis” has emerged as a novel approach for
government to know the popular thoughts
of people. “Opinion mining” is currently a new
research area which concerns with the humanities and social science
knowledge such as propagation and psychology as well as
natural science knowledge such as text mining, information retrieval, and
natural language process. Different from the traditional text mining technique
which focuses on the facts of an event, opinion mining prefers to users’ opinions
and views of the event. Therefore, the opinion mining technique has new
characteristics and confronts new challenges, and becomes one of the main
supporting techniques for the automatic Web public opinion analysis. This project studies on the opinion mining key
techniques for Web public opinion analysis. By analyzing Web public opinion
documents, we summarize the characteristics of public opinion including broad
distribution, complex content, fuzzy emotion, time-variant orientation,
and exploring requirement for the application. For these characteristics, we
study on how to collect public opinion data and describe event and opinion, how
to cluster and classify public opinion, how to detect opinion drift and new hot
topics in opinion text stream, and how to discover opinion community and
opinion leader. As the result, we build an opinion mining prototype system
supporting Web public opinion analysis. The research result will be helpful for
automatic Web public opinion analysis.
Research on
User Motivation Deduction Model for New Generation Search Engine. Natural Science Foundation of China, 60573090,
2006.1~2008.12
New
generation searchs engine have such characteristics as interactive searching,
classific navigation, accurately related querying, and rapid updating. It is
necessary to accurately understand the users’ motivation for implementing the
functions of the new generation search engines. In this research, for deduction
users’ search motivation, the techniques of information retrieval, data mining,
data stream analysis and text mining are utilized, and natural language
processing, behavior science and cognition science are imported. For building a
user motivation deduction model, the analysis and deduction mechanism, content
and structure of its assistant information are studied and implemented.
Finally, a prototype for the functions of the model is built, and a front tool
with the characteristics of the new generation search engine based on the model
is provided in current search environment.
Study on
Zero-Input Personalization Techniques for High Quality E-Services on Internet. NSFC (Natural Science Foundation of China), 60173051, 2002.1~2004.12
This
project focuses on the key techniques for high quality E-Services
personalization on Internet, which aims to provide the technique of information
discovery and recommendation based on the personalized requirements of a user
without extra input efforts besides his natural browsing operations. To support
high quality E-Services on Internet, we proposed a zero-input personalization
CMR approach that integrates the techniques of data mining, rule resolution,
and information integrating. The research contents include personal data
collecting technique, Web data base/warehousing technique,
personalization-oriented Web mining technique, personalization rule resolution
technique, and personalization service recommending technique. Based on the CMR
technique, we have designed and implemented a personalization midware system
SmartWeb. The research involves data precessing, artificial intellgence,
behavior science and so on, and has important theoretical value and broad
application expectation.