Institute for Systems Research  
 


search


ISR     UMD

Search ISR news archives



Risk-Sensitive and Mixed Estimation and Control of Markov Decision Processes

Research team

Steven I. Marcus (ECE/ISR), S.P. Coraluppi, S. Dey, P. Fard, E. Fernández-Gaucherand, D. Hernández-Hernández

Risk-sensitive control

The motivation is risk sensitive control. Derived risk-sensitive and mixed risk-neutral/minimax controllers and state estimators for discrete time Markov processes. Proved results on the structure of risk-sensitive controllers for Hidden Markov Models. Robust HMM Filtering for Dynamic Classification and Prediction.

The problem was detecting or predicting a change in operating condition. The applications are condition-based maintenance in manufacturing (e.g. tool wear detection) and vehicle detection and classification in the battlefield. HMMs are over-confident and give unrealistic confidence scores for dynamic classification in the tool wear problem.

Robust HMM Filters are filters and state estimators (decoders) for Hidden Markov Models (HMMs) that are robust with respect to noise models. Risk-sensitive filters (Dey, Moore), based on risk-sensitive control (exponential sum of costs – Baras, Marcus). Mixed estimators that minimize an expected cost--with a constraint on the worst-case (Dey & Marcus, 1998 IEEE Conf. on Decision and Control). Key point: Posterior probability distribution not a sufficient statistic (information state)

Risk sensitive and mixed estimation

Observations
Similar to risk sensitive (r.s.) control, but Ut replaced with state estimate

Cost
Recursive equations for information state and state estimate as in control case

Robustness of risk sensitive estimation. Penalizes higher-order moments of m.s. estimation error. More robust (approaches minimax or worst case) as . Is there a more direct way to trade performance and robustness?

Mixed HMM Estimation

Minimize m.s. estimation error subject to a constraint on the worst-case estimation error. Can directly trade off performance and robustness. Mixed estimators that minimize an expected cost--with a constraint on the worst-case (Dey & Marcus, 1998 IEEE Conf. on Decision and Control). These estimators and controllers are more robust with respect to modeling errors. Built on work of Baras, James, & others. Invited address at 1996 Conference on Mathematical Theory of Networks and Systems. Applications in portfolio management (Stettner; Bielecki, Hernandez-Hernandez, and Pliska). Application to prediction of failure of drilling tools in the Center for Auditory and Acoustics Research.

For more information

S.P. Coraluppi & S.I. Marcus, "Mixed Risk-Neutral/Minimax Control of Discrete-Time, Finite-State Markov Decision Processes," IEEE Trans. Automatic Control, June 2000.

D. Hernández-Hernández, S.I. Marcus, & P. Fard, "Analysis of a Risk SensitiveControl Problem for Hidden Markov Chains,'' IEEE Trans. Automatic Control, 44, May 1999, 1093-1100.

S.I. Marcus, et. al., "Risk Sensitive Markov Decision Processes,'' in Systems and Control in theTwenty-First Century, C. I. Byrnes, et. al., eds. Boston: Birkhauser, 1997, 263-279.

S. Dey & S. I. Marcus, "A Framework for Mixed Estimation of Hidden Markov Models,'' Proc. 37th IEEE Conf. Decision and Control, Dec. 1998, Tampa, FL.

Additional references available at Steven I. Marcus' web site

 

   
Back to top      
Clark School Home UMD Home ISR Home