I am trying to model the following problem as a Markov decision process. In a steel melting shop of a steel plant, iron pipes are used. These pipes generate rust over time. Adding an anti-rusting solution can delay the rusting process. If there is too much rust, we have to mechanically clean the pipe. I ..

#### Category : markov-decision-process

I am eager to apply a Markov decision process to the following problem: in the steel melting shop of a steel plant, iron pipes are used. These pipes generate rust over time. Adding an anti-rusting solution can delay the rusting process. If there is too much rust, we have to mechanically clean the pipe. I have categorised the ..
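One way to make the setup concrete is a small tabular MDP solved by value iteration. The sketch below is only an illustration: the three rust levels, the transition probabilities, the costs, and the discount factor are all made-up placeholders, since the excerpt does not give the author's actual categories or numbers.

```python
# Hypothetical placeholders: these states, probabilities, and costs are NOT
# from the question; they only illustrate the shape of the model.
STATES = ["clean", "light_rust", "heavy_rust"]
ACTIONS = ["wait", "add_solution", "mechanical_clean"]

# P[action][s][s2] = probability of moving from state s to state s2.
P = {
    "wait": [[0.7, 0.3, 0.0], [0.0, 0.6, 0.4], [0.0, 0.0, 1.0]],
    # The anti-rusting solution slows the drift toward heavier rust.
    "add_solution": [[0.9, 0.1, 0.0], [0.0, 0.85, 0.15], [0.0, 0.0, 1.0]],
    # Mechanical cleaning always returns the pipe to the clean state.
    "mechanical_clean": [[1.0, 0.0, 0.0], [1.0, 0.0, 0.0], [1.0, 0.0, 0.0]],
}

# R[action][s] = immediate reward (negative cost) of the action in state s.
R = {
    "wait": [0.0, -1.0, -10.0],
    "add_solution": [-0.5, -1.5, -10.5],
    "mechanical_clean": [-5.0, -5.0, -5.0],
}


def value_iteration(P, R, gamma=0.95, tol=1e-8):
    """Return (values, greedy policy) for the discounted tabular MDP."""
    n = len(STATES)
    V = [0.0] * n
    while True:
        # Q[a][s]: value of taking action a in state s, then acting greedily.
        Q = {
            a: [
                R[a][s] + gamma * sum(P[a][s][s2] * V[s2] for s2 in range(n))
                for s in range(n)
            ]
            for a in ACTIONS
        }
        new_V = [max(Q[a][s] for a in ACTIONS) for s in range(n)]
        policy = [max(ACTIONS, key=lambda a: Q[a][s]) for s in range(n)]
        if max(abs(x - y) for x, y in zip(new_V, V)) < tol:
            return new_V, policy
        V = new_V


values, policy = value_iteration(P, R)
for state, value, action in zip(STATES, values, policy):
    print(f"{state}: value={value:.2f}, action={action}")
```

With real plant data, the entries of `P` would be estimated from observed rust-level transitions, and `R` would encode the actual cost of solution, cleaning, and downtime.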

I’m looking for an example-based answer, whether that’s code directly in the answer or a link to a tutorial, but in any case more than a text-only answer. I’m curious: how would one define an arbitrary Markov decision process in OpenAI Gym for reinforcement learning? The sort of problem I see frequently in my ..
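As a rough illustration of what such a definition could look like, here is a minimal sketch of a table-driven environment that mirrors Gym's classic `reset()` / `step()` interface without importing Gym itself. The class name `TabularMDPEnv`, its constructor arguments, and the two-state example are all hypothetical. A real Gym environment would additionally subclass `gym.Env` and declare `observation_space` and `action_space`, and newer Gymnasium releases return a five-element tuple from `step()` rather than the four-element one used here.

```python
import random


class TabularMDPEnv:
    """Gym-style environment for a finite MDP given as explicit tables.

    Hypothetical helper, not part of Gym: it only imitates the classic
    reset()/step() call pattern.
    """

    def __init__(self, transitions, rewards, start_state=0, seed=None):
        # transitions[s][a] is a list of probabilities over next states.
        # rewards[s][a] is the reward for taking action a in state s.
        self.transitions = transitions
        self.rewards = rewards
        self.start_state = start_state
        self.rng = random.Random(seed)
        self.state = start_state

    def reset(self):
        """Put the environment back in its start state and return it."""
        self.state = self.start_state
        return self.state

    def step(self, action):
        """Sample the next state; return (observation, reward, done, info)."""
        probs = self.transitions[self.state][action]
        next_state = self.rng.choices(range(len(probs)), weights=probs)[0]
        reward = self.rewards[self.state][action]
        self.state = next_state
        done = False  # continuing task: this sketch has no terminal state
        return next_state, reward, done, {}


# Made-up two-state, two-action MDP used only to exercise the class.
transitions = [
    [[0.9, 0.1], [0.2, 0.8]],  # from state 0 under actions 0 and 1
    [[1.0, 0.0], [0.0, 1.0]],  # from state 1 under actions 0 and 1
]
rewards = [[0.0, 1.0], [0.0, 2.0]]

env = TabularMDPEnv(transitions, rewards, seed=0)
obs = env.reset()
for _ in range(5):
    action = random.choice([0, 1])  # random policy, for demonstration
    obs, reward, done, info = env.step(action)
```

Keeping the dynamics in plain tables means any finite MDP can be plugged in by changing `transitions` and `rewards` only.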

I am trying to use MDP Toolbox to implement an algorithm for the "average infinite" reward criterion for a random MDP that I generated with Python’s MDPToolbox library. While this library provides an optimal policy for such an objective over all initial states, I wish to find an existing implementation of an algorithm which provides ..

I’m working on a time-series problem and want to estimate the transition probability matrix of 2D time-series data (n_samples = 997, state height = 5, state width = 3). Each element of a state takes one of only 5 possible values: {-100, 0, 100, 200, 300}. I’m wondering if anyone can suggest an efficient way to estimate the transition probabilities ..
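A common baseline for this kind of estimate is to count observed transitions and normalise each row, which gives the maximum-likelihood estimate for a first-order Markov chain. The sketch below assumes each sample is one 2D state and that consecutive samples are consecutive time steps; the helper names `to_key` and `estimate_transition_probs` are hypothetical. Counts are stored sparsely in a dictionary because a 5 × 3 grid with 5 values per cell has 5^15 (about 3 × 10^10) possible states, so a dense matrix is impractical and most transitions will never appear in 997 samples.

```python
from collections import Counter, defaultdict


def to_key(grid):
    """Turn a 2D state (a list of rows) into a hashable tuple of tuples."""
    return tuple(tuple(row) for row in grid)


def estimate_transition_probs(states):
    """Maximum-likelihood transition estimates from one observed sequence.

    `states` is a time-ordered sequence of hashable states.
    Returns {state: {next_state: probability}} for observed transitions only.
    """
    counts = defaultdict(Counter)
    for current, nxt in zip(states, states[1:]):
        counts[current][nxt] += 1

    probs = {}
    for current, next_counts in counts.items():
        total = sum(next_counts.values())
        probs[current] = {nxt: c / total for nxt, c in next_counts.items()}
    return probs


# Tiny made-up example with 2 x 2 grids instead of the 5 x 3 states.
a = [[0, 100], [200, 0]]
b = [[-100, 0], [0, 300]]
sequence = [to_key(g) for g in (a, b, a, a, b)]
probs = estimate_transition_probs(sequence)
```

If the full-grid estimate turns out too sparse to be useful, one option is to apply the same counting to each cell separately, at the cost of assuming the cells evolve independently.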

