PropertyValue
nif:beginIndex
  • 0 (xsd:integer)
nif:broaderContext
nif:endIndex
  • 124 (xsd:integer)
nif:isString
  • Furthermore, a separate target network (with parameters θ′ in the formula above) is used for estimating the maximal Q-value.
rdf:type