sentence74 | SoMeSci

Property	Value
nif:beginIndex	0 (xsd:integer)
nif:broaderContext	sms:PMC5381785
nif:endIndex	292 (xsd:integer)
nif:isString	The statistics are collected after each training epoch by letting the DQNs (in their current state) play against each other for 10 games, each game initialized with a different random seed (In Pong one game consists of multiple exchanges and lasts until one of the agents reaches 21 points.).
is nif:referenceContext of	sms:PMC5381785/sentence74/T10
rdf:type	nif:Context nif:OffsetBasedString nif:Sentence