ALEXANDRIA, Va., Sept. 23 -- United States Patent no. 12,423,571, issued on Sept. 23, was assigned to SONY GROUP Corp. (Tokyo).
"Training actor-critic algorithms in laboratory settings" was invented by Piyush Khandelwal (Austin, Texas), James MacGlashan (Riverside, R.I.) and Peter Wurman (Acton, Mass.).
According to the abstract* released by the U.S. Patent & Trademark Office: "Reinforcement learning methods can use actor-critic networks where (1) additional laboratory-only state information is used to train a policy that much act without this additional laboratory-only information in a production setting; and (2) complex resource-demanding policies are distilled into a less-demanding policy that can be more easily run at production with ...