Parameters are optimized by the Downhill Simplex method as described in the book Numerical Recipes, chapter 10. The sample input file contains realistic values for the optimization parameters.
Indeed, the focus of the prior literature is to learn supervised models from hyperbolic data, rather than representing supervised models in hyperbolic geometry.