To me now it's pretty clear that Bayeselo model cannot fit the usual 400 logistic no matter what "scale" is used. The fit for small rating differences is achieved by the "default", and not scale=1, as the "default" fits the derivative of the ususal logistic in 0. In fact, now I think that "default" should be used by rating lists, but keeping in mind that it compresses ratings for large rating differences. I plotted the "default" and the true logistic on logarithmic scale to see the tails, with drawelo=200:

You were right, 200 on the Bayeselo curve is 212 on the logistic, a compression of 6%.
Here are the Bayeselo "default" compressions for several values of rating differences
100 Elo points: 2%
200 points: 6%
400 points: 15%
800 points: 25%
Therefore, if one doesn't use Ordo, then use Bayeselo "default", it is good for small rating differences, but keep in mind that on large scales Bayeselo compresses ratings by 5-20%. Scale=1 distorts the ratings for small rating differences, giving Bayeselos instead of Elos. I used a large drawelo of 200, smaller drawelo gives smaller compression.
Kai