Verifiable data points like node count, public channel count, and public capacity.
Anything else used to "build a better model" is not verifiable, so it arguably makes the model worse.
You could probe for unannounced channels and publish their funding transactions and channel points. That would be a "value add" in terms of estimating true usage of the network.
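For what it's worth, the three verifiable numbers above can be pulled out of a public graph snapshot in a few lines. This is only a rough sketch: the `nodes` / `edges` / `capacity` field names are an assumption about how the snapshot is laid out (loosely modeled on a JSON graph dump from a node), and `summarize_public_graph` is a made-up helper name, not part of any library.

```python
import json


def summarize_public_graph(graph: dict) -> dict:
    """Return node count, public channel count, and total public capacity (sats).

    Assumes (hypothetically) that the snapshot has a "nodes" list and an
    "edges" list, and that each edge carries a "capacity" field in satoshis.
    """
    nodes = graph.get("nodes", [])
    edges = graph.get("edges", [])
    # Capacity may be serialized as a string in JSON dumps, so coerce to int.
    total_capacity = sum(int(edge.get("capacity", 0)) for edge in edges)
    return {
        "node_count": len(nodes),
        "public_channel_count": len(edges),
        "public_capacity_sat": total_capacity,
    }


if __name__ == "__main__":
    # Tiny invented snapshot, for illustration only (not real network data).
    snapshot = json.loads(
        '{"nodes": [{"pub_key": "a"}, {"pub_key": "b"}, {"pub_key": "c"}],'
        ' "edges": [{"capacity": "100000"}, {"capacity": "250000"}]}'
    )
    print(summarize_public_graph(snapshot))
```

Anyone with their own node can rerun this against their own view of the gossip graph and compare, which is the whole point of sticking to verifiable numbers. Unannounced channels won't show up here by construction.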