Looking back at the situation you had described (many business units with each same database, differing sizes of their local data) I recommended using MSSQL instead of consolidation of commonly used data, i.e. stem data. You may make a distinction between this data you use globally and other data only needed locally, that would be a partitioning of the data making sense, you separate the globally used data only and put it in MSSQL.
A customer I will leave January 2017 does have not only this part but all data of a reasearch&development module in one head quarter on premise SQL Server database (not the azure cloud) and has regions use terminal server to use the application and data at head quarter. So there is no need for replication and/or consolidaiton of data. I would still prefer replication with each region, but their internal test results stated it wouldn't work. I strongly assume without having the full insight, they failed on a mixture of false assumptions/interpolation of sclaing needs, bad setup, missing knowledge about the database design. For example many parts of the data are indeed not needed global, even in the head quarter some of it only is relevant to a single user, his projects, his formulas. And so a setup of a general full database replication was replicating way too much data, they also didn't make use of SSL tunneled ast internet connections but private data lines, which are much more expensive for same bandwidth and only have the advantage of guarantee of bandwidth, but are not more safe than SSL tunnels can also be.
Anyway, what data do you consolidate and why, if you answer that question (for yourself) in detail you knwo which part of the database should be centralized first. You have to decide whether it would be okay if each local unit has slower access to this data, when you centralize it, but still fast or even faster access for remaining on premise data. How fast does this data change, for example. If it's your product inventory, do you really need the knowledge of europe inventory in usa? If it's the products you sell, would it be ok to have a central single products table and each unit downloads and merges it into local data once a day only? Such ideas will lower the needs for replication bandwidth a lot better than blindly replicating all data, for example.
What you should see from these examples is, much broader and general thoughts on needs for which parts of data is needed now, before going into technical details. Whether the construct I have to maintain now pays - with one main business unit having the full data local and regional units connecting via terminal server - whether that pays also depends on how large business units are. The regional units put together still only are a split number <10% of total users, so the central unit has most data access. This wouldn't work out for all units being símilarly sized.
Bye, Olaf.