I guess it's more an academic question - although one important practical difference is that the second case actually requires an implementation of __atomic_load_1 to be provided from somewhere (e.g. -latomic), whereas the first doesn't.
↧