Ok that explains it… does select_one_external support multilingual audio and images just like the normal choices sheet of the XLS form?
Based on this, it looks like media is not fully supported via an external CSV even monolingually, only with the workaround that involves populating the internal choice list (choices) tab of the XLSForm.