Some follow-ups to your follow-ups.
* Re: Ptolemy and Copernicus. It seems to me that simply because one choice of coordinate system may be especially perspicuous or useful for making theoretical progress, as the coordinate system with the sun at the origin helped Galileo and Newton in a way that the earth-centered coordinate system did not, this does not imply that formulations of a theory that differ only in their choice of coordinate make different physical claims about the world. It is often the case that one choice of convention is more perspicuous and useful than another. I agree that the Copernican system did a better job of providing qualitative and intuitive explanations of astronomical phenomena. In fact, my impression is that the Ptolemaic system only described the phenomena and didn't really seek to explain them. Still one can imagine a proponent of the Ptolemaic picture trying to provide such explanations and doing a worse job of it than the Copernicans. But the point would remain that an elegant solution to a problem and a proper understanding of how to explain it often requires a good choice of conventions. So yes, it was the model of Copernicus that ultimately led to Newtonian mechanics but even by the lights of this final product, there is no physical significance to the coordinate system in which the sun is at the origin - only the *relative positions* of the planets to the sun are relevant for the Newtonian explanation of their motion. This is the sense in which I think that the methodological principle is right when it asserts that the choice of coordinate system is only conventional. The same sort of history might repeat itself in the sphere of the interpretation of quantum theory --- a particular choice of convention might provide more insight than other such choices. Nonetheless, I still think that we benefit from understanding which elements of our physical theories are merely conventions and which are not and that we stand a better chance of making progress when we don't mistake conventional differences for physical differences.
Let me also say something about the great debate over GRW, dBB, MWI and other comers. GRW makes different empirical predictions from dBB and MWI, so one needn't appeal to my methodological principle to decide between them. Even in comparing dBB and MWI, given that they each lead to different extensions or modifications of quantum theory - I'm thinking here of Valentini's nonequilibrium modification of dBB for instance - we also needn't appeal to my methodological principle. However, for the sake of argument, let's consider two models that do make precisely the same operational predictions. Let me take the example of GRW and a version of MWI wherein the anomalous decoherence postulated by GRW is reproduced by virtue of there being additional hitherto-unknown degrees of freedom that couple unitarily to the known degrees of freedom. Simulating the predictions of GRW in this way is always possible by virtue of the Stinespring dilation theorem, as I describe in the article. What should one say regarding these two models? Let's leave aside the stories that Everettians and collapsicans tell for a moment and consider someone who adopts a purely operational interpretation of quantum theory. The operational difference between the two views is that in one, the extremal set of allowed transformations corresponds to a set of nonunitary but linear dynamics, while in the other the set of allowed preparations includes additional degrees of freedom while maintaining unitarity of the extremal transformations. I take the view that this equivalence of operational statements teaches us something about the correct ontological interpretation of quantum theory. I suspect that neither the Everettian nor the collapsican would agree with me. They don't draw any ontological lesson from that equivalence. I'm saying that they're mistaken not to do so. The phenomenology of the purely operational version of the theory is, I think, our best guide to the correct ontology. So my view on the great debate over realist interpretations of quantum theory is that the existing camps are not likely to be the correct story, and to find the correct story we will probably need to focus more on the operational version of the theory.
* Re: spin in dBB. Looking at Holland's book again, I see now that in fact the evolution of the particle position in the BST model *does not* depend on the orientation of the particle, so the error was mine. Therefore, I now agree with you that the orientation in the BST model seems, relative to Bell's minimalist model, to be like the colour of the objects in the colour-and-shape model relative to the shape-only model for Plato's cave. Given this, I agree that this example does not do a good job of making the point that I want to make. Nonetheless, I think that my point can still be made using other examples of underdetermination in dBB. For instance, whether we use the electric or magnetic field strengths as our supplementary variables in a pilot-wave version of QED seems to be an example of a distinction in kinematics where neither variable seems to be like the colour in the Plato's cave example (but perhaps you can convince me otherwise). These cases are meant to demonstrate that while the combination of kinematics and dynamics has explanatory power, the fact that we can achieve empirical adequacy using different choices of kinematics and dynamics demonstrates that some aspect of the pair is purely conventional, purely colour-like. Perhaps my use of the Plato cave analogy has been a bit misleading. Given that the colour and the shape of the objects in Plato's cave are part of the kinematics of the prisoners' theories (nothing has been said about dynamics), the analogy might suggest that the methodological principle should be used only to decide among competing choices of kinematics, such as the kinematics in the BST model and the Bell model. But my argument is really that there are conventional elements that are a part of the pairing of kinematics and dynamics without being a part of the kinematics alone or the dynamics alone and to eliminate these conventional elements we must adopt a framework that doesn't take the distinction between kinematics and dynamics to be fundamental.
* Re: kinematical locality. I've been using the term "locality" in "kinematical locality" in a general sense, not wedded to spatial separation, but I'm happy to talk about kinematical locality in the case of spatially separated systems. I think I now understand the sense in which you accuse me of presupposing the notion in its definition. Let me see if I've got it right. I'll start with the background to my definition. It is common to make a distinction between systems and the attributes/properties of those systems. Often, the systems are taken to be primitive and the attributes are assigned to these. This, I think, is how one usually makes sense of claims about holistic properties. There is a system A, a system B, and a system which is the composite of these, call it AB. Each of these systems is assigned a set of properties. There exist holistic properties when some of the properties of AB cannot be understood as supervening upon properties of A and properties of B. My sense is that you want to say that systems are not primitive, that somehow they are defined as the locus of a set of properties. In this case, if a set of properties cannot be "factorized" into sets of properties of a pair of components, then we simply aren't warranted in saying that we have a pair of systems rather than a single indivisible system (interestingly, the only other person I've heard make such an argument for the incoherence of holism is Chris Fuchs). I have been assuming that we can make sense of claims of holism in the standard way. But, in the end, I think that the notion of kinematical locality is not really well-formed, so if you want to put more nails in its coffin by criticizing the standard way of making sense of it, please do so! Unfortunately, I'm not familiar with Albert's "marvelous point" theory, so you'll have to explain it to me.
* Re: Bell's formulation(s) of locality. You say "one can and should understand the second of the two formulations/diagrams as following from the first." I disagree. In the first formulation, the variables that screen off region 1 from region 2 are entirely confined to the backward lightcone of region 1. If Bell had wanted to define a region that both screened off 1 from 2 and screened off 2 from 1, then he should have taken a union of regions, one within the backward lightcone of regions 1, and the other within the backward lightcone of region 2. The fact that he included regions of space-time outside either backward lightcone implies, I think, that one cannot derive his second formulation from his first. I agree with what you say in the second half of your comment, namely, that Bell's second formulation does yield a notion of locality that does not presume kinematical locality.
*On the last point. I guess our intuitions simply diverge here. Still, I look forward to seeing what you come up with in the department of theories of exclusively local beables. The issue of kinematical locality is an important one that has not yet received the attention it deserves.
Thanks once again for your thoughtful and informed responses.