They'll need to prove all of the new trains as they come in, yes? Not just the first set?
To varying degree. The test track is only for the telemetry logging where you're picking off faults that would appear
anywhere after enough duty cycles of operation. And those tend to be weighted heaviest to pilot cars and early deliveries where the fixes result in actual change requests to the manufacturer techs. i.e. The test track is going to weight early and 'obvious' at the faults it logs. It's largely agnostic to application, so for example this is why the Blue Line 0700's started their Boston careers on the OL Wellington test track for several months before ever being shipped to Orient Heights (Blue of course not having any test track of its own).
Then you have line-specific testing, where you have to get your reps in doing simulated runs. That's where you find out how these things handle "switching problems at Alewife" or speed enforcement on particular mission-critical stretch of track for proper headway management, or test for door-stick issues by making all-stops non-revenue runs, or see how they handle running at full HVAC power draw over a potentially vulnerable at-capacity power section, or do the weight simulations with sandbag 'passengers' to test bridge loadings. You can only do that on the mainline...mostly overnight, but also as we saw this past week with the Red pilot set on available off-peak slots.
Once you get deeper into the orders and have accumulated most of your change requests, each new delivery spends a lot less time on the test track before graduating straight to mainline testing. Because at that point most of the new faults you uncover are purely line-situational (door-stick, etc.) and only uncoverable by skewing your testing reps to the mainline. You'll see fewer and fewer test cars packed inside and out with telemetry wires and sensors duct-taped to every surface and snaking out under the door lips to the undersides, because focus turns from breadth of telemetry collection to
consistency of telemetry collection. This is about the threshold where Blue Line deliveries 16 years ago stopped coming to Wellington and started getting unload straight to Orient Heights; the ones that malingered at Wellington were generally the most heavily modded earlier cars needing add'l testing on the mods, and not the factory-fresh deliveries that baked in any change requests right from Day 1 assembly.
By that mid-order threshold through the end where cars are getting graduated way more quickly into service after delivery, they're not spending much time on the test track at all before going into mainline test rotation. And are doing so with little to no extra onboard telemetry equipment. That's simply burn-in...racking up reps of simulated runs to see if the consistency holds up. The test tracks will get their use because the rate of deliveries is so much quicker on the backside, but each pair spends very little time there before going straight to a short-duration but intense duty-cycle run on the mainline before graduating quickly into revenue service.
Thus...the most critical period for test track usage is in that "15% in" threshold of the order where you're past the first purely-test pilots, past the first large-scale warranty mods, and in that sensitive part where you're trying to bring down the rate of new telemetry faults. That point where you can start reducing the number of sensors painstakingly taped up across the cars and start multitasking with mainline testing of more total cars watched by fewer tech eyes...filling up the overnight shift with simulated runs, slotting a few on the far off-peak without fear of it hosing revenue service. Red is basically sitting at this threshold right now. The pilot set has already moved onto purely mainline testing and simulated runs, in pursuit of that quoted early-Fall timetable for first revenue tests.
Now...what's changed is the great COVID pause. Orange has gotten months of bonus time on the Wellington test track, and simultaneously all the bandwidth in the world to multitask its sets with mainline testing because of the much-reduced service levels. Orange being the identical base model in all but dimensions for Red means that a lot of the activity you would've seen on Track 61 has been shifted to Wellington, with less urgency for CRRC to rush-release another 6-10 cars of Red testers because the OL base model has been able to take on much more completist telemetry logging. So by circumstance you are now going to see a Red testing schedule skewed much more heavily towards mainline testing than test track. That makes sense and is fully explainable by the COVID reshuffle. Shit happened, and they chose the most expedient way to adjust.
What doesn't make any sense is why Track 61 hasn't been used when
right now and for the last 1-2 months would've been the ripest time of all to do back-and-forth repeat telemetry logging of the Red sets already on the property. It was a fair question from the beginning when this Track 61 build was proposed "Why aren't the Cabot leads and their crossovers enough test track for doing this?" They were adamant in front of the FCMB that it wasn't enough...that there were things beyond simply dispatching test moves vs. regular deadheads on the Cabot leads that made them not enough, and that the Cypher St. maint shed too was a critical piece. That's the part that baffles. It appears the Cabot leads were indeed enough, because they crammed 3 months worth of testing on those alone. Granted, that was easier with daily revenue service running on reduced schedule, but what of the "above-and-beyonds" mandating the never-used shed? They apparently got all that done right from the main carhouse...despite the fact that the carhouse has been
busier than usual rotating the higher quantity of daily unused trains in for maint TLC during the reduced service levels.
So if the Red pilots now thoroughly on the mainline-only portion of their testing are going to graduate to revenue trials without using it, and Orange is absorbing more of the late-stage shakedown testing as the base order...it is fair question for the Board to revisit the supposed urgency they were sold on for appropriating the $$$ for this never-used test track and shed. Because even with the unforeseen COVID reshuffle there's clearly also a large gap in
advertised urgency vs.
actual urgency. Where did all the shed's functions disappear to in the reshuffle? Where did the shakedown inadequacy of the Cabot leads disappear to in the reshuffle? Those are completely fair ass-covering questions, because right now is when we were supposed to be seeing that 'urgency' value proposition for the 61+shed build being realized...not on the backside of the order that's bulk-weighted to mainline testing.