"""Cache PE-Core (or other registered vision-encoder) features. Mirrors the live PE encoding done at training time so callers can read patch-token features off disk instead of running the encoder ...
Modern deep-learning automatic music transcription (AMT) systems perform well on Western instruments, but their effectiveness on traditional Chinese instruments such as the pipa (琵琶) is largely unverified. The pipa poses distinctive ...