Storage system using multiple memory tiers (e.g., fast GPU memory and slower CPU memory) to balance speed and capacity.