Abstract: This paper investigates the recursive state estimation for time-varying systems under time-correlated fading channels and multiple description coding scheme in the presence of observation ...
vLLM currently uses a single max_num_seqs parameter to control the batch size for both prefill and decode stages. Maybe support for PD hybrid scenarios because: Prefill stage: Requires smaller batches ...