Posts

Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models

Context-aware Sequential Bundle Recommendation via User-specific Representations