This project introduces WeTok, a powerful discrete visual tokenizer designed to resolve the long-standing conflict between compression efficiency and reconstruction fidelity. WeTok achieves ...
Abstract: This paper proposes an improved FT-Transformer model to improve the efficiency of intrusion detection in computer networks. Key modifications include: integrating the ColumnEmbeddingAdder ...
Abstract: In-context imitation learning (ICIL) is a new paradigm that enables robots to generalize from demonstrations to unseen tasks without retraining. A well-structured action representation is ...