
1-11 of 11 results

  • hanlab.mit.edu
  • 22
Quantization can accelerate large language model (LLM) inference. Going beyond INT8 quantization, the research community is actively exploring even lower precision, such as INT4. Nonetheless, state-of-the-art INT4 quantization techniques only..

Relevance: 11.382136
  • hanlab.mit.edu
  • 22
The attention mechanism is becoming increasingly popular in Natural Language Processing (NLP) applications, outperforming convolutional and recurrent architectures. However, general-purpose platforms such as CPUs and GPUs are..

Relevance: 11.363364
  • hanlab.mit.edu
  • 22
Deep learning on point clouds has received increased attention thanks to its wide applications in AR/VR and autonomous driving. These applications require low latency and high accuracy to provide real-time user experience and ensure user safety...

Relevance: 11.363364
  • hanlab.mit.edu
  • 22
We address the challenging problem of efficient inference across many devices and resource constraints, especially on edge devices. Conventional approaches either manually design or use neural architecture search (NAS) to find a specialized neural..

Relevance: 11.363364
  • hanlab.mit.edu
  • 22
Quantization can accelerate large language model (LLM) inference. Going beyond INT8 quantization, the research community is actively exploring even lower precision, such as INT4. Nonetheless, state-of-the-art INT4 quantization techniques only..

Relevance: 11.291088
  • hanlab.mit.edu
  • 22
Quantization can accelerate large language model (LLM) inference. Going beyond INT8 quantization, the research community is actively exploring even lower precision, such as INT4. Nonetheless, state-of-the-art INT4 quantization techniques only..

Relevance: 11.106606
  • hanlab.mit.edu
  • 22
Tiny machine learning (TinyML) is a new frontier of machine learning. By squeezing deep learning models into billions of IoT devices and microcontrollers (MCUs), we expand the scope of AI applications and enable ubiquitous intelligence. However,..

Relevance: 11.106606
  • hanlab.mit.edu
  • 22
Quantization can accelerate large language model (LLM) inference. Going beyond INT8 quantization, the research community is actively exploring even lower precision, such as INT4. Nonetheless, state-of-the-art INT4 quantization techniques only..

Relevance: 11.106606
  • www.stratasense.com.au
  • 24
  • 1
  • 23
At Strata Sense, our philosophy is to take action and make sense of your property needs, from building and environment to community living, all of which aims to improve the value of the development as a whole. And by having our client's best interest in..

Relevance: 9.482418
  • www.iiwas.org
  • 2
  • 1
  • 58
iiWAS provides a platform for sharing knowledge, promoting innovation, and advancing the state of the art in information integration and web intelligence. Through this conference, attendees can gain valuable insights and contribute to the advancement..

Relevance: 4.4425097
  • www.chhs.niu.edu
  • 205
  • 3
  • 183
At the heart of the community, the College of Health and Human Sciences collaborates with local, national and global partners, co-creating innovative, interprofessional practices that champion health and wellness, create belonging, and celebrate..

Relevance: 1.592128