We propose the widespread adoption of Data Portraits: artifacts that record data and allow for inspection. They complement existing forms of data and model documentation. We introduce a solution based on data sketching (compressed, approximate views of large data). Our implementation is minimal and efficient: it supports membership testing and nothing more. We document an open-source large language modeling dataset; try it below!
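
To make the idea of a membership-testing sketch concrete, here is a minimal illustration that assumes a Bloom filter as the sketch. The text above does not name the specific data structure, so the class `BloomSketch`, its parameters, and the double-hashing scheme are all hypothetical choices made for illustration, not a description of the actual implementation.

```python
import hashlib
import math


class BloomSketch:
    """Hypothetical compressed, approximate set supporting only membership tests.

    No false negatives; false positives occur at roughly `fp_rate`.
    """

    def __init__(self, expected_items: int, fp_rate: float = 0.01):
        # Standard Bloom filter sizing: m = -n ln(p) / (ln 2)^2, k = (m / n) ln 2.
        self.num_bits = max(
            8, math.ceil(-expected_items * math.log(fp_rate) / math.log(2) ** 2)
        )
        self.num_hashes = max(1, round(self.num_bits / expected_items * math.log(2)))
        self.bits = bytearray((self.num_bits + 7) // 8)

    def _positions(self, item: str):
        # Double hashing: derive all k bit positions from one SHA-256 digest.
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force odd so the step is nonzero
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, item: str) -> None:
        # Record an item by setting its k bits; the item itself is not stored.
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item: str) -> bool:
        # An item is reported present only if every one of its k bits is set.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item)
        )


# Build a sketch over some made-up text snippets standing in for a dataset.
seen = [f"document chunk {i}" for i in range(1000)]
sketch = BloomSketch(expected_items=len(seen), fp_rate=0.01)
for chunk in seen:
    sketch.add(chunk)

print("document chunk 42" in sketch)   # items that were added are always found
print(len(sketch.bits), "bytes used")  # far smaller than storing the raw text
```

A structure like this answers only "was this string recorded?", which matches the minimal scope described above: it cannot enumerate or reconstruct the data, and a positive answer carries a small, tunable chance of being a false positive.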