{ "cells": [ { "cell_type": "markdown", "id": "6f8bf5c97b4d23d8", "metadata": {}, "source": [ "# Removing Semantic Duplicates\n", "\n", " In this recipe, we show how to use LLMs — in this case, OpenAI's `gpt-4o-mini` — to answer remove semantic duplicates from lists and objects.\n", "\n", "
Mirascope Concepts Used
\n", "Background
\n", "\n", "Semantic deduplication, or the removal of duplicates which are equivalent in meaning but not in data, has been a longstanding problem in NLP. LLMs which have the ability to comprehend context, semantics, and implications within that text trivializes this problem.\n", "
\n", "Additional Real-World Examples
\n", "