How Google Scholar reads PDF metadata
Google Scholar primarily uses on-page metadata (the first page of the PDF, the title in <title> tags if hosted on a webpage) plus citation graphs. But it falls back to the PDF's info dictionary when on-page extraction fails. A clean Title field can mean the difference between Scholar showing your paper's correct title vs. "Microsoft Word - draft_v3.docx". For preprints and gray literature, this is especially important.