How LVMH Tracked Brand Visibility Across 14,000 Pages

Sasha Bouloudnine
14 Mar 2022

3 min read

LVMH Digital runs e-commerce intelligence for the group's beauty brands. Their Indirect E-commerce team tracks how LVMH brands — Dior, Guerlain, Givenchy — are displayed and performing on third-party e-retailers like Sephora.

To do that, they rely on SimilarWeb traffic exports. But there's a gap in the data.

SimilarWeb dashboard showing raw URL export.png

The problem with raw traffic data

SimilarWeb delivers URLs with traffic share percentages. Nothing else. A typical export looks like this:

sephora.fr/dior/DIOR-HubPage.html — 0.11% sephora.fr/guerlain/GUERL-HubPage.html — 0.06% sephora.fr/shop/maquillage-c302 — 0.33%
f

Without page categorization, meaningful analysis is impossible. Which URLs are brand pages? Which are product listings? The raw data doesn't tell you.

Raw CSV export from SimilarWeb.png

The scale of the challenge

The team needed to categorize thousands of URLs across 5 major e-retailers:

  1. Sephora US
  2. Sephora FR
  3. Ulta
  4. Nocibé
  5. Marionnaud

Each URL required manual inspection: page type identification, metadata extraction, and proper classification. For a team focused on strategic insights, spending weeks on data preparation was simply not viable.

Automated page categorization with lobstr.io

LVMH Digital partnered with lobstr.io to automate the entire workflow. The process was straightforward.

Step 1: Upload the raw SimilarWeb CSV containing all URLs to analyze.

Step 2: lobstr.io crawls each URL and classifies it into categories: ProductPage, BrandLandingPage, CategoryListPage, SearchPage, or BrandListPage.

Step 3: Extract 20+ attributes per page type, including traffic share, product counts, price ranges, and brands featured.

lobstr.io workflow diagram.png

Structured output, ready for analysis

Beyond classification, lobstr.io extracted rich metadata for each page:

{ "url": "https://www.sephora.fr/dior/DIOR-HubPage.html", "share": 0.11, "change": 0.02, "page_type": "BrandLandingPage", "title": "DIOR", "breadcrumb": "Accueil / DIOR", "total_products": 38, "lowest_price": 27, "highest_price": 329, "landing_subcategories": "Les nouveautés / Les incontournables" }
f

All delivered in structured spreadsheets, ready for pivot tables and dashboards.

Screenshot of final Google Sheets output Sephora FR.png

14,000+ pages processed

The final output covered 5 e-retailers:

E-Retailer Pages Categorized
Sephora FR 3,818
Ulta 3,392
Sephora US 2,847
Nocibé 2,156
Marionnaud 1,934
Total 14,147

Manual categorization would require approximately 2.5 minutes per page. That's 570+ hours of work, or 70 business days saved.

With lobstr.io, data was delivered in days, not months.

Pages categorized per e-Retailers.png

From raw data to brand visibility insights

With categorized data, the Indirect E-commerce team could finally report on what matters to LVMH's Maisons:

  1. Which brand pages are gaining or losing traffic on Sephora (+52% for Anastasia Beverly Hills, -21% for Guerlain)
  2. How many products each brand has featured on their landing pages
  3. How LVMH brands compare to competitors in terms of e-retailer visibility

The structured output enabled brand visibility tracking at scale — exactly what the team needed to brief internal stakeholders.

blog-not-found
We had these SimilarWeb exports with thousands of URLs, and every time it was the same question: what type of page is this? Before, I'd either ask my team to check them manually, or we'd simply deprioritize the analysis. Now I send the file, get the categories back the next day, and it's clean. It allowed us to properly scope our traffic analysis instead of working with incomplete data.
blog-not-found
Lina Truong,Indirect E-Commerce Program Manager, LVMH Digital

Key takeaways

  1. 14,000+ pages categorized across 5 e-retailers
  2. 20+ data points extracted per page
  3. 70 days of manual work saved
  4. Structured output ready for brand visibility reporting

Related Articles

Related Squids