How to search on Google for filetypes under the same domain


I was trying to figure out how many HTML pages a website has. I went to Google, and the first command I used to look for content under the website was:

That search returned 469 results (see image below), but it matched every file that Google has indexed under that domain: not only HTML files, but also PDFs, Word documents, etc.

Looking for filetypes (HTML) on a website (content under the same domain)

I did some research and found that there is a parameter called “filetype” that lets you specify the file type of the content you are looking for. Then, my next search was: filetype:html

That narrowed my search down to 4 results (see image below), because it matched only files with the .html extension.

Looking for filetypes (HTML) on a website (HTML files under a website)

Since I knew there were more HTML files on that site, I changed my search to: filetype:htm

That search returned the rest of the HTML pages (those with the .htm extension) under that website.

Looking for filetypes (HTML) on a website (.htm extension)
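The two searches above can also be combined into a single query. The following is a minimal sketch in Python, not the method used in this post: `build_query` is a hypothetical helper, `example.com` is a placeholder domain, and it assumes Google applies its `OR` operator to the two adjacent filetype terms.

```python
def build_query(domain, extensions):
    """Return a Google query restricted to one domain and some file extensions."""
    # One "filetype:" term per extension, joined with OR so either one may match.
    filetypes = " OR ".join(f"filetype:{ext}" for ext in extensions)
    # "site:" limits the results to content under the given domain.
    return f"site:{domain} {filetypes}"


# example.com is a placeholder; replace it with the domain you want to inspect.
query = build_query("example.com", ["html", "htm"])
print(query)
```

Pasting the printed string into the Google search box should list the .html and .htm pages together, so the result does not have to be pieced together from two separate searches.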

With that experiment, I was able to conclude that the website contains about 100 HTML pages indexed on Google.

The “filetype” parameter also works for other formats such as PDF, Word (DOC), Excel (XLS), PowerPoint (PPT) and PostScript (PS).
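To check several of these formats in a row, the search links can be generated instead of typed by hand. This is only a sketch under assumptions: `search_url` is a hypothetical helper, `example.com` is a placeholder domain, and the links are meant to be opened in a browser, not fetched by a script.

```python
from urllib.parse import urlencode

# Extensions for the formats listed above.
FILETYPES = ["pdf", "doc", "xls", "ppt", "ps"]


def search_url(domain, filetype):
    """Build a Google search link for one file type under one domain."""
    query = f"site:{domain} filetype:{filetype}"
    # urlencode escapes the colons and spaces so the query is safe in a URL.
    return "https://www.google.com/search?" + urlencode({"q": query})


# example.com is a placeholder; replace it with the domain you want to inspect.
urls = {ft: search_url("example.com", ft) for ft in FILETYPES}
for ft, url in urls.items():
    print(ft, url)
```

Each printed link opens the same kind of search described in this post, one file type at a time.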

The following are more resources to refine your search on Google:

About Luis Cuellar

Co-founder and Agile Coach, in charge of leading, managing and coaching agile project teams to achieve a high level of performance and quality in delivering projects that provide exceptional business value to government agencies and nonprofit organizations.