For the Crawler Spam filter, you will refer back to the list that you originally created and labeled. You will again create a REGEX pattern, but one slightly different from the pattern you created for the Ghost Spam.

This will be an "Exclude Filter", which you will most likely need to update every once in a while to add any new Referral Crawler Spam that you notice. Many times the same spammers just change their hostnames.

Create A Crawler Spam REGEX Pattern

Step 1. Look through your list and find the ones that you labeled as Crawler Spam.

Step 2. Cut & Paste these somewhere else in your document to create your REGEX pattern.

Note that in this REGEX pattern we do not enter the .com or .net as we did in the ghost spam REGEX pattern.

A Crawler Spam REGEX pattern should look something like this.

(best|100dollars|success|top1)\-seo|(videos|buttons)\-for|anticrawler|^scripted\.|\-gratis|semalt|forum69|7makemoney|sharebutton|ranksonic|sitevaluation|dailyrank|vitaly|video\-|profit\.xyz|rankings\-|dbutton|\-crew|uptime(bot|check)
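One way to sanity-check a pattern like this before pasting it into Google Analytics is to run it against a few hostnames in Python. This is only a rough sketch: it assumes that Python's re.search (a partial, case-insensitive match) behaves closely enough to the filter's matching, which may not hold in every case. The sample hostnames are illustrative, and example-blog.com is a made-up stand-in for a legitimate referrer.

```python
import re

# The example pattern from above, split across lines only for readability.
CRAWLER_SPAM = (
    r"(best|100dollars|success|top1)\-seo|(videos|buttons)\-for|anticrawler"
    r"|^scripted\.|\-gratis|semalt|forum69|7makemoney|sharebutton|ranksonic"
    r"|sitevaluation|dailyrank|vitaly|video\-|profit\.xyz|rankings\-|dbutton"
    r"|\-crew|uptime(bot|check)"
)


def is_crawler_spam(hostname):
    """Return True if any part of the hostname matches the pattern."""
    return re.search(CRAWLER_SPAM, hostname, re.IGNORECASE) is not None


# Illustrative referrer hostnames (assumptions, not data from the article).
referrers = [
    "best-seo-offer.com",  # matches (best|...)\-seo
    "semalt.com",          # matches semalt
    "uptimebot.net",       # matches uptime(bot|check)
    "example-blog.com",    # hypothetical legitimate referrer
]

# An exclude filter drops whatever matches, so only non-matches are kept.
kept = [h for h in referrers if not is_crawler_spam(h)]
print(kept)
```

If a legitimate referrer disappears from the kept list, one of the fragments in the pattern is probably too broad and needs tightening.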

The names in parentheses are different hostnames that share the same ending. As you can see at the beginning of the pattern, (best|100dollars|success|top1) lists hostnames that all end in "-seo". Individually they would be best-seo, 100dollars-seo, success-seo, and top1-seo. Remember that in the Crawler Spam REGEX pattern we don't include the .com's or .net's.

Grouping them in parentheses shortens and simplifies the REGEX pattern.
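To see that the grouped form catches the same names as spelling each one out, here is a small Python comparison. It is a sketch under the assumption that Python's re module treats grouping and the pipe the same way the Analytics filter does. The name worst-seo is made up to show a non-match.

```python
import re

# Grouped form, as used at the start of the pattern above.
grouped = r"(best|100dollars|success|top1)\-seo"

# The same four hostnames written out one by one.
spelled_out = r"best\-seo|100dollars\-seo|success\-seo|top1\-seo"

# "worst-seo" is a hypothetical name that neither form should match.
names = ["best-seo", "100dollars-seo", "success-seo", "top1-seo", "worst-seo"]

grouped_hits = [n for n in names if re.search(grouped, n)]
spelled_out_hits = [n for n in names if re.search(spelled_out, n)]

print(grouped_hits == spelled_out_hits)   # both forms catch the same names
print(len(grouped), len(spelled_out))     # the grouped form is shorter
```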

This is how mine currently looks.

(best|100dollars|success)\-seo|(videos|buttons)\-for|anticrawler|musica\-gratis|semalt|forum69|7makemoney|sharebutton|ranksonic|sitevaluation|dailyrank|vitaly|video\-|profit\.xyz|rankings\-analytics|obrazky\.cz|urlopener|top1\-seo\-service|wordpress\-crew

Also, don't include a pipe (the | bar) at the end of the REGEX pattern, or you will get an error message when trying to verify the filter.
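A simple way to avoid a stray pipe is to keep the names in a list and join them, so that a pipe only ever appears between two entries. The sketch below uses a hypothetical helper named build_pattern and a short, made-up list of fragments. It also shows why a trailing pipe is a problem in Python's re module: the empty alternative after the last pipe matches any hostname. Analytics may report this differently, so treat it as an illustration only.

```python
import re


def build_pattern(fragments):
    """Join non-empty fragments with pipes (hypothetical helper)."""
    cleaned = [f.strip() for f in fragments if f.strip()]
    return "|".join(cleaned)


# The empty string stands in for a blank line left over in your list.
fragments = ["semalt", "forum69", r"\-gratis", ""]
pattern = build_pattern(fragments)
print(pattern)

# With a trailing pipe, the empty alternative matches every hostname.
with_trailing_pipe = pattern + "|"
matches_everything = re.search(with_trailing_pipe, "example-blog.com") is not None

# Without it, a hostname that is not on the list is left alone.
leaves_legit_alone = re.search(pattern, "example-blog.com") is None

print(matches_everything, leaves_legit_alone)
```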

You will then paste your REGEX pattern into an "Exclude Filter".



Recent messages
JKAshipala Premium
Very helpful article indeed.
NRosales Premium
I found you, and though I am not applying the training on my GA, I know for sure that it will help me. My site is almost saturated with spam. I am almost a year old in WA; is this normal?
I would like to thank you for this training as I know it will help.
Aliee Premium
Hi, yes my site had lots of "referral spam" on it so I wanted to find a way to get rid of it. The website I found seemed to be the best and easiest one to help me do this.

If you've never applied any filters for the spam then yes, I think that's normal. I'm glad I could help! :)
GolfSpecialt Premium
Thanks Aliee, this was very helpful as a newbie here. One quick question: my tracking numbers look like this, UA-#########-#. Do I just enter this exactly as it appears in the REGEX string, separated by a |?
Aliee Premium
Hi, sorry to get back to you so late. I've taken another hiatus and hope to be back to work here soon! The tracking number that you're talking about, with the UA at the front, is not a spam name, so it should not be entered into the REGEX string. Feel free to send me a private message if you need more info on this.
electrobot Premium
I stopped using GA. I found GA to have zero value to me.
Aliee Premium
Really?? What are you using now to track your incoming traffic?
electrobot Premium
I just roughly gauge my traffic based on comments, site feedback, and email I receive. Also, I do a Google search regularly to see how my posts and pages appear. I've just had way too many problems with GA. Also, studying the stats has never helped me to increase my site traffic or produce more quality content. Just my thoughts and opinions on the matter. Thanks for your informative and instructive training. :)
Aliee Premium
Ok, makes sense. I mainly wanted to do this for my website because I didn't want the spam to interfere with getting a good ranking in Google. I didn't want to have mostly spam as traffic to my site and then have Google think that my site was spam too, especially when you're a new site and want to make a good impression. It just makes me feel better to know that I can control it to a certain extent. Take care :)