Welcome to the JaguarPC Community
JaguarPC
Sales: (888) 338-5261
Support: (888)-551-3050
Results 1 to 11 of 11

This is a discussion on OMG!!! Those Naughty PPL at DMOZ... in the Shared & Semi-Dedicated forum
I've been porting a DMOZ interface for PHP-Nuke for a while now. This has been quite a project, but highly popular on my site. I ...

  1. #1
    Yeah, I know a LOT! Vin DSL's Avatar
    Join Date
    Mar 2003
    Location
    Arizona Uplands
    Posts
    10,775

    Cool OMG!!! Those Naughty PPL at DMOZ...

    I've been porting a DMOZ interface for PHP-Nuke for a while now. This has been quite a project, but highly popular on my site.

    I was watching visitors the other night - spying on them actually - and holy crap!!! DMOZ is loaded with, shall we say, adult content. Oh, alright, porno...

    There it was, on my site - 1000's of links to perv sites, with thumbnails.

    Here's the deal... I can keep them from doing searches for adult sites, and I can stop the links from displaying for adult categories, but I need a list of words they might use to do such searches.

    I'm not into that whole swinging sex scene, so I don't know what words to block in the searches. I added all the words that come out of my filthy mouth, but I don't have a very large vocabulary when it comes to groupsex and so forth.

    Anybody know where I can get such a list?
    DISCLAIMER Any resemblance between the views expressed above and those of the owners and operators of this system is purely coincidental. Any resemblance between these views and my own are non-deterministic. The existence of Vin DSL is questionable. The existence of views in the absence of anyone to hold them is problematic. The existence of the reader is left as an exercise in the second-order coefficient.

    No Guts, No Story! VinDSL © 2010

  2. #2
    Community Leader jason's Avatar
    Join Date
    Sep 2001
    Location
    Rochester, NY
    Posts
    6,003
    Couldn't you just block the return of anything from /Adult categories. I know that by default the DMOZ search doesn't return hits from the Adult category, nor is it displayed on the home page. Maybe you could implement something similar.

    --Jason
    Jason Pitoniak
    Interbrite Communications
    www.interbrite.com www.kodiakskorner.com

  3. #3
    Loyal Client
    Join Date
    Sep 2001
    Location
    Wichita, KS
    Posts
    1,647
    many popular bulliten boards (such as YaBB) come with nice long censor lists, and while they don't have everything, there are quite a few in there.

  4. #4
    Yeah, I know a LOT! Vin DSL's Avatar
    Join Date
    Mar 2003
    Location
    Arizona Uplands
    Posts
    10,775
    Quote Originally Posted by jason
    Couldn't you just block the return of anything from /Adult categories. I know that by default the DMOZ search doesn't return hits from the Adult category, nor is it displayed on the home page. Maybe you could implement something similar.

    --Jason
    LoL! Oh, yeah? Go do a search for 'xxx'...

    EDIT: http://search.dmoz.org/cgi-bin/search?search=xxx

    This is a search using the default DMOZ search engine. Now, imagine that with thumbnails...
    Last edited by Vin DSL; 08-24-2004 at 06:31 PM.
    DISCLAIMER Any resemblance between the views expressed above and those of the owners and operators of this system is purely coincidental. Any resemblance between these views and my own are non-deterministic. The existence of Vin DSL is questionable. The existence of views in the absence of anyone to hold them is problematic. The existence of the reader is left as an exercise in the second-order coefficient.

    No Guts, No Story! VinDSL © 2010

  5. #5
    Yeah, I know a LOT! Vin DSL's Avatar
    Join Date
    Mar 2003
    Location
    Arizona Uplands
    Posts
    10,775
    Quote Originally Posted by mattsiegman
    many popular bulliten boards (such as YaBB) come with nice long censor lists, and while they don't have everything, there are quite a few in there.
    Good idea!

    I don't want you to think I'm being weird or prudish, but I have a lot of ppl watching my site, for content, because of 'union animous', e.g. 'anti-union animus'. I don't need the headache...
    DISCLAIMER Any resemblance between the views expressed above and those of the owners and operators of this system is purely coincidental. Any resemblance between these views and my own are non-deterministic. The existence of Vin DSL is questionable. The existence of views in the absence of anyone to hold them is problematic. The existence of the reader is left as an exercise in the second-order coefficient.

    No Guts, No Story! VinDSL © 2010

  6. #6
    Community Leader jason's Avatar
    Join Date
    Sep 2001
    Location
    Rochester, NY
    Posts
    6,003
    Quote Originally Posted by Vin DSL
    LoL! Oh, yeah? Go do a search for 'xxx'...

    EDIT: http://search.dmoz.org/cgi-bin/search?search=xxx

    This is a search using the default DMOZ search engine. Now, imagine that with thumbnails...
    OK, I stand corrected... I thought that normal searches blocked adult content, but I guess I'm wrong. Regardless, though, all of the "dirty" content is still listed in the Adult category, so you could filter the results and remove anything in that category. It might take a little work, but it wouldn't be impossible.

    --Jason
    Jason Pitoniak
    Interbrite Communications
    www.interbrite.com www.kodiakskorner.com

  7. #7
    Yeah, I know a LOT! Vin DSL's Avatar
    Join Date
    Mar 2003
    Location
    Arizona Uplands
    Posts
    10,775
    Quote Originally Posted by jason
    OK, I stand corrected... I thought that normal searches blocked adult content, but I guess I'm wrong. Regardless, though, all of the "dirty" content is still listed in the Adult category, so you could filter the results and remove anything in that category. It might take a little work, but it wouldn't be impossible.

    --Jason
    Yeah, the code's no problem...

    Code:
    [snip]
    
        $dirtywords = array(
            "dirt1",
            "dirt2",
            "dirt3",
            "et cetera");
    
        $browse = cleanBrowse($HTTP_GET_VARS["browse"]);
        $ver = $HTTP_GET_VARS["ver"];
    
        $searchstring = $HTTP_POST_VARS["search"];
        if($searchstring == "") $searchstring = $HTTP_GET_VARS["search"];
        $searchstring = cleanSearch($searchstring);
        
        $start = intval($HTTP_GET_VARS["start"]);
        $morecat = htmlspecialchars($HTTP_GET_VARS["morecat"]);
    
        if( $browse != "") {
            $catarr = explode("/", $browse);
            $i=0;
            while($catarr[$i] == "") $i++;
            if(strtolower($catarr[$i]) == "adult") {
                include("includes/blocked.php");
                die();
            }
        } else if($searchstring != "") {
    
            if(in_array(strtolower($searchstring), $dirtywords)) {
                include("includes/blocked.php");
                die();
            }
        }
    
    [snip]
    I'm just having trouble thinking of dirty words...
    DISCLAIMER Any resemblance between the views expressed above and those of the owners and operators of this system is purely coincidental. Any resemblance between these views and my own are non-deterministic. The existence of Vin DSL is questionable. The existence of views in the absence of anyone to hold them is problematic. The existence of the reader is left as an exercise in the second-order coefficient.

    No Guts, No Story! VinDSL © 2010

  8. #8
    Ron
    Ron is offline
    Loyal Client
    Join Date
    Aug 2002
    Posts
    7,306
    Here's a quick thought---
    capture the requests and monitor what categories come back, and then add those keywords to your list.

    Then you can manually remove the fluff words (e.g. and, big, Vin DSL, etc.) and add the rest to your blocks.

    But don't forget you need to be extra careful with dual meaning words, like ####, ####, and ESPECIALLY #### ## ########## ### ### !!!!

  9. #9
    JPC Guru
    Join Date
    Jan 2004
    Location
    I'm right behind you....
    Posts
    389
    Name every dirty word you can think of? Hehe, we played this game on my site months ago. Enjoy.

    (WARNING: Anyone who is easily offended or under 18 should NOT click that URL)

    http://www.snipeme.com/board/viewtopic.php?t=85

    Maybe that'll help ya

  10. #10
    Loyal Client
    Join Date
    Sep 2001
    Location
    Wichita, KS
    Posts
    1,647
    heehee

  11. #11
    Jag Veteran
    Join Date
    Sep 2002
    Posts
    650
    Just an idea:

    Go to Google, make a search with words you do know, click on the first few results and copy all the words in KEYWORDS meta tag.

    You can even write a PHP script that would do it automatically

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •