Phishing Data Science example

Goal:

Given a list of suspected phishing url can we build a machine leaning model to predict malicious URL's

Process

Step 1

get list of potential phishing URL's
go to  https://openphish.com/feed.txt

Step 2

Get list of not suspected phishing URl's from Common Crawl
We cann't train on all url so we'll need to find a way to sampple
OpenDNS maintains random sample of 10,000 domains on github
Go to https://github.com/opendns/public-domain-lists get list of domains
once we have list use python to get complete URL list for all domains
https://dmorgan.info/posts/common-crawl-python/

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
README.MD		README.MD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Phishing Data Science example

Goal:

Process

Step 1

Step 2

About

Releases

Packages

Languages

mccushjack/phishing

Folders and files

Latest commit

History

Repository files navigation

Phishing Data Science example

Goal:

Process

Step 1

Step 2

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages