Installation

WebsiteAnalyzer

Description

Website Analyzer, which analyse the following things

The title of the web site
The HTML version being used by the web site
Identify login forms in the web site?
Determine distinct, external links and there reachability.

Installation

git clone [email protected]:nabilshafi/WebsiteAnalyzer.git
go to project directoy
for install the dependencies run pipenv install
run pipenv shell for running the environment
run python web_analyzer.py https://www.facebook.com for running the project

Code Structure

The project is comprised of
- web_analyzer.py: Includes functions to gather link, identify forms and html version etc.
- test.py: Includes test for the above mentioned functionality

Dependencies

Beautifulsoup used for scrapping the web page. Validator used validate the correct url. Urlparse and urllib used to parse and accessing the url

There are alternatives available but I found them easy to use.

Challenges

Html versions: Identifying the html version was one of the challenge because there is no function to read doctype tag.
Login Form: Classification of a website containing the login form was another challenge.

Results

Ran the Web Analyzer on several different websites and manually matched them in order to verify the results.

Library

I will expose the check_version function because I haven't found any function to identify it.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
test_main.py		test_main.py
web_analyzer.py		web_analyzer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WebsiteAnalyzer

Description

Installation

Code Structure

Dependencies

Challenges

Results

Library

About

Releases

Packages

Languages

nabilshafi/WebsiteAnalyzer

Folders and files

Latest commit

History

Repository files navigation

WebsiteAnalyzer

Description

Installation

Code Structure

Dependencies

Challenges

Results

Library

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages