Web Filtering and AI for School

Web filtering and AI evaluation

These days it’s unusual to find a web filter vendor that doesn’t use machine learning or artificial intelligence somewhere in its products. But how can schools compare them?

Artificial intelligence systems are essential for keeping up with user-generated content and the ever-evolving list of filter-bypassing tools. These systems are usually effective against widespread types of content whose examples closely resemble one another, such as pornographic material, gambling sites, or anonymizer tools.

It’s difficult to compare the underlying technology, however, largely because AI can be used in many different ways.

For example, a vendor might use closed-loop learning or human-directed learning, with various models underneath, such as a simple hidden Markov model (HMM) or a neural network built with a framework like TensorFlow. All of these techniques can be applied well or poorly.

The most important question to ask is where your filter applies these AI techniques.

Artificial intelligence is commonly applied to one of two areas:

Inline with the web filtering in real-time

Real-time filtering is either baked into a network appliance or runs as part of a filtering client. You’ll see occasional updates to the rules database, but other than that, the filter makes all the decisions locally.
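As a rough illustration of local, per-request decisions, here is a minimal Python sketch. Everything in it is an assumption made for the example: the `InlineFilter` class, the keyword weights standing in for a trained model, and the threshold are hypothetical and do not describe any vendor’s actual implementation.

```python
from dataclasses import dataclass, field

# Hypothetical keyword weights standing in for a trained model.
KEYWORD_WEIGHTS = {"casino": 0.6, "jackpot": 0.5, "poker": 0.4, "proxy": 0.3}


@dataclass
class InlineFilter:
    """Toy inline filter: every decision is made locally, per request."""

    blocked_hosts: set = field(default_factory=set)  # local rules database
    threshold: float = 0.8                           # assumed cut-off

    def score(self, page_text: str) -> float:
        # Sum the weights of the keywords present in the page, capped at 1.0.
        words = set(page_text.lower().split())
        total = sum(w for k, w in KEYWORD_WEIGHTS.items() if k in words)
        return min(1.0, total)

    def decide(self, host: str, page_text: str) -> str:
        # Known-bad hosts are blocked outright; everything else is scored
        # against the content the user is actually about to see.
        if host in self.blocked_hosts:
            return "block"
        return "block" if self.score(page_text) >= self.threshold else "allow"

    def apply_update(self, new_hosts: set) -> None:
        # The occasional rules-database update from the vendor.
        self.blocked_hosts |= new_hosts
```

Because `decide` receives the page text itself, the same call works for content behind a login or content that changed a minute ago, at the cost of doing the scoring work on every request.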

Out-of-band offline processing 

With out-of-band intelligence, uncategorized URLs are fed back to the filter vendor, and each site is then visited by an automated web crawler or “spider”. The results are passed through the intelligent system, and a categorization is attached to the URL. The categorization makes its way back to the point of filtering in regular URL list updates.
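The feedback loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `OutOfBandPipeline`, `FilterPoint`, and the injected `fetch` and `classify` callables are hypothetical names standing in for the vendor’s crawler and intelligent system.

```python
from collections import deque
from typing import Callable, Dict


class OutOfBandPipeline:
    """Toy vendor-side pipeline: queue, crawl, classify, publish."""

    def __init__(self, fetch: Callable[[str], str], classify: Callable[[str], str]):
        self.fetch = fetch        # stand-in for the crawler ("spider")
        self.classify = classify  # stand-in for the intelligent system
        self.pending = deque()    # uncategorized URLs waiting to be crawled
        self.seen = set()         # avoids queueing the same URL twice
        self.categorized: Dict[str, str] = {}

    def report_uncategorized(self, url: str) -> None:
        if url not in self.seen:
            self.seen.add(url)
            self.pending.append(url)

    def process_queue(self) -> None:
        # Runs offline, so it can take as long as it needs per URL.
        while self.pending:
            url = self.pending.popleft()
            self.categorized[url] = self.classify(self.fetch(url))

    def build_update(self) -> Dict[str, str]:
        # Hand over everything categorized since the last update.
        update, self.categorized = self.categorized, {}
        return update


class FilterPoint:
    """Toy point of filtering that only knows its URL list."""

    def __init__(self, pipeline: OutOfBandPipeline):
        self.pipeline = pipeline
        self.url_list: Dict[str, str] = {}

    def lookup(self, url: str) -> str:
        category = self.url_list.get(url)
        if category is None:
            # Unknown URL: feed it back to the vendor and carry on.
            self.pipeline.report_uncategorized(url)
            return "uncategorized"
        return category

    def apply_update(self, update: Dict[str, str]) -> None:
        self.url_list.update(update)
```

Note that a URL stays "uncategorized" at the point of filtering until `process_queue`, `build_update`, and `apply_update` have all run, which is the delay an out-of-band design accepts in exchange for adding no per-request work.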

| Criterion | Inline | Out of band |
| --- | --- | --- |
| Speed of Reaction | Instant. Any filtering decision is applied straight away, leaving no opportunity for harmful content to get by. | Hours. Unknown content is queued waiting for the offline process to occur. Filtering is then caught up at the next regular update. |
| Effectiveness: Real-time Content | Excellent. Real-time or rapidly changing content is reassessed each time, so a correct decision is made against up-to-date data. | Poor. Generally the categorization of a site is either permanently fixed, or fixed for months. This leaves sites with changing content open to misclassification. |
| Effectiveness: Context | Weak. Inline filters only see one page at a time and can’t make decisions based on what’s linked to. | Strong. With plenty of time to make a decision, an out-of-band filter can download links and images. |
| Effectiveness: Logged-in Content | Excellent. As these filters work on the data the user sees, even content behind a login such as a forum or social media will get scanned. | Useless. The out-of-band filter sees only the login page, which rarely provides any actionable content. |
| Additional Latency | Low. Adding intelligence will usually add latency to each request. Properly designed systems will limit this, so it isn’t noticed by the user. | Zero. As all intelligence is out of band, there’s no additional latency. |


Looking at this table, it’s clear that an inline filter is far more effective against today’s web, which is increasingly volatile and often behind a login. It’s also worth noting that an inline approach does not preclude additional out-of-band filtering: if you can find a vendor that combines the two, you will get the best of both.
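One way to picture a combined approach is a decision function that consults the out-of-band URL list and still scores the live page content. The sketch below is hypothetical throughout: `hybrid_decision`, its parameters, and the threshold are assumptions made for illustration, not a description of any particular product.

```python
from typing import Callable, Dict, Set


def hybrid_decision(
    url: str,
    page_text: str,
    url_categories: Dict[str, str],        # from out-of-band URL list updates
    blocked_categories: Set[str],          # categories the school blocks
    inline_score: Callable[[str], float],  # stand-in for an inline model
    threshold: float = 0.8,                # assumed cut-off
) -> str:
    # Out-of-band knowledge: block if the URL list already says so.
    if url_categories.get(url) in blocked_categories:
        return "block"
    # Inline check on the live content, even for URLs the list allows,
    # so changed or logged-in content is still assessed on each request.
    if inline_score(page_text) >= threshold:
        return "block"
    return "allow"
```

The URL list contributes context gathered offline, while the inline score covers pages whose content has changed since they were last categorized.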
 

