How to design a threshold filter for machine learning?

Question

1.00/5 (1 vote)

See more:

Hey Guys,

I am receiving telemetry data at the rate of about 50 records/second.
Each record is variable length, with an average length of about 80 characters.
As you can see, I do not have a lot of time to run a binary classification prediction on each record.

I am building a tabular dataset that has six feature columns, and a binary response column (0, 1).
All features have floating point values, and there is no missing data.
All the features should be positive, but this is necessary, and not sufficient condition.

I am trying to filter out the obviously bad records, so I can avoid sending this data to the classification code.

What I am trying to do is determine the minimum values of the six features (f1, f2, ... f6), and use this as a filter.

To calculate of all six features requires about 400 milliseconds.
 
I have a cost function, C(f1,f2,f3,f4,f5,f6), that computes the actual (ground truth) values for each set of features, but it is compute-intensive (about four seconds).

My question is: How can I calculate the minimum values for f1, f2, ... f6, such that they maximize the prediction accuracy, so I can use these values as a threshold filter?

Charles

What I have tried:

I have only tried to guess at the minimum values.

Posted 8-Sep-20 11:06am

CBrauer

Add a Solution

Comments

[no name] 8-Sep-20 23:14pm

Do a plot of the frequency distribution.

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)