Generalization Introduction

Introduction to Generalization

Generalization is the act of calculating the generalization of the overall predictions which allows you to accurately predict the correct results over the span of multiple samples rather than just one individual sample. While you can preform generalization calculations on your own program natively, CoffeeHouse provides the developers with the ability make this process easier by storing the generalization data on our servers and having CoffeeHouse use a pointer-like mechanism to calculate the generalization without any additional performance loss or overhead.
This works by storing all the predictions on a table then by adding up all the predictions and dividing the final results by the total size of the prediction table. So while there may be one or two false predictions, the overall prediction is accounted for by the dominate label which in most cases is the true and correct prediction. This is great for determining the final prediction of a set of a samples or to accurately determine the detected language of a chat conversation. While not all CoffeeHouse features supports generalization, the documentation will indicate when generalization is supported and what labels to expect. This part of the documentation will explain how to initialize generalizations and understand the results.
Find Content Find Content

Pointers

But now as you may realize, if your generalization table can only store 5 predictions, what if there's more predictions that you would like to apply? While you can't increase the generalization table size; you can overwrite older prediction values, CoffeeHouse keeps track of a position pointer which indicates what sector of the table is going to be written/overwritten to upon adding a new prediction, in the example shown the current pointer is "5", which is the last item on the table, when you add another prediction value to the generalization table then CoffeeHouse will overwrite the current pointer position and go to the next or back to the initial position which is 1.
Find Content