ProductOpener::Tags

NAME
SYNOPSIS
DESCRIPTION
GLOBAL VARIABLES
- %tags_fields
FUNCTIONS

ProductOpener::Tags provides functions to build multilingual tags taxonomies from source files, to use those taxonomies to canonicalize lists of tags, and to display them in different languages.

DESCRIPTION

GLOBAL VARIABLES

%tags_fields

Defines which fields hold lists of values. Taxonomized fields are added to this initial list by retrieve_tags_taxonomy.

FUNCTIONS

get_property_from_tags ($tagtype, $tags_ref, $property)

Return the value of a property for the first tag of a list that has this property.

Parameters

$tagtype

$tags_ref Reference to a list of tags

$property
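
The lookup described above can be sketched as follows. This is a minimal, self-contained illustration and not the module's code: the %example_properties table, the first_property_from_tags name, and the tag and property names are all hypothetical.

```perl
use strict;
use warnings;

# Hypothetical property table: tag type -> tag id -> property name -> value.
# The real module loads this data from taxonomy files; this layout is an assumption.
my %example_properties = (
    categories => {
        'en:example-child'  => {},
        'en:example-parent' => { 'example_prop:en' => 'parent value' },
        'en:example-other'  => { 'example_prop:en' => 'other value' },
    },
);

# Return the value of $property for the first tag of @$tags_ref that defines it,
# or undef when no tag in the list has the property.
sub first_property_from_tags {
    my ($tagtype, $tags_ref, $property) = @_;
    return unless defined $tags_ref;
    foreach my $tagid (@{$tags_ref}) {
        my $tag_props = $example_properties{$tagtype}{$tagid} or next;
        return $tag_props->{$property} if defined $tag_props->{$property};
    }
    return;
}

my $value = first_property_from_tags(
    'categories',
    ['en:example-child', 'en:example-parent', 'en:example-other'],
    'example_prop:en',
);
print "$value\n";    # prints "parent value"
```

The order of the list matters: the first tag carrying the property wins, even if later tags define it too.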

get_inherited_property_from_tags ($tagtype, $tags_ref, $property)

Return the value of an inherited property for the first tag of a list that has this property, and the corresponding matching tag.

Parameters

$tagtype

$tags_ref Reference to a list of tags

$property

get_matching_regexp_property_from_tags ($tagtype, $tags_ref, $property, $regexp)

Return the value of a property for the first tag of a list whose value for this property matches the regexp.

Parameters

$tagtype

$tags_ref Reference to a list of tags

$property

$regexp
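
A minimal sketch of this kind of filtered lookup, assuming the regexp is tested against the property value. The %example_properties table, the first_matching_property_from_tags name, and the tag and property names are hypothetical and only serve the illustration.

```perl
use strict;
use warnings;

# Hypothetical property table: tag type -> tag id -> property name -> value.
my %example_properties = (
    labels => {
        'en:example-a' => { 'example_code:en' => 'abc' },
        'en:example-b' => { 'example_code:en' => 'x-123' },
        'en:example-c' => { 'example_code:en' => 'x-456' },
    },
);

# Return the first property value that is defined and matches $regexp.
sub first_matching_property_from_tags {
    my ($tagtype, $tags_ref, $property, $regexp) = @_;
    return unless defined $tags_ref;
    foreach my $tagid (@{$tags_ref}) {
        my $value = $example_properties{$tagtype}{$tagid}{$property};
        return $value if defined $value and $value =~ $regexp;
    }
    return;
}

my $match = first_matching_property_from_tags(
    'labels',
    ['en:example-a', 'en:example-b', 'en:example-c'],
    'example_code:en',
    qr/^x-\d+$/,
);
print "$match\n";    # prints "x-123"
```

Here 'en:example-a' has the property, but its value does not match, so the search continues to the next tag.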

get_inherited_property_from_categories_tags ($product_ref, $property)

Iterating from the most specific category, try to get a property for a tag by exploring the taxonomy (using parents).

Parameters

$product_ref - the product reference

$property - the property - string

Return

$property_value

$matching_category_id

get_inherited_properties ($tagtype, $canon_tagid, $properties_names_ref, $fallback_lcs = ["xx", "en"])

Try to get a set of properties for a tag by exploring the taxonomy (using parents).

This method takes into account a property that is explicitly defined as "undef". Such a definition cuts the value only for the branch being explored, so the property may still get a value if the tag has several parent branches.

Warning: The algorithm is a bit rough and may not work as you would expect on a DAG. It does not (currently) handle nodes that are reached from multiple parents (in those cases you would expect to first explore the children from both branches). To make this work, the algorithm would have to explore the parents first and then decide the order, but this method is eager in order to save time.

Parameters

$tagtype - str, name of taxonomy

$canon_tagid - tag id for which we want properties

$properties_names_ref - ref to a list of property names

$fallback_lcs - fallback language codes to try

We may search for description:fr, but if the fallback is ['xx', 'en'] and we find a description:xx or description:en property, we will use that value.

Return

A ref to a hashmap where keys are property names and values are the values found. If a property name is not present, it was not found.
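
The exploration described above (eager search through parents, "undef" cutting a single branch, and language fallbacks) can be sketched as follows. This is an illustration under stated assumptions, not the module's implementation: the %example_parents and %example_properties tables, the helper names, and the choice to store a fallback value under the requested property name are all hypothetical.

```perl
use strict;
use warnings;

# Hypothetical taxonomy: 'en:c' has two parents that share the same root.
my %example_parents = (
    'en:c' => ['en:a', 'en:b'],
    'en:a' => ['en:root'],
    'en:b' => ['en:root'],
);
my %example_properties = (
    'en:root' => { 'description:en' => 'Root description', 'color:en' => 'grey' },
    'en:a'    => { 'color:en' => 'undef' },    # cuts color:en on this branch only
);

# Look up a property on one tag, trying the fallback languages if needed.
sub lookup_with_fallback {
    my ($tagid, $name, $fallback_lcs_ref) = @_;
    my $props = $example_properties{$tagid} // {};
    return $props->{$name} if defined $props->{$name};
    if ($name =~ /^(.+):[a-z]{2}$/) {
        my $base = $1;
        foreach my $lc (@{$fallback_lcs_ref}) {
            return $props->{"$base:$lc"} if defined $props->{"$base:$lc"};
        }
    }
    return;
}

# Depth-first, eager search: stop looking for a property as soon as it is found.
sub inherited_properties_sketch {
    my ($tagid, $names_ref, $fallback_lcs_ref) = @_;
    $fallback_lcs_ref //= ['xx', 'en'];
    my (%found, @remaining);
    foreach my $name (@{$names_ref}) {
        my $value = lookup_with_fallback($tagid, $name, $fallback_lcs_ref);
        if    (not defined $value) { push @remaining, $name; }
        elsif ($value ne 'undef')  { $found{$name} = $value; }
        # a value of "undef" drops the property for this branch
    }
    foreach my $parent (@{ $example_parents{$tagid} // [] }) {
        last unless @remaining;
        my $parent_found = inherited_properties_sketch($parent, \@remaining, $fallback_lcs_ref);
        %found = (%found, %{$parent_found});
        @remaining = grep { not exists $found{$_} } @remaining;
    }
    return \%found;
}

my $result = inherited_properties_sketch('en:c', ['description:fr', 'color:en']);
print "$_ => $result->{$_}\n" for sort keys %{$result};
```

In this example, color:en is cut on the 'en:a' branch but is still found through 'en:b', and description:fr falls back to the description:en value of the root.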

get_tags_grouped_by_property ($tagtype, $tagids_ref, $prop_name, $props_ref, $inherited_props_ref, $fallback_lcs = ["xx", "en"])

Retrieve properties of a series of tags given in $tagids_ref and return them, but grouped by $prop_name, also fetching $props_ref and $inherited_props_ref

Return

A ref to a hashmap where keys are the values of property $prop_name, and values are in turn hashmaps where keys are tag ids and values are hashmaps of properties and their values.

Example

We ask for quality tags, grouped by fix_action, while getting descriptions:

    {
        "add_nutrition_facts" => {
            "en:kcal-does-not-match-other-nutrients" => {
                "description:en" => "Kcal is not matching value computed from other nutriments"
            },
            "en:kcal-does-not-match-kj" => {
                "description:en" => "Kcal is not matching kJ value"
            },
        },
        "add_categories" => {
            "en:detected-category-baby-milk" => {
                "description:en" => "Detected category … may be missing baby milks"
            }
        }
    }
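
The grouping step itself can be sketched as follows. This is a simplified illustration that only reads direct properties: the %example_properties table, the group_tags_by_property name, and the tag and property names are hypothetical, and skipping tags that lack the grouping property is a simplification that may differ from what the module does.

```perl
use strict;
use warnings;

# Hypothetical property table: tag id -> property name -> value.
my %example_properties = (
    'en:example-issue-1' => { 'example_action:en' => 'fix_a', 'description:en' => 'First issue' },
    'en:example-issue-2' => { 'example_action:en' => 'fix_a', 'description:en' => 'Second issue' },
    'en:example-issue-3' => { 'example_action:en' => 'fix_b', 'description:en' => 'Third issue' },
    'en:example-issue-4' => { 'description:en' => 'No action defined' },
);

# Group tags by the value of $prop_name, keeping the properties listed in @$props_ref.
sub group_tags_by_property {
    my ($tagids_ref, $prop_name, $props_ref) = @_;
    my %grouped;
    foreach my $tagid (@{$tagids_ref}) {
        my $tag_props = $example_properties{$tagid} // {};
        my $group     = $tag_props->{$prop_name};
        next unless defined $group;    # simplification: tags without the property are skipped
        my %selected;
        foreach my $prop (@{$props_ref}) {
            $selected{$prop} = $tag_props->{$prop} if defined $tag_props->{$prop};
        }
        $grouped{$group}{$tagid} = \%selected;
    }
    return \%grouped;
}

my $grouped = group_tags_by_property(
    [sort keys %example_properties],
    'example_action:en',
    ['description:en'],
);
foreach my $group (sort keys %{$grouped}) {
    print "$group: ", join(', ', sort keys %{ $grouped->{$group} }), "\n";
}
```

The result has the same three-level shape as the example above: property value, then tag id, then the requested properties.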

get_all_tags_having_property ($product_ref, $tagtype, $prop_name)

For each tag of a given field ($tagtype, for example "labels" or "categories") and a given property ($prop_name, without the trailing colon (:), for example "incompatible_with:en"), return a hash mapping each tag id to its property value.

Note: this DOES NOT handle property inheritance.

Return

Example: get_all_tags_having_property($product_ref, "labels", "incompatible_with:en")
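
A minimal sketch of such a collection step, without inheritance. It assumes that the tags of a field are stored in the product under a "<tagtype>_tags" key. That key, the %example_properties table, the all_tags_having_property name, and the tag ids are assumptions made for this illustration.

```perl
use strict;
use warnings;

# Hypothetical property table: tag type -> tag id -> property name -> value.
my %example_properties = (
    labels => {
        'en:example-label-1' => { 'incompatible_with:en' => 'categories:en:example-category' },
        'en:example-label-2' => {},
    },
);

# Collect tag id => property value for every tag of the field that has the property.
sub all_tags_having_property {
    my ($product_ref, $tagtype, $prop_name) = @_;
    my %result;
    # Assumption: the product stores its tags for a field under "<tagtype>_tags".
    foreach my $tagid (@{ $product_ref->{"${tagtype}_tags"} // [] }) {
        my $value = $example_properties{$tagtype}{$tagid}{$prop_name};
        $result{$tagid} = $value if defined $value;
    }
    return \%result;
}

my $example_product_ref = { labels_tags => ['en:example-label-1', 'en:example-label-2'] };
my $tags_with_property
    = all_tags_having_property($example_product_ref, 'labels', 'incompatible_with:en');
print "$_ => $tags_with_property->{$_}\n" for sort keys %{$tags_with_property};
```

Only direct properties are read here, in line with the note that inheritance is not handled.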

remove_stopwords_from_start_or_end_of_string ( $tagtype, $lc, $string )

Remove stopwords (that are specific to each category) from the start or end of a string that has not been normalized. This function differs from remove_stopwords(), which works on normalized tags instead of strings and also removes stopwords in the middle.
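
The edge-only removal can be sketched as follows. This is an illustration and not the module's code: the %example_stopwords table, its contents, the strip_stopwords_from_edges name, and the case-insensitive, whitespace-delimited matching are all assumptions.

```perl
use strict;
use warnings;

# Hypothetical stopword lists: tag type -> language code -> list of words.
my %example_stopwords = (
    origins => { en => ['the', 'of', 'from'] },
);

# Remove stopwords from the start and the end of a string, leaving the middle untouched.
sub strip_stopwords_from_edges {
    my ($tagtype, $lc, $string) = @_;
    my $words_ref = $example_stopwords{$tagtype}{$lc};
    return $string unless $words_ref and @{$words_ref};

    my $alternation = join '|', map { quotemeta } @{$words_ref};

    # Repeat each substitution so that several consecutive stopwords are removed.
    1 while $string =~ s/^(?:$alternation)\s+//i;
    1 while $string =~ s/\s+(?:$alternation)$//i;
    return $string;
}

print strip_stopwords_from_edges('origins', 'en', 'from the farm of the valley'), "\n";
# prints "farm of the valley"
print strip_stopwords_from_edges('origins', 'en', 'from the farm of'), "\n";
# prints "farm"
```

Stopwords in the middle of the string ("of the" in the first call) are kept, which is the difference from remove_stopwords() noted above.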