title: analytics/mediawiki/content_translation_event description: > Tracks events related to use of Extension:ContentTranslation, which is maintained by the Wikimedia Foundation's Language Engineering team. Some tracked events are user-initiated (like opening the dashboard and starting a new translation), while others are system-intitiated (like warning the user about using too much unmodified machine translation). For further details, see the readme in the schema repository. $id: /analytics/mediawiki/content_translation_event/1.4.0 $schema: https://json-schema.org/draft-07/schema# type: object allOf: - $ref: /fragment/analytics/common/2.0.0# - $ref: /fragment/analytics/web_identifiers/1.0.0# - properties: wiki_db: description: > The database code for the wiki which the user is currently interacting with (and which is shown in the current URL). This is less meaningful than usual because Content Translation presents a single, cross-wiki interface to the user. type: string access_method: description: > The type of client that the user is using. Normally, this field also accepts the value "mobile app", but no mobile app client supports Content Translation. For historical reasons, the values use spaces rather than underscores. type: string enum: - desktop - mobile web content_translation_session_id: description: > A unique, cross-wiki ID for a particular session of Content Translation activity. Expires after 15 minutes has passed without the user generating an event in this stream. type: string content_translation_session_position: description: > A zero-indexed counter giving position of this event within the current Content Translation session (as defined by `content_translation_session_id`). type: integer minimum: 0 maxiumum: 9007199254740991 event_type: description: > The type of Content Translation event that occurred. For detailed definitions, see the readme in the schema repository. type: string enum: - invite_show - dashboard_open - dashboard_search - dashboard_discard_suggestion - dashboard_refresh_suggestions - dashboard_translation_start - dashboard_translation_continue - dashboard_translation_discard - editor_open - editor_segment_add - editor_segment_edit - editor_segment_skip - editor_close_warn - editor_close - editor_error - editor_warning_dismiss - editor_warning_generate - editor_warning_withdraw - publish_attempt - publish_failure - publish_success event_subtype: description: > The subtype of Content Translation event that occured. Not all events have a subtype. For detailed definitions, see the readme in the schema repository. type: string enum: - invite_translation_another_section - error_not_extended_confirmed_on_enwiki - error_translation_in_progress_elsewhere - warning_major_change_to_source - warning_template_not_found - warning_template_parameters_not_mapped - warning_too_much_unmodified_MT - publish_attempt_with_warnings - publish_failure_due_to_abuse_filter - publish_failure_due_to_MT_limit - publish_failure_due_to_warnings - publish_as_addition - publish_as_new - publish_overwrite - publish_overwrite_own - publish_to_draft_space - publish_to_user_space event_source: description: > The source of the Content Translation event that occured. Not all events have a source. For detailed definitions, see the readme in the schema repository. type: string enum: # Sources for `dashboard_open` - content_language_selector - contributions_menu - contributions_page - direct - direct_preselect - frequent_languages - invite_new_article_creation - invite_translate_another_section - notification_deletion - notification_milestone - notification_new_recommendations - recent_edit - recent_translation - language_selector_options - followup_after_publishing # navigate back to the dashboard (dashboard open) - return_from_search - return_from_confirmation - return_from_section_selection - editor_close # Sources for `dashboard_translation_start` - continue_published # - direct_preselect (already included) - for_later # - invite_translate_another_section (already included) - suggestion_featured - suggestion_nearby - suggestion_no_seed - suggestion_recent_edit - suggestion_topic_area - search_result # Sources for `editor_open` - dashboard_translation_continue - dashboard_translation_start # - content_language_selector (already included) # - direct (already included) # - notification_deletion (already included) translation_type: description: > The type of translation or suggestion the user is interacting with. Applies only to events which relate to a specific translation or suggestion, including all `editor_` and `publish_` events and some `dashboard_` events (e.g. `dashboard_start_translation`, `dashboard_delete_translation`). `lead_section` means a translation of the first section of an article, which will result in the creation of a new page except in the unusual case that the user has chosen to translate a section that already exists (i.e. where translation_target_exists). type: string enum: - article - lead_section - section translation_id: description: > The ID used in the cx_translation MediaWiki table for the translation the user is currently interacting with. Omitted if it has not yet been generated or does not exist (as with section translations). Applies only to events which relate to a specific translation, including all `editor_` and `publish_` events and some `dashboard_` events (e.g. `dashboard_continue_translation`, `dashboard_delete_translation`). type: integer translation_source_language: description: > The ISO 639 code for the source language of the current translation or, when on the dashboard, of the suggestions. type: string translation_source_title: description: > The title of the source article of the current translation. Applies only to events which relate to a specific translation or suggestion: all `editor_` and `publish_` events and some `dashboard_` events (e.g. `dashboard_start_translation`, `dashboard_delete_translation`) type: string translation_source_section: description: > The title of the source section of the current translation, if it is a section translation. Applies only to events which relate to a specific translation or suggestion: all `editor_` and `publish_` events and some `dashboard_` events (e.g. `dashboard_start_translation`, `dashboard_delete_translation`). type: string translation_target_language: description: > The ISO 639 code for the target language of the the current translation or, when on the dashboard, of the suggestions. type: string translation_target_title: description: > The title of the page in which the current translation has been or will be published. Applies only to events which relate to a specific translation: all `editor_` and `publish_` events and some `dashboard_` events (e.g. `dashboard_continue_translation`, `dashboard_delete_translation`). Omitted if the user has not yet translated the title. type: string translation_target_section: description: > The title of the section in which the current translation has been or will be published, if it is a section translation. Applies only to events which relate to a specific translation or suggestion: all `editor_` and `publish_` events and some `dashboard_` events (e.g. `dashboard_start_translation`, `dashboard_delete_translation`). Omitted if the user has not yet translated the section title. type: string translation_target_exists: description: > Whether the target article or section already exists. Applies only to events which relate to a specific translation or suggestion: all `editor_` and `publish_` events and some `dashboard_` events (e.g. `dashboard_start_translation`, `dashboard_delete_translation`). In section translation, if the user discards the mapping to an existing target section, the value should change to false in future events. type: boolean translation_provider: description: > The translation engine that is supplying the base text for new segments when the event occurs. When the editor is first opened, this will be the language default, but can change if the user chooses a different provider for a particular chunk or sets a new default. `blank` and `source` are not real translation engines; `blank` means the user is starting with a blank segment and `source` means the user is starting with the untranslated source text. Applies only to `editor_` and `publish_` events. type: string enum: - apertium - blank - google - lingocloud - matxin - opusmt - source - yandex - youdao human_modification_rate: description: > The proportion of the document or segment which has been manually changed by the user from the output of the translation provider. Applies to `publish_` events (where it is the proportion of the whole translation) and to `editor_segment_edit` (where it is the proportion of the edited segment). type: number minimum: 0 maximum: 1 human_modification_threshold: description: > The lowest whole-translation human modification rate where the user will be allowed to publish the translation. Different thresholds are configured at different wikis. Applies only to `publish_` events. type: number minimum: 0 maximum: 1 published_page_id: description: > The MediaWiki ID of the page where the published text has been added. Applies only to `publish_success` events. type: integer minimum: 0 maxiumum: 9007199254740991 published_revision_id: description: > The MediaWiki ID of the revision where the published text has been added. This is necessary since multiple section translations can be made to the same page. Applies only to `publish_success` events. type: integer minimum: 0 maxiumum: 9007199254740991 user_name: description: > The name of the user's account. Not applicable to anonymous users. type: string user_is_anonymous: description: > Whether the user is an anonymous (unregistered) user. type: boolean user_global_edit_count: description: > The user’s edit count across all Wikimedia wikis, available from CentralAuth. Not applicable to anonymous users. In the context of Content Translation, the global edit count is much more useful than a single local edit count, since the user will have different edit counts at the source and target wikis. type: integer minimum: 0 maximum: 9007199254740991 user_global_edit_count_bucket: description: > A bucketed version of `user_global_edit_count`. Unlike that field, this one can be retained in the sanitized version, since it provides less information that could be used to reidentify the user. Not applicable to anonymous users. The values are the ones returned by EventLogging's `getUserEditCountBucket` function, which can be used to populate this field. type: string enum: - '0 edits' - '1-4 edits' - '5-99 edits' - '100-999 edits' - '1000+ edits' utm_source: description: > The value of the URL's `utm_source` query parameter, used to identify the site or product that is the source of the traffic (e.g. Wikipedia GapFinder or Facebook). type: string utm_campaign: description: > The value of the URL's `utm_campaign` query parameter, used to identify the campaign responsible for the traffic (e.g. Wikipedia Asian Month 2020). If present, the `campaign` query parameter should also be placed in this field for compatibility with preexisting URLs. type: string required: - wiki_db - access_method - content_translation_session_id - content_translation_session_position - event_type - user_is_anonymous examples: - $schema: $ref: '#/$id' meta: stream: mediawiki.content_translation_event domain: ca.m.wikipedia.org dt: '2020-11-11T00:40:00.649Z' web_session_id: d6ad7a70aaa03c225a6f web_pageview_id: d258b68fcb966aa2218d wiki_db: cawiki access_method: mobile web event_type: dashboard_open event_source: direct content_translation_session_id: cb7716f5ec946f2dec88 content_translation_session_position: 1 translation_type: section translation_source_language: hi translation_target_language: ca user_name: Cronopio user_is_anonymous: false user_global_edit_count: 914 user_global_edit_count_bucket: 100-999 edits utm_campaign: asianmonth2020 - $schema: $ref: '#/$id' meta: stream: mediawiki.content_translation_event domain: ar.wikipedia.org dt: '2020-09-30T00:10:23.704Z' web_session_id: 826be29bfa18ed3ffe29 web_pageview_id: fd274fbf0e9e38183e65 wiki_db: arwiki access_method: desktop event_type: publish_success event_subtype: publish_overwrite content_translation_session_id: 796fee08fc0e594c3895 content_translation_session_position: 603 translation_type: article translation_id: 1245215 translation_source_language: sa translation_source_title: ब्राह्मस्फुटसिद्धान्तः translation_target_language: ar translation_target_title: السندهند translation_target_exists: true translation_provider: google human_modification_rate: 0.7932 human_modification_threshold: 0.2 published_page_id: 529023 published_revision_id: 52129754 user_name: Abu_Rayhan_of_Khwarazm user_is_anonymous: false user_global_edit_count: 123901 user_global_edit_count_bucket: 1000+ edits