{rfName}

License and use

Altmetrics

Grant support

This workwas supported in part by the National Natural Science Foundation of China under Grant 62171353 and Grant 62101409 and in part by the Fundamental Research Funds for the Central Universities under Grant JB190116. The work of Luis Herranz was supported by the Grant PID2021-128178OB-I00(Ministry of Science, Innovation and Universities (MICINN), Spain) and inpart by the Ramon y Cajal under Grant RYC2019-027020-I.

Analysis of institutional authors

Herranz, LuisAuthor
Share
Publications
>
Article

Task-Switchable Pre-Processor for Image Compression for Multiple Machine Vision Tasks

Publicated to:IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY. 34 (7): 6416-6429 - 2024-07-01 34(7), DOI: 10.1109/TCSVT.2023.3348995

Authors: Yang, Mingyi; Yang, Fei; Murn, Luka; Blanch, Marc Gorriz; Sock, Juil; Wan, Shuai; Yang, Fuzheng; Herranz, Luis

Affiliations

BBC Res & Dev, London EC4Y 0DS, England - Author
Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China - Author
Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China - Author
RMIT Univ, Sch Engn, Melbourne, Vic 3001, Australia - Author
Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona 08193, Spain - Author
Xidian Univ, Sch Telecommun Engn, Xian, Peoples R China - Author
See more

Abstract

Visual content is increasingly being processed by machines for various automated content analysis tasks instead of being consumed by humans. Despite the existence of several compression methods tailored for machine tasks, few consider real-world scenarios with multiple tasks. In this paper, we aim to address this gap by proposing a task-switchable pre-processor that optimizes input images specifically for machine consumption prior to encoding by an off-the-shelf codec designed for human consumption. The proposed task-switchable pre-processor adeptly maintains relevant semantic information based on the specific characteristics of different downstream tasks, while effectively suppressing irrelevant information to reduce bitrate. To enhance the processing of semantic information for diverse tasks, we leverage pre-extracted semantic features to modulate the pixel-to-pixel mapping within the pre-processor. By switching between different modulations, multiple tasks can be seamlessly incorporated into the system. Extensive experiments demonstrate the practicality and simplicity of our approach. It significantly reduces the number of parameters required for handling multiple tasks while still delivering impressive performance. Our method showcases the potential to achieve efficient and effective compression for machine vision tasks, supporting the evolving demands of real-world applications.

Keywords
Bit rateCodecsFeature extractionImage codingImage compression for machine visionMachine visionMultiple taskPre-processorSemanticsTask analysis

Quality index

Bibliometric impact. Analysis of the contribution and dissemination channel

The work has been published in the journal IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY due to its progression and the good impact it has achieved in recent years, according to the agency WoS (JCR), it has become a reference in its field. In the year of publication of the work, 2024 there are still no calculated indicators, but in 2023, it was in position 20/353, thus managing to position itself as a Q1 (Primer Cuartil), in the category Engineering, Electrical & Electronic. Notably, the journal is positioned above the 90th percentile.

Independientemente del impacto esperado determinado por el canal de difusión, es importante destacar el impacto real observado de la propia aportación.

Según las diferentes agencias de indexación, el número de citas acumuladas por esta publicación hasta la fecha 2025-05-25:

  • Google Scholar: 1
  • Scopus: 4
Impact and social visibility

From the perspective of influence or social adoption, and based on metrics associated with mentions and interactions provided by agencies specializing in calculating the so-called "Alternative or Social Metrics," we can highlight as of 2025-05-25:

  • The use of this contribution in bookmarks, code forks, additions to favorite lists for recurrent reading, as well as general views, indicates that someone is using the publication as a basis for their current work. This may be a notable indicator of future more formal and academic citations. This claim is supported by the result of the "Capture" indicator, which yields a total of: 4 (PlumX).

With a more dissemination-oriented intent and targeting more general audiences, we can observe other more global scores such as:

    Leadership analysis of institutional authors

    This work has been carried out with international collaboration, specifically with researchers from: Australia; China; United Kingdom.

    There is a significant leadership presence as some of the institution’s authors appear as the first or last signer, detailed as follows: Last Author (HERRANZ ARRIBAS, LUIS).