Vis enkel innførsel

dc.contributor.authorSolberg, Lars Erik
dc.contributor.authorDahl, Tobias Gulden
dc.contributor.authorNæs, Tormod
dc.date.accessioned2022-01-18T10:14:37Z
dc.date.available2022-01-18T10:14:37Z
dc.date.created2021-11-17T13:53:44Z
dc.date.issued2021
dc.identifier.citationJournal of Chemometrics. 2021, 1-24.
dc.identifier.issn0886-9383
dc.identifier.urihttps://hdl.handle.net/11250/2837865
dc.description.abstractMultiblock analysis attacks the problem of how to combine data from various data sources for purposes such as prediction, classification, clustering, or visual data analysis. A key concept is the distinction between “common” and “distinct” parts, that is, what information repeats itself across the blocks and what is unique to an individual block. The statistical field of multiblock analysis holds many different approaches, which leads to different treatments both of the terms distinct and common themselves and to differences in the numerical results. In this article, we extend the discussion of distinct and common in multiblock analysis to the domain of distance matrices, that is, the situation where data point sets, so-called configurations, are analyzed via relative distances either because configurations are not available directly or because a distance representation is favorable. Situations typical for chemometrics will be highlighted and illustrated in examples. When analyzing different methods, we have focused on three key aspects. First, during the transition from the distance to configuration domains, one needs to consider how multiple distance matrices are treated. Second, when extracting common and distinct parts, one needs to manage a tradeoff between explaining variance and ensuring similarity between subspaces. Third, there is a design choice to be made as to whether the subspace containing the common parts is “shared” between blocks or if separate subspaces are associated with each individual block. The three aspects help to categorize and explain well-known methods in the field. A selection of methods was analyzed and subsequently applied to examples.
dc.language.isoeng
dc.titleMaking sense of multiple distance matrices through common and distinct components
dc.typePeer reviewed
dc.typeJournal article
dc.description.versionpublishedVersion
dc.source.pagenumber1-24
dc.source.journalJournal of Chemometrics
dc.identifier.doi10.1002/cem.3372
dc.identifier.cristin1955569
dc.relation.projectNofima AS: 201702
dc.relation.projectNorges forskningsråd: 262308
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel