<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="en"
    xmlns:mml="http://www.w3.org/1998/Math/MathML"
    xmlns:xlink="http://www.w3.org/1999/xlink"
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
    <processing-meta tagset-family="jats" base-tagset="publishing" mathml-version="2.0" table-model="xhtml"/>
    <front>
                        
                        <journal-meta>
            <issn>1732-3916</issn>
                                </journal-meta>
        <article-meta>
            <title-group>
                                    <article-title>Mixture of Metrics Optimization for Machine Learning Problems</article-title>
                            </title-group>

                        <contrib-group>
                                                            <contrib contrib-type="author" corresp="yes">
                            <name>
                                <surname>Wiercioch</surname>
                                <given-names>Magdalena</given-names>
                            </name>
                            <role>author</role>
                                                                                                                                    <xref ref-type="aff" rid="aff-1"/>
                                                                                        <xref ref-type="corresp" rid="cor-1"/>
                        </contrib>
                                            <contrib contrib-type="author" corresp="yes">
                            <name>
                                <surname>Śmieja</surname>
                                <given-names>Marek</given-names>
                            </name>
                            <role>author</role>
                                                                                                                                    <xref ref-type="aff" rid="aff-2"/>
                                                                                        <xref ref-type="corresp" rid="cor-2"/>
                        </contrib>
                                                </contrib-group>

                                                                                        <aff id="aff-1">
                    <institution-wrap>
                        <institution>Uniwersytet Jagielloński w Krakowie, Polska, ul. Gołębia 24, 31-007 Kraków</institution>
                                            </institution-wrap>
                </aff>
                                                                        
            <author-notes>
                                    <corresp id="cor-1">Correspondence to: Magdalena Wiercioch <email>magdalena.wiercioch@ii.uj.edu.pl</email></corresp>
                                    <corresp id="cor-2">Correspondence to: Marek Śmieja <email>marek.smiejag@ii.uj.edu.pl</email></corresp>
                            </author-notes>

                            <pub-date date-type="pub" publication-format="electronic" iso-8601-date="2016-04-11">
                    <day>11</day>
                    <month>04</month>
                    <year>2016</year>
                </pub-date>
            
            <volume>Volume 24</volume>
            <issue>2015</issue>
                        <fpage>83</fpage>
                                    <lpage>92</lpage>
            
            <permissions>
                <copyright-statement>Copyright &#x00A9; 2016</copyright-statement>
                                    <copyright-year>2016</copyright-year>
                            </permissions>

            <funding-group specific-use="Crossref">
                <funding-statement></funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        &lt;p style=&quot;text-align: left;&quot;&gt;The selection of data representation and metric for a given data set is one of the most crucial problems in machine learning since it affects the results of classification and clustering methods. In this paper we investigate how to combine a various data representations and metrics into a single function which better reflects the relationships between data set elements than a single representation-metric pair. Our approach relies on optimizing a linear combination of selected distance measures with use of least square approximation. The application of our method for classification and clustering of chemical compounds seems to increase the accuracy of these methods.&lt;/p&gt;
    </body>
    <back>
                    <ref-list>
                                                                                <ref id="B1">
                            <label>1</label>
                            <article-title>[1] Aczel A., Sounderpandian J., Complete Business Statistics. McGraw Hill, New York 2009.</article-title>
                        </ref>
                                                                                                    <ref id="B2">
                            <label>2</label>
                            <article-title>[2] Atkeson C., Moore A., Schaal S., Locally weighted learning. Artiﬁcial Intelligence Review, 1997, 11, pp. 11–73.</article-title>
                        </ref>
                                                                                                    <ref id="B3">
                            <label>3</label>
                            <article-title>[3] Bar-Hillel A., Hertz T., Shental N., Weinshall D., Learning a Mahalanobis metric from equivalence constraints. Journal of Machine Learning Research, 2005, 6, pp. 937–965.</article-title>
                        </ref>
                                                                                                    <ref id="B4">
                            <label>4</label>
                            <article-title>[4] Cover T., Hart P., Nearest Neighbor Pattern Classiﬁcation. IEEE Transactions on Information Theory, 1967, 13, pp. 21–27.</article-title>
                        </ref>
                                                                                                    <ref id="B5">
                            <label>5</label>
                            <article-title>[5] Cox T.F., Cox M.A.A., Multidimensional Scaling. Chapman and Hall, London 1994.</article-title>
                        </ref>
                                                                                                    <ref id="B6">
                            <label>6</label>
                            <article-title>[6] Deng Z., Chuaqui C., Singh J., Knowledge-based design of target-focused libraries using protein-ligand interaction constraints. Journal of Medicinal Chemistry, 2006, 49(2), pp. 490–500.</article-title>
                        </ref>
                                                                                                    <ref id="B7">
                            <label>7</label>
                            <article-title>[7] Domeniconi C., Gunopulos D., Adaptive nearest neighbor classiﬁcation using support vector machines. Advances in Neural Information Processing Systems, 2002, 14, pp. 665–672.</article-title>
                        </ref>
                                                                                                    <ref id="B8">
                            <label>8</label>
                            <article-title>[8] Geppert H., Vogt M., Bajorath J., Current Trends in Ligand-Based Virtual Screening: Molecular Representations, Data Mining Methods, New Application Areas, and Performance Evaluation. Journal of Chemical Information and Modeling, 2010, 50, pp. 205–216.</article-title>
                        </ref>
                                                                                                    <ref id="B9">
                            <label>9</label>
                            <article-title>[9] Goldberger J., Roweis S., Hinton G., Salakhutdinov R., Neighbourhood Components Analysis. Advances in Neural Information Processing Systems, 2004, 17, pp. 513–520.</article-title>
                        </ref>
                                                                                                    <ref id="B10">
                            <label>10</label>
                            <article-title>[10] Hastie T., Tibshirani R., Discriminant Adaptive Nearest Neighbor Classiﬁcation. IEEE Trans. Pattern Anal. Mach. Intell., 1996, 18, pp. 607–616.</article-title>
                        </ref>
                                                                                                    <ref id="B11">
                            <label>11</label>
                            <article-title>[11] Hubert L., Arabie P., Comparing partitions. Journal of Classiﬁcation, 1985, 2, pp. 193–218.</article-title>
                        </ref>
                                                                                                    <ref id="B12">
                            <label>12</label>
                            <article-title>[12] Jaakkola T.S., Haussler D., Exploiting Generative Models in Discriminative Classiﬁers. Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems II, 1999, pp. 487–493.</article-title>
                        </ref>
                                                                                                    <ref id="B13">
                            <label>13</label>
                            <article-title>[13] Kedem D., Tyree S., Weinberger K.Q., Sha F., Lanckriet G., Non-linear Metric Learning. Advances in Neural Information Processing Systems, 2012, 25, pp. 2582– 2590. Available via http://books.nips.cc/papers/ﬁles/nips25/NIPS20121223.pdf.  </article-title>
                        </ref>
                                                                                                    <ref id="B14">
                            <label>14</label>
                            <article-title>[14] Klekota J., Roth F.P., Chemical Substructures That Enrich for Biological Activity. Bioinformatics 2008, 21, pp. 2518–2525.</article-title>
                        </ref>
                                                                                                    <ref id="B15">
                            <label>15</label>
                            <article-title>[15] Kohavi R., A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. Proceedings of the 14th International Joint Conference on Artiﬁcial Intelligence (IJCAI ’95), 1995, pp. 1137–1143.</article-title>
                        </ref>
                                                                                                    <ref id="B16">
                            <label>16</label>
                            <article-title>[16] Lloyd S., Least Squares Quantization in PCM. IEEE Trans. Inf. Theor., 1982, 28, pp. 129–137.</article-title>
                        </ref>
                                                                                                    <ref id="B17">
                            <label>17</label>
                            <article-title>[17] Roweis S.T., Saul L.K., Nonlinear dimensionality reduction by locally linear embedding. Science, 2000, 290, pp. 2323–2326.</article-title>
                        </ref>
                                                                                                    <ref id="B18">
                            <label>18</label>
                            <article-title>[18] Scholkopf B., Smola A.J., Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge, MA, 2001.</article-title>
                        </ref>
                                                                                                    <ref id="B19">
                            <label>19</label>
                            <article-title>[19] Shalev-Shwartz S., Singer Y., Ng A.Y., Online and Batch Learning of Pseudometrics. Proceedings of the Twenty-ﬁrst International Conference on Machine Learning (ICML ’04), 2004, pp. 743–750.</article-title>
                        </ref>
                                                                                                    <ref id="B20">
                            <label>20</label>
                            <article-title>[20] Shental N., Hertz T., Weinshall D., Pavel M., Adjustment Learning and Relevant Component Analysis. Proceedings of the 7th European Conference on Computer Vision-Part IV (ECCV ’02), 2002, pp. 776–792. [21] ´Smieja M., Warszycki D., Tabor J., Bojarski A.J., Asymmetric Clustering Index in a Case Study of 5-HT1A Receptor Ligands. PloS ONE 9(7): e102069, doi:10.1371/journal.pone.0102069, 2014.</article-title>
                        </ref>
                                                                                                    <ref id="B21">
                            <label>21</label>
                            <article-title>[22] Sneath P.H.A., The Application of Computers to Taxonomy. J. Gen. Microbiol., 1957, 17, pp. 201–226.</article-title>
                        </ref>
                                                                                                    <ref id="B22">
                            <label>22</label>
                            <article-title>[23] Takeda H., Farsiu S. and Milanfar P., Robust kernel regression for restoration and reconstruction of images from sparse noisy data. IEEE International Conference on Image Processing, 2006, pp. 1257–1260.</article-title>
                        </ref>
                                                                                                    <ref id="B23">
                            <label>23</label>
                            <article-title>[24] Xing E.P., Ng A.Y., Jordan M.I., Russell S., Distance Metric Learning, With Application To Clustering With Side-Information,. Advances in Neural Information Processing Systems, 2003, 15, pp. 505–512.</article-title>
                        </ref>
                                                                                                    <ref id="B24">
                            <label>24</label>
                            <article-title>[25] Warszycki D., Mordalski S., Kristiansen K., Kafel R., Sylte I., Chilmonczyk, Z., Bojarski A. J., A Linear Combination of Pharmacophore Hypotheses as a New Tool in Search of New Active Compounds An Application for 5-HT1A Receptor Ligands. PloS ONE 8(12): e84510, doi:10.1371/journal.pone.0084510, 2013.</article-title>
                        </ref>
                                                                                                    <ref id="B25">
                            <label>25</label>
                            <article-title>[26] Weinberger K.Q., Saul L.K., Distance Metric Learning for Large Margin Nearest Neighbor Classiﬁcation. J. Mach. Learn. Res., 2009, 10, pp. 207–244.</article-title>
                        </ref>
                                                                                                    <ref id="B26">
                            <label>26</label>
                            <article-title>[27] Weinberger K.Q., Saul L.K., Fast solvers and eﬃcient implementations for distance metric learning. ACM International Conference Proceeding Series, 2008, 307, pp. 1160–1167.</article-title>
                        </ref>
                                                </ref-list>
            </back>
</article>
