iLoc-Animal: a multi-label learning classifier for predicting subcellular localization of animal proteins. Predicting protein subcellular localization is a challenging problem, particularly when query proteins have multi-label features meaning that they may simultaneously exist at, or move between, two or more different subcellular location sites. Most of the existing methods can only be used to deal with the single-label proteins. Actually, multi-label proteins should not be ignored because they usually bear some special function worthy of in-depth studies. By introducing the ”multi-label learning” approach, a new predictor, called iLoc-Animal, has been developed that can be used to deal with the systems containing both single- and multi-label animal (metazoan except human) proteins. Meanwhile, to measure the prediction quality of a multi-label system in a rigorous way, five indices were introduced; they are ”Absolute-True”, ”Absolute-False” (or Hamming-Loss”), ”Accuracy”, ”Precision”, and ”Recall”. As a demonstration, the jackknife cross-validation was performed with iLoc-Animal on a benchmark dataset of animal proteins classified into the following 20 location sites: (1) acrosome, (2) cell membrane, (3) centriole, (4) centrosome, (5) cell cortex, (6) cytoplasm, (7) cytoskeleton, (8) endoplasmic reticulum, (9) endosome, (10) extracellular, (11) Golgi apparatus, (12) lysosome, (13) mitochondrion, (14) melanosome, (15) microsome, (16) nucleus, (17) peroxisome, (18) plasma membrane, (19) spindle, and (20) synapse, where many proteins belong to two or more locations. For such a complicated system, the outcomes achieved by iLoc-Animal for all the aforementioned five indices were quite encouraging, indicating that the predictor may become a useful tool in this area. It has not escaped our notice that the multi-label approach and the rigorous measurement metrics can also be used to investigate many other multi-label problems in molecular biology. As a user-friendly web-server, iLoc-Animal is freely accessible to the public at the web-site .

References in zbMATH (referenced in 20 articles )

Showing results 1 to 20 of 20.
Sorted by year (citations)

  1. Butt, Ahmad Hassan; Rasool, Nouman; Khan, Yaser Daanial: Prediction of antioxidant proteins by incorporating statistical moments based features into Chou’s PseAAC (2019)
  2. Hussain, Waqar; Khan, Yaser Daanial; Rasool, Nouman; Khan, Sher Afzal; Chou, Kuo-Chen: SPrenylC-PseAAC: a sequence-based model developed via Chou’s 5-steps rule and general PseAAC for identifying S-prenylation sites in proteins (2019)
  3. Shen, Yinan; Tang, Jijun; Guo, Fei: Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou’s general PseAAC (2019)
  4. Tian, Baoguang; Wu, Xue; Chen, Cheng; Qiu, Wenying; Ma, Qin; Yu, Bin: Predicting protein-protein interactions by fusing various Chou’s pseudo components and using wavelet denoising approach (2019)
  5. Cheng, Xiang; Xiao, Xuan; Chou, Kuo-Chen: pLoc_bal-mGneg: predict subcellular localization of Gram-negative bacterial proteins by quasi-balancing training dataset and general PseAAC (2018)
  6. Mei, Juan; Fu, Yi; Zhao, Ji: Analysis and prediction of ion channel inhibitors by using feature selection and Chou’s general pseudo amino acid composition (2018)
  7. Zhang, Shengli; Duan, Xin: Prediction of protein subcellular localization with oversampling approach and Chou’s general PseAAC (2018)
  8. Jia, Jianhua; Liu, Zi; Xiao, Xuan; Liu, Bingxiang; Chou, Kuo-Chen: pSuc-Lys: predict lysine succinylation sites in proteins with PseAAC and ensemble random forest approach (2016)
  9. Jiao, Ya-Sen; Du, Pu-Feng: Prediction of Golgi-resident protein types using general form of Chou’s pseudo-amino acid compositions: approaches with minimal redundancy maximal relevance feature selection (2016)
  10. Jiao, Ya-Sen; Du, Pu-Feng: Predicting Golgi-resident protein types using pseudo amino acid compositions: approaches with positional specific physicochemical properties (2016)
  11. Golzari, Fahimeh; Jalili, Saeed: VR-BFDT: a variance reduction based binary fuzzy decision tree induction method for protein function prediction (2015)
  12. Kou, Gaoshan; Feng, Yonge: Identify five kinds of simple super-secondary structures with quadratic discriminant algorithm based on the chemical shifts (2015)
  13. Han, Guo-Sheng; Yu, Zu-Guo; Anh, Vo: A two-stage SVM method to predict membrane protein types by incorporating amino acid classifications and physicochemical properties into a general form of Chou’s PseAAC (2014)
  14. Lyons, James; Biswas, Neela; Sharma, Alok; Dehzangi, Abdollah; Paliwal, Kuldip K.: Protein fold recognition by alignment of amino acid residues using kernelized dynamic time warping (2014)
  15. Tahir, Muhammad; Khan, Asifullah; Kaya, Hüseyin: Protein subcellular localization in human and hamster cell lines: employing local ternary patterns of fluorescence microscopy images (2014)
  16. Feng, Peng-Mian; Ding, Hui; Chen, Wei; Lin, Hao: Naïve Bayes classifier with feature selection to identify phage virion proteins (2013)
  17. Huang, Chao; Yuan, Jing-Qi: Predicting protein subchloroplast locations with both single and multiple sites via three different modes of Chou’s pseudo amino acid compositions (2013)
  18. Pan, Lu-Lu; Wang, Yu; Hu, Jian-Hui; Ding, Zhao-Tang; Li, Chen: Analysis of codon use features of stearoyl-acyl carrier protein desaturase gene in \textitCamelliasinensis (2013)
  19. Rahimi, Amir; Madadkar-Sobhani, Armin; Touserkani, Rouzbeh; Goliaei, Bahram: Efficacy of function specific 3D-motifs in enzyme classification according to their EC-numbers (2013)
  20. Xiao, Xuan; Min, Jian-Liang; Wang, Pu; Chou, Kuo-Chen: iCDI-PseFpt: identify the channel-drug interaction in cellular networking with PseAAC and molecular fingerprints (2013)