期刊文献+

Threshold Selection Study on Fisher Discriminant Analysis Used in Exon Prediction for Unbalanced Data Sets

Threshold Selection Study on Fisher Discriminant Analysis Used in Exon Prediction for Unbalanced Data Sets
在线阅读 下载PDF
导出
摘要 In gene prediction, the Fisher discriminant analysis (FDA) is used to separate protein coding region (exon) from non-coding regions (intron). Usually, the positive data set and the negative data set are of the same size if the number of the data is big enough. But for some situations the data are not sufficient or not equal, the threshold used in FDA may have important influence on prediction results. This paper presents a study on the selection of the threshold. The eigen value of each exon/intron sequence is computed using the Z-curve method with 69 variables. The experiments results suggest that the size and the standard deviation of the data sets and the threshold are the three key elements to be taken into consideration to improve the prediction results. In gene prediction, the Fisher discriminant analysis (FDA) is used to separate protein coding region (exon) from non-coding regions (intron). Usually, the positive data set and the negative data set are of the same size if the number of the data is big enough. But for some situations the data are not sufficient or not equal, the threshold used in FDA may have important influence on prediction results. This paper presents a study on the selection of the threshold. The eigen value of each exon/intron sequence is computed using the Z-curve method with 69 variables. The experiments results suggest that the size and the standard deviation of the data sets and the threshold are the three key elements to be taken into consideration to improve the prediction results.
出处 《Communications and Network》 2013年第3期601-605,共5页 通讯与网络(英文)
关键词 FISHER DISCRIMINANT Analysis THRESHOLD Selection Gene PREDICTION Z-Curve Size of Data Set Fisher Discriminant Analysis Threshold Selection Gene Prediction Z-Curve Size of Data Set
  • 相关文献

参考文献2

二级参考文献17

  • 1[1]HSU C W,LIN C J.A comparison of methods for multiclass support vector machines[J].IEEE Transactions on Neural Networks,2002,13(2):415-425.
  • 2[2]KIJSIRKUL B,USSIVAKUL N.Multiclass support vector machines using adaptive directed acyclic graph[A].Proceedings of the 2002 International Joint Conference on Neural Networks[C].Honolulu,HI,USA,2002,1(5):980-985.
  • 3[3]PHETKAEW T,KIJSIRIKUL B,RIVEPIBOON W.Reordering adaptive directed acyclic graphs:an improved algorithm for multiclass support vector machines[A].Proceedings of the 2003 International Joint Conference on Neural Networks[C].Portland,OR,USA,2003.
  • 4[4]TIAN X,DENG F Q.An improved multi-class SVM algorithm and its application to the credit scoring model[A].Proceedings of the fifth World Congress on Intelligent Control and Automation[C].Hangzhou,China,2004.
  • 5[5]ANGUITA D,RIDELLA S,STERPI D.A new method for multiclass support vector machines[A].Proceedings of the 2004 International Joint Conference on Neural Networks[C].Budapest,Hungary,2004.
  • 6[8]TAKAHASHI F.Decision-tree-based multi-class support vector machines[A].Proceedings of the 9th International Conference on Neural Information Processing[C].Orchid Country Club,Singapore,2002.
  • 7[9]PLATT J,CRISTIANINI N,SHAWE-TAYLOR J.Large margin DAGs for multiclass classification[A].Proceedings of Neural Information Processing Systems[C].[s.l.],Cambridge,2000.
  • 8[10]PAUL G.Cox,Reza Adhami.Multi-class support vector machine classifier applied to hyper-spectral data[A].Proceedings of the Thirty-Fourth Southeastern Symposium on System Theory[C].Huntsville,Alabama,2002.
  • 9[11]FRANC V,HLAVAC V.Multi-class support vector machine[A].Proceedings of the 16th International Conference on Pattern Recognition[C].Quebec City,Canada,2002.
  • 10[12]LIN Chunfu,WAMG Shengde.Fuzzy support vector machines[J].IEEE Transactions on Neural Networks,2002,13(2):464-471.

共引文献61

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部