Gene ANIA_03853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagANIA_03853 
Symbol 
ID
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameAspergillus nidulans FGSC A4 
KingdomEukaryota 
Replicon accessionBN001302 
Strand
Start bp2808499 
End bp2811716 
Gene Length3218 bp 
Protein Length1049 aa 
Translation table 
GC content51% 
IMG OID 
ProductMitochondrial presequence protease Precursor (EC 3.4.24.-) [Source:UniProtKB/Swiss-Prot;Acc:Q5B6H7] 
Protein accessionCBF75238 
Protein GI259481583 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.901124 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGGT CATACTTGCA CCTAGGCAGG CATCGAACGC CGGCTTTTCG CCAGCCACTC 
GGACGATTGC TAAGACCAAC CGCGTCGATA CTGCAGTATG CTCAGTCTCG GACGCTCGCT
TCAGTATCTA GCTTGGAGAG CTTGCCGGAG GTGGGAGACC AGCTCCATGG GTTTACAGTT
CAGGAAAAGA AACAAGTGCC GGAATTGCAC CTTACAGCTA TCCGTTTAAG GCATGACAAA
ACCCACGCGG ATTACCTGCA TATCGCACGA GAAGACAAGA ACAATGTTTT CGGCATCGGT
TTCAAGACAA ACCCTCCGGA CGCCACAGGT GTGCCCCATA TTCTGGAGCA TACCACTCTC
TGCGGCAGCG AGAAGTAAGT CGCTTCATTT CAGGAAATCT GCGCCAGAAA TGACTTGATG
CTGACCTTTT GGCCTTTTTT AGATATCCTA TCCGCGATCC GTTTTTCAAA ATGTTACCGC
GGTCCCTCTC AAATTTCATG AACGCTTTTA CTTCGTCCGA CCACACTATG TATCCGTTTG
CCACAACGAA TCAACAAGAT TTTCAGAACC TCTTGTCGGT CTACCTAGAC GCCACTATGC
ATCCTCTGCT TAAAGAGGAA GACTTTAGGC AGGAAGGATG GCGTTTGGGA CCTGAGGATC
CCCGTGCAAT CCAAACTCAG GAGGGAAATC TGAAGCCTGA AGACATTCTG TTTAAGGGTG
TTGTGTACAA CGAAATGAAG GGTCAGATGT CAGATGCGAA CTACTTATAT TGGATTAGGT
TCCAGGAAAG CATCTTTCCG GCCATCAATA ACTCCGGAGG AGACCCCCAA CATATCACGG
ATCTGACGCA CAAGCAACTC GTGGAGTTTT CCAAGAAGAA CTACAATCCC AGTAATGCTA
AGATCATCAC TTACGGCGAC ATGCCTTTGG CCGATCATCT GAAGCAGGTC GGTGGCGTCC
TAAACGACTT TTCGAAAGGG GCTGTCGACA CAACGGTGAA GTTGCCCATT GAGCTCCGAG
GCCCCATAAA TGTGACTGTC CCGGGACCAA TCGACACTTT CGTAAGTGAA GACAGACAAT
TCAAAACATC CACTTCTTGG TACATGGGCG ATATCACGGA TACTGTCGAG ACCTTTTCGG
CCGGCATCCT CTCGTCCCTT TTGTTAGATG GTTATGGGTC TCCCATGTAC AAAGCTCTGA
TTGAAAGTGG CTTGGGCTCC TCGTTCACTC CTAACACGGG ACTCGACACT TCTGGTAAAA
TCCCTATTTT TTCGATTGGA GTTACAGGTG TGAGCGAGGA GCAAGCTCCT CGGGTCAAAG
AAGAGATTCA GCGAGTCCTT CAGGAAACTC TTCAGAGGGG CTTCAACGAT GAAAAGGTCC
AAGGATTTCT CCACCAGCTG GAGCTTGCTC TGCGTCATAA GACAGCGAAC TTTGGCCTCG
GTGTTATTCA GAAAACCTTC ACCTCTTGGT TCAACGGCTC CGATCCTATG AAGGAGCTTG
CATGGAACGA GGTTATCAAT GCCTTCAAAA GTAGGTACGA AAAAGGAGGC TACTTAGAGG
CGTTGATGCA AAAGTACCTC ATCAACGATA ACTGCCTGAC TTTTACGATG GTTGGTACCC
CTTCATTTAA CAAGGAATTG GATGATAAGG AAATGGCGCG GAAGGAGAAA AAGTTTGAGC
AATTAACTCA GCAACATGGT TCTGTTGAGA AGGCTGTGAC TGAGCTTGCT AAAGCAGAGC
TGCAGCTGCT TGAAGTTCAG GAAAAGGCCC AGCATGCTGA CCTGAGCTGT TTGCCGTCTT
TGCGTGTGGA GGATATTTCA CGGCAGAAAG AGCACAAGCC AGTCCGCGAA TCTAAGGTCG
AAGGCACAGA TATTGTTTGG CGCGAAGCTC CAACCAATGG ACTGACCTAT TTTCAAGCTG
TGAATGCCTT CGCCGATCTT CCTGATGATC TCCGCCTGCT CCTGCCACTT TTCAACGATG
CAATCATGCG GCTTGGAACA CCAACTAGGA CCATGGAGCA GTGGGAAGAC CTGATAAAAC
TCAAAACGGG TGGTGTCTCG ACCTCGAATT TCCATACTAC ATCGCCAACA GAGATGGGCA
AATACACTGA GGGACTTCAA TTTTCTGGTT TCGCTCTGGA CAAAAATGTC CCCGACATGT
TGGAAATTCT CACAGCTCTC GTTACTGAGA CAGACTTCAC CAGCCCTTCT GCTCCGGCCA
TGATCCAGGA GCTTTTGCGG CTGACTACCA ACGGGGCATT GGATGCTGTT GCAGGAACTG
GTCACCGATA CGCTCTAAAC GCAGCTGCTG CCGGTCTTTC TCGTAGCTTC TGGGCACAAG
AGCAGACCTC AGGTCTAGCA CAGCTGCAAG CTACGGCGAA TCTCCTGCGT GATGCCGAAA
CTTCTCCAGA ACGCCTAGCC GAGCTGATCG AAAAGCTTCG TCTCATTCAA TCATTCGCTA
TCTCGAAGAC ATCTGGTCTT CGTGTTCGCC TCGTCTGCGA ACCGGCCAGC TCGACCCAGA
ATGAATCTGT TCTGCAAAGG TGGGTTACTG GACTGCCTAA GGTCCCGTCG CCCACGTCCC
AACCTCAAAG GTTTGACTTG AGCACACCTT CCAAGAAGGC GTTCTATGAT CTACCTTACA
AGGTTTACTA TTCTGGTCTG GCTTTGCCTA CCGTGCCATT TACACACTCA TCCAGCGCGA
CTCTGAGCGT CCTTTCTCAG CTTCTGACGC ACAATTACCT CCATCCTGAG ATCAGAGAGA
AAGGTGGCGC CTATGGTGCA GGAGCAAGCA ACGGTCCGGT CAAGGGATTA TTCGCATTCA
CCAGCTACCG CGACCCCAAC CCTGCCAACA CCCTCAAGGT CTTCAAGAAC AGTGGGGTCT
TCGCGCGTGA CCGGGCTTGG TCAGACCGTG AAATCAACGA GGCTAAGCTT GGTATCTTCC
AAGGACTCGA TGCCCCCGTT AGCGTTGATG AGGAAGGATC TAGATACTTC CTGAATGGTA
TCACGCATGA GATGGACCAG CGCTGGAGAG AACAGGTCCT AGACGTCACC GCCAAGGACG
TCAACGAGGT TGCGCAGACC TTCCTGGTAG ATGGCACCCG GCGGTCTGTC TGTCTCCTCG
GTGAGAAGAA GGACTGGGCC GAATCTGAGG GCTGGGAGGT CCGCAAGCTC TCTATGAATC
CCAATGGTTC CAACATTCCC TCCGGTGATG CCGCATAA
 
Protein sequence
MLRSYLHLGR HRTPAFRQPL GRLLRPTASI LQYAQSRTLA SVSSLESLPE VGDQLHGFTV 
QEKKQVPELH LTAIRLRHDK THADYLHIAR EDKNNVFGIG FKTNPPDATG VPHILEHTTL
CGSEKYPIRD PFFKMLPRSL SNFMNAFTSS DHTMYPFATT NQQDFQNLLS VYLDATMHPL
LKEEDFRQEG WRLGPEDPRA IQTQEGNLKP EDILFKGVVY NEMKGQMSDA NYLYWIRFQE
SIFPAINNSG GDPQHITDLT HKQLVEFSKK NYNPSNAKII TYGDMPLADH LKQVGGVLND
FSKGAVDTTV KLPIELRGPI NVTVPGPIDT FVSEDRQFKT STSWYMGDIT DTVETFSAGI
LSSLLLDGYG SPMYKALIES GLGSSFTPNT GLDTSGKIPI FSIGVTGVSE EQAPRVKEEI
QRVLQETLQR GFNDEKVQGF LHQLELALRH KTANFGLGVI QKTFTSWFNG SDPMKELAWN
EVINAFKSRY EKGGYLEALM QKYLINDNCL TFTMVGTPSF NKELDDKEMA RKEKKFEQLT
QQHGSVEKAV TELAKAELQL LEVQEKAQHA DLSCLPSLRV EDISRQKEHK PVRESKVEGT
DIVWREAPTN GLTYFQAVNA FADLPDDLRL LLPLFNDAIM RLGTPTRTME QWEDLIKLKT
GGVSTSNFHT TSPTEMGKYT EGLQFSGFAL DKNVPDMLEI LTALVTETDF TSPSAPAMIQ
ELLRLTTNGA LDAVAGTGHR YALNAAAAGL SRSFWAQEQT SGLAQLQATA NLLRDAETSP
ERLAELIEKL RLIQSFAISK TSGLRVRLVC EPASSTQNES VLQRWVTGLP KVPSPTSQPQ
RFDLSTPSKK AFYDLPYKVY YSGLALPTVP FTHSSSATLS VLSQLLTHNY LHPEIREKGG
AYGAGASNGP VKGLFAFTSY RDPNPANTLK VFKNSGVFAR DRAWSDREIN EAKLGIFQGL
DAPVSVDEEG SRYFLNGITH EMDQRWREQV LDVTAKDVNE VAQTFLVDGT RRSVCLLGEK
KDWAESEGWE VRKLSMNPNG SNIPSGDAA