Gene Information |
Locus tag | ANIA_03853 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001302 |
Strand | + |
Start bp | 2808499 |
End bp | 2811716 |
Gene Length | 3218 bp |
Protein Length | 1049 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | Mitochondrial presequence protease Precursor (EC 3.4.24.-) [Source:UniProtKB/Swiss-Prot;Acc:Q5B6H7] |
Protein accession | CBF75238 |
Protein GI | 259481583 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
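The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the reported 3218 bp), and that the gene span runs from the start codon through the stop codon with no untranslated regions. The function names `feature_length` and `non_coding_bp` are illustrative and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def non_coding_bp(gene_len_bp: int, protein_len_aa: int) -> int:
    """Bases inside the gene span that are not codons.

    Assumes the span covers start codon through stop codon, so the coding
    part is (protein length + 1 stop codon) * 3 bases.
    """
    coding_bp = (protein_len_aa + 1) * 3
    return gene_len_bp - coding_bp


gene_len = feature_length(2808499, 2811716)
print(gene_len)                       # 3218, matching "Gene Length"
print(non_coding_bp(gene_len, 1049))  # 68
```

Under these assumptions, the 68 bp left over is consistent with the record marking the gene as spliced: that many bases within the span would be intronic rather than coding.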
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.901124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCGGT CATACTTGCA CCTAGGCAGG CATCGAACGC CGGCTTTTCG CCAGCCACTC GGACGATTGC TAAGACCAAC CGCGTCGATA CTGCAGTATG CTCAGTCTCG GACGCTCGCT TCAGTATCTA GCTTGGAGAG CTTGCCGGAG GTGGGAGACC AGCTCCATGG GTTTACAGTT CAGGAAAAGA AACAAGTGCC GGAATTGCAC CTTACAGCTA TCCGTTTAAG GCATGACAAA ACCCACGCGG ATTACCTGCA TATCGCACGA GAAGACAAGA ACAATGTTTT CGGCATCGGT TTCAAGACAA ACCCTCCGGA CGCCACAGGT GTGCCCCATA TTCTGGAGCA TACCACTCTC TGCGGCAGCG AGAAGTAAGT CGCTTCATTT CAGGAAATCT GCGCCAGAAA TGACTTGATG CTGACCTTTT GGCCTTTTTT AGATATCCTA TCCGCGATCC GTTTTTCAAA ATGTTACCGC GGTCCCTCTC AAATTTCATG AACGCTTTTA CTTCGTCCGA CCACACTATG TATCCGTTTG CCACAACGAA TCAACAAGAT TTTCAGAACC TCTTGTCGGT CTACCTAGAC GCCACTATGC ATCCTCTGCT TAAAGAGGAA GACTTTAGGC AGGAAGGATG GCGTTTGGGA CCTGAGGATC CCCGTGCAAT CCAAACTCAG GAGGGAAATC TGAAGCCTGA AGACATTCTG TTTAAGGGTG TTGTGTACAA CGAAATGAAG GGTCAGATGT CAGATGCGAA CTACTTATAT TGGATTAGGT TCCAGGAAAG CATCTTTCCG GCCATCAATA ACTCCGGAGG AGACCCCCAA CATATCACGG ATCTGACGCA CAAGCAACTC GTGGAGTTTT CCAAGAAGAA CTACAATCCC AGTAATGCTA AGATCATCAC TTACGGCGAC ATGCCTTTGG CCGATCATCT GAAGCAGGTC GGTGGCGTCC TAAACGACTT TTCGAAAGGG GCTGTCGACA CAACGGTGAA GTTGCCCATT GAGCTCCGAG GCCCCATAAA TGTGACTGTC CCGGGACCAA TCGACACTTT CGTAAGTGAA GACAGACAAT TCAAAACATC CACTTCTTGG TACATGGGCG ATATCACGGA TACTGTCGAG ACCTTTTCGG CCGGCATCCT CTCGTCCCTT TTGTTAGATG GTTATGGGTC TCCCATGTAC AAAGCTCTGA TTGAAAGTGG CTTGGGCTCC TCGTTCACTC CTAACACGGG ACTCGACACT TCTGGTAAAA TCCCTATTTT TTCGATTGGA GTTACAGGTG TGAGCGAGGA GCAAGCTCCT CGGGTCAAAG AAGAGATTCA GCGAGTCCTT CAGGAAACTC TTCAGAGGGG CTTCAACGAT GAAAAGGTCC AAGGATTTCT CCACCAGCTG GAGCTTGCTC TGCGTCATAA GACAGCGAAC TTTGGCCTCG GTGTTATTCA GAAAACCTTC ACCTCTTGGT TCAACGGCTC CGATCCTATG AAGGAGCTTG CATGGAACGA GGTTATCAAT GCCTTCAAAA GTAGGTACGA AAAAGGAGGC TACTTAGAGG CGTTGATGCA AAAGTACCTC ATCAACGATA ACTGCCTGAC TTTTACGATG GTTGGTACCC CTTCATTTAA CAAGGAATTG GATGATAAGG AAATGGCGCG GAAGGAGAAA AAGTTTGAGC AATTAACTCA GCAACATGGT TCTGTTGAGA AGGCTGTGAC TGAGCTTGCT AAAGCAGAGC TGCAGCTGCT TGAAGTTCAG GAAAAGGCCC AGCATGCTGA CCTGAGCTGT TTGCCGTCTT 
TGCGTGTGGA GGATATTTCA CGGCAGAAAG AGCACAAGCC AGTCCGCGAA TCTAAGGTCG AAGGCACAGA TATTGTTTGG CGCGAAGCTC CAACCAATGG ACTGACCTAT TTTCAAGCTG TGAATGCCTT CGCCGATCTT CCTGATGATC TCCGCCTGCT CCTGCCACTT TTCAACGATG CAATCATGCG GCTTGGAACA CCAACTAGGA CCATGGAGCA GTGGGAAGAC CTGATAAAAC TCAAAACGGG TGGTGTCTCG ACCTCGAATT TCCATACTAC ATCGCCAACA GAGATGGGCA AATACACTGA GGGACTTCAA TTTTCTGGTT TCGCTCTGGA CAAAAATGTC CCCGACATGT TGGAAATTCT CACAGCTCTC GTTACTGAGA CAGACTTCAC CAGCCCTTCT GCTCCGGCCA TGATCCAGGA GCTTTTGCGG CTGACTACCA ACGGGGCATT GGATGCTGTT GCAGGAACTG GTCACCGATA CGCTCTAAAC GCAGCTGCTG CCGGTCTTTC TCGTAGCTTC TGGGCACAAG AGCAGACCTC AGGTCTAGCA CAGCTGCAAG CTACGGCGAA TCTCCTGCGT GATGCCGAAA CTTCTCCAGA ACGCCTAGCC GAGCTGATCG AAAAGCTTCG TCTCATTCAA TCATTCGCTA TCTCGAAGAC ATCTGGTCTT CGTGTTCGCC TCGTCTGCGA ACCGGCCAGC TCGACCCAGA ATGAATCTGT TCTGCAAAGG TGGGTTACTG GACTGCCTAA GGTCCCGTCG CCCACGTCCC AACCTCAAAG GTTTGACTTG AGCACACCTT CCAAGAAGGC GTTCTATGAT CTACCTTACA AGGTTTACTA TTCTGGTCTG GCTTTGCCTA CCGTGCCATT TACACACTCA TCCAGCGCGA CTCTGAGCGT CCTTTCTCAG CTTCTGACGC ACAATTACCT CCATCCTGAG ATCAGAGAGA AAGGTGGCGC CTATGGTGCA GGAGCAAGCA ACGGTCCGGT CAAGGGATTA TTCGCATTCA CCAGCTACCG CGACCCCAAC CCTGCCAACA CCCTCAAGGT CTTCAAGAAC AGTGGGGTCT TCGCGCGTGA CCGGGCTTGG TCAGACCGTG AAATCAACGA GGCTAAGCTT GGTATCTTCC AAGGACTCGA TGCCCCCGTT AGCGTTGATG AGGAAGGATC TAGATACTTC CTGAATGGTA TCACGCATGA GATGGACCAG CGCTGGAGAG AACAGGTCCT AGACGTCACC GCCAAGGACG TCAACGAGGT TGCGCAGACC TTCCTGGTAG ATGGCACCCG GCGGTCTGTC TGTCTCCTCG GTGAGAAGAA GGACTGGGCC GAATCTGAGG GCTGGGAGGT CCGCAAGCTC TCTATGAATC CCAATGGTTC CAACATTCCC TCCGGTGATG CCGCATAA
|
Protein sequence | MLRSYLHLGR HRTPAFRQPL GRLLRPTASI LQYAQSRTLA SVSSLESLPE VGDQLHGFTV QEKKQVPELH LTAIRLRHDK THADYLHIAR EDKNNVFGIG FKTNPPDATG VPHILEHTTL CGSEKYPIRD PFFKMLPRSL SNFMNAFTSS DHTMYPFATT NQQDFQNLLS VYLDATMHPL LKEEDFRQEG WRLGPEDPRA IQTQEGNLKP EDILFKGVVY NEMKGQMSDA NYLYWIRFQE SIFPAINNSG GDPQHITDLT HKQLVEFSKK NYNPSNAKII TYGDMPLADH LKQVGGVLND FSKGAVDTTV KLPIELRGPI NVTVPGPIDT FVSEDRQFKT STSWYMGDIT DTVETFSAGI LSSLLLDGYG SPMYKALIES GLGSSFTPNT GLDTSGKIPI FSIGVTGVSE EQAPRVKEEI QRVLQETLQR GFNDEKVQGF LHQLELALRH KTANFGLGVI QKTFTSWFNG SDPMKELAWN EVINAFKSRY EKGGYLEALM QKYLINDNCL TFTMVGTPSF NKELDDKEMA RKEKKFEQLT QQHGSVEKAV TELAKAELQL LEVQEKAQHA DLSCLPSLRV EDISRQKEHK PVRESKVEGT DIVWREAPTN GLTYFQAVNA FADLPDDLRL LLPLFNDAIM RLGTPTRTME QWEDLIKLKT GGVSTSNFHT TSPTEMGKYT EGLQFSGFAL DKNVPDMLEI LTALVTETDF TSPSAPAMIQ ELLRLTTNGA LDAVAGTGHR YALNAAAAGL SRSFWAQEQT SGLAQLQATA NLLRDAETSP ERLAELIEKL RLIQSFAISK TSGLRVRLVC EPASSTQNES VLQRWVTGLP KVPSPTSQPQ RFDLSTPSKK AFYDLPYKVY YSGLALPTVP FTHSSSATLS VLSQLLTHNY LHPEIREKGG AYGAGASNGP VKGLFAFTSY RDPNPANTLK VFKNSGVFAR DRAWSDREIN EAKLGIFQGL DAPVSVDEEG SRYFLNGITH EMDQRWREQV LDVTAKDVNE VAQTFLVDGT RRSVCLLGEK KDWAESEGWE VRKLSMNPNG SNIPSGDAA
|
| |
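The sequences above are displayed in blocks of ten characters separated by spaces. As a minimal sketch, assuming the sequence has been copied out of the record as plain text, the following removes the display spacing and computes length and GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the exact rounding behind the record's "GC content" field is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence shown in this record.
snippet = clean_sequence("ATGCTCCGGT CATACTTGCA")
print(len(snippet))         # 20
print(gc_percent(snippet))  # 50.0
```

Running the same two functions on the full gene sequence should give a length that can be compared with the "Gene Length" field and a percentage that can be compared with the "GC content" field.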