Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ANIA_01210 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001308 |
Strand | + |
Start bp | 1134173 |
End bp | 1137060 |
Gene Length | 2888 bp |
Protein Length | 843 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | M protein repeat protein (AFU_orthologue; AFUA_1G10690) |
Protein accession | CBF87928 |
Protein GI | 259488470 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTTCCCTTC CCTTGTCGTC TGTCGCGCTT CGCTTCTTCT TTTCATGGAT TTTTACTCCG AACCAGTCTC TCGTTCTACC GTGCTCCTGT CTGCGATTGA CGCAAGCAAT CCGAGGCACG GCTTCCCGAA CCCATAGTAA CGCAAGCAGG CAGCCTCGTA TCACCATTGT TAACTGCCGT ACGCTCGACA GACATATGCT GAGGAGCTGA AACCCTTGAA ATTTTGATTT CCCCGACTGC GATTGTACAC TATCTAGAGC CATGTCGCAA GCCAGGTGGA AAGTCGGCTC TTTCCTGCAG CAGGCCGTCG CAGGTGTCGA GTCGCGGCTA GATCAAATGC TGACAGAGGA GGAAGACGCC AAACGGTCGC AACACAAACA GGCGGCCGTG AGAACAAAGT CGGGCGAACA AACTGGTAGT ATGTTGCTGG TTAGCATCAT AATAGCTTGG TTGCTGACTT GATAGACATC TCGCGCAGTT CGTCGAACGC CCGGAAAAAT GACCGACTTC AGGAACGCCT AGCTCGTGCT ATGGCCAAGA ACAACGCTAT GAATACTCCA GATTCCTCCT CCGCCGTTGT ATCTCCTATT AGCAGCCCGG TTCAGAGCAA CGGCGCACGA TCGAGCATGG ACATAGAATC TAGTCTCGGC TCACCTCCCA GGGAAATAAC TCCACTTCCG GATACTGGGT CTGGCTCGCC GGCGGCGGCG CTCTCACGTA TGAGCCATGA CTCTTCATCC TCCCCGCGCG TGTCGTCGGA AGCTGCGGCA CCAACACCTA GCGAGAAAGA TACTTTGGCA TCTGCTCCGG CTTCTGAGGC AGGCGGAGAG ACTGATCCTT CATCTGCACC AAAAGAGCCC GCACCGATTG GTAGTGGCTC TTCTGCGGAG AGGGGTATCA CTCCGGCAAA TGAGGAAGAG AATCCGGATG GACTTCAGCA ATCGGATAAG AAAGCTGTCG AATCCGAATT ACAAGAGGAG ATACACGGAT ATATTGAGCG AATCGATGCC TTACAGTCGA AACTGAAATA CCTAGCCCAG GAAGCCGCAG AGTCGGCACG AAAGGCGGCG GCTACAGCGG AACCTGGAAG TGTTGATAGA CAGCTTCGGG AGAAGGATGA GAGGATCGCG CTTCTGTTGG AGGAGGGACA GAAGCTCTCC AAGACGGAGA TGGACCATCG AACGCTGATC AAGAAGCTCC GACAACAGCT AGCAGAAAAT TCCAAGCTCC AGGCCGAGGC GAAGAAGAAA AATGACCGGC TAGAGAGGGA CTTAGCCAAT GCGGAAGCTA GGGTTAAGCG GGCAGAGGCA GCGGAGAAGA GAGCCACTGG GTCTCTTTCC GCGCAGACAA AGACTGCTCG AGACTTGGAG ACCGTAACTG CTGAACGGAA TGCTTTAAGC CAAACGGTTC AGGAAATGAA GGGACAGCTT GCTCGAGCAG TCTCGAGGGC GGATGCAGCA GAGGCAAAGG CCAACTCCGA TGCCTTGGAG CGGGAAAAGC AGCGTGCCAA TCAGCTGGAA GAGGAACTCT CAAGCGCTAG GATTGAACGC GAAATCAGCG AGGAAAAGCT CAAGAGAGAA ATAGCTGATC TCAAGGAGGC TATTGAACAG GAGAAAGAAA GGGCTCGAGT TCTGGAAGTG GAGCTGAAAG GGGAGCAGTC CGTATTGGAG AGCAAGATGG AGTCTCTACG GTCAAGGGCG GAGGAAGCAT CTTCTGGGGT GGCTGGGGAT GCGCAGGTTA AACTCCTACG TCAAATTGAG ACGTTGCAGA CACAGTACGC CGCTGCCAGC GAGAATTGGC AGGCTTTAGA AGGCTCTCTA CTTTCGCGTC TAGCGAATGT AGAAAAGGAA CGCGACGAGG TCGCGCGGCG GGAGGCTGAG GCACGAAGAA AGATTCGCGA GATGGTGAGT GTGTTCTTTC GCAATCAACC ACAAAAGGCT GACAGGAGGT AGAACCTCAA GGTGAAGCGA CTTGAAGAGG ACCTCGAAAG CGCACAGGAA AACGAGCGCG ACCTCTCAGA TAGAATAGAG GAACGCTCTC AGGAACTACA GAAGGCTGAG CAGAAGCTGA GAAAAGCCAT TGACGAGCTA ACTGCCGCAC AAAATGAAAT GGCTGAGCAG AAGGCCATTA GTGATGCAAC ATGGACACAA AAGCTAGAAG ATGAGCGGGC AAAGTGGCGT GAACAAGCCA TGCGCCCCAT GAATCCCCTC CGGCGCAACG AGTCTCCTGT TTCCTCTCAT CGACCCAGTA TACTGGAAGC TCCCACCTCG CTCTCAGACT ACCGGCCAAC AAGCCGACGC TCGTCCGCTA TACCAGGTGT CATCCCTGAC ATAAACACTC CTCCACGCCA GAATTCCCTT CCGGTTTCAG CCTCTCAATC GGTTTTGTCG CCGATACTTT CAGAGAAGGG TTCACTCCCA ACAGTGCCCG GGTCGCCAAA GCTGCTTGAA CCAGATGAGT TCTTCATCGG CTCGCGGACG CCGTCAGCAT TCGGTGGCAC TGCAACACAC TCGCGGGGCA TCAACGATAT TATTTCTGAA TCCACTGTCG GCGCGGGGCC GTCAGTTCAG CTCGTTGAGC GCATGAGCGC CACCGTACGC CGGCTGGAGA GCGAACGCGC CGCTTCGAAG GACGAAATGG CCCGCATAAC CGCCCAACGT GACGAAGCCC GCGAACAAGT CGTTGAACTG ATGCGAGAGG TGGAGGAGAA GAGAGCATCT GATTCACAGG TGCAAGAGCT ACAGCAGAAA CTCGAAGATC TCGACAGACG ATACGAGACC ACGCTTGAGT TACTTGGTGA GAAGAGCGAG CAGGTTGAGG AGCTTCAAGC CGACATTGCG GATCTCAAGA AGATATATCG CGAGTTGGTA GACAGCACAA TGAAGTGA
|
Protein sequence | MSQARWKVGS FLQQAVAGVE SRLDQMLTEE EDAKRSQHKQ AAVRTKSGEQ TGNISRSSSN ARKNDRLQER LARAMAKNNA MNTPDSSSAV VSPISSPVQS NGARSSMDIE SSLGSPPREI TPLPDTGSGS PAAALSRMSH DSSSSPRVSS EAAAPTPSEK DTLASAPASE AGGETDPSSA PKEPAPIGSG SSAERGITPA NEEENPDGLQ QSDKKAVESE LQEEIHGYIE RIDALQSKLK YLAQEAAESA RKAAATAEPG SVDRQLREKD ERIALLLEEG QKLSKTEMDH RTLIKKLRQQ LAENSKLQAE AKKKNDRLER DLANAEARVK RAEAAEKRAT GSLSAQTKTA RDLETVTAER NALSQTVQEM KGQLARAVSR ADAAEAKANS DALEREKQRA NQLEEELSSA RIEREISEEK LKREIADLKE AIEQEKERAR VLEVELKGEQ SVLESKMESL RSRAEEASSG VAGDAQVKLL RQIETLQTQY AAASENWQAL EGSLLSRLAN VEKERDEVAR REAEARRKIR EMNLKVKRLE EDLESAQENE RDLSDRIEER SQELQKAEQK LRKAIDELTA AQNEMAEQKA ISDATWTQKL EDERAKWREQ AMRPMNPLRR NESPVSSHRP SILEAPTSLS DYRPTSRRSS AIPGVIPDIN TPPRQNSLPV SASQSVLSPI LSEKGSLPTV PGSPKLLEPD EFFIGSRTPS AFGGTATHSR GINDIISEST VGAGPSVQLV ERMSATVRRL ESERAASKDE MARITAQRDE AREQVVELMR EVEEKRASDS QVQELQQKLE DLDRRYETTL ELLGEKSEQV EELQADIADL KKIYRELVDS TMK
|
| |