Gene Information |
Locus tag | ANIA_00421 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Aspergillus nidulans FGSC A4 |
Kingdom | Eukaryota |
Replicon accession | BN001308 |
Strand | - |
Start bp | 3546220 |
End bp | 3549318 |
Gene Length | 3099 bp |
Protein Length | 819 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | Multiple RNA-binding domain-containing protein 1 [Source:UniProtKB/Swiss-Prot;Acc:Q5BGA9] |
Protein accession | CBF89520 |
Protein GI | 259489335 |
COG category | |
COG ID | |
TIGRFAM ID | |
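
The Gene Length and GC content fields above can be cross-checked from the coordinates and the sequence. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported 3099 bp. It also assumes that GC content is the percentage of G and C bases over the whole gene sequence; the record does not say how the 48% figure was computed. The function names gene_length and gc_percent are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(3546220, 3549318))  # 3099
```

Passing the full Gene sequence string to gc_percent gives a value to compare against the reported GC content. The result depends on whether the database computed it over the gene span or over the coding sequence only.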
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.857696 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.9634 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAGCA CAAGAGTGTT TGTCTCTGGC CTCCCCCCAA CATTAACCAA TGATCAACTC AAGAAACACT TTGAAACTCG CTTTCATGTC ACGGATGCTC ATGTCCTGCC AAAACGGAGG ATTGGCTTCG TGGGCTTCAA AAGCTCTGAG GCGGCTCAGC AGGCTGTTTC TTACTTCAAT AAGACATATA TGAGGATGTC CAAGATTTCA GTAGACATTG CTAAGCCGGT ATGTCACTCC TCTCTCTGCT TAATTTGCAC TTCGTTTTTC CCATTGAGCT TGCTGACCAG TCAAGGTGGT CTTGTAACTG ACCAGTAATT AGATTGATGC GGAACCTGCC CACCGCAAGG ACAGTAGAAC AGCTCAGCCC GACGATGCCT TGGGAAACAA TCTCAAGCGT AAGCGCGATG GGGATACTAT CAAGGATTCT AAAACGCAAG AGTATCTTTC CCTCTTACAG CAACCATCTA AGACCCGAAC GTGGGCAAAT GATGATCAGC TACCTGATCC TGATGAGACC GACTCACATG CACAAGAACA AGAACAGCCT TTCGACGTCG ATGATCAGGA GGAACTGACA TACGCTCAGA GGAAAAAGGC AAAACTGGGT CAGGATGCAA ACGAAAGCTC TCATGTTCCA GTAGTTGCTG GCTATCAACC TACAACTGAC GAAAGCGATG GTCAACCCTC GCCTGAAAAA CATGAAGAGG AGTTGGAGGA CCCGCAAAAG GACCAGGCGC CTGTTTCTGA CTCTGACTGG CTGCGCTCAA AAACAAGTCG CCTTCTAGGC TTGCTAGACG AAGATGAACA AGAAACGTTT GCGTCCCCTG CGGCGGCCAC GAATCCTACG CCAATAATCA ACTCCAACGT TGAGAAGCCC GAAGCTGAGA GTCCGGAGAA GCCCGCTGAG AGCGACTTGA CGAAAGCGCC CACAGCAGCG GAAGTTGACA CGAACATCGA AAACATCCGC ATTTCAGCAC GTTTATTTGT CAGAAATTTG TCTTACGAGA CGAAAGAGTC AGAACTCGAG CCGGTTTTTT CGCCTTTTGG CAAAATTGAA GAAGTAAGGA CCCTTTCATG TTTCGCTATC CAAACGCCCT TCATAGTGCA ATCTTCTGCT TGAATGATGA TATACCTCTG ATAGGGACAT CCGATGCAAA AGCAAGTGAT GTGAAAATCA GAAGAGAGAA TTTTAGTAGA TGCCTCTCGT TATCTGATAT AGCAGCTTTA TTTCTGTTTT GACTCATACA ATTGCCTTGA AGCTATTGCT AACGTTTCAT TGCTTTAGAT TCATGTTGCC TTCGATACAA GATTTACGAC CAGCAAAGGA TTTGCGTACG TTCAGTACGC TGATCCTGAC GCGGCTGTTG AAGCGTACAG AAATTTAGAT GGAAAGATCT TTCAGGGTCG CTTATTACAT ATTCTGCCTG CATCGCAGAA GAAGACGTAC AAGCTAGACG AGCACGAGCT GTCCAAATTG CCGTTAAAGA AGCAAAAGCA GATCAAGCGA AAACAGGAGG CTGCTTCTTC AACGTTCAGC TGGAACTCCC TGTATATGAA TGTGAGTTAG GAAGTCGCTA ACGGACTACG TGAATATTTG CTAACCCATA CAGGCTGATG CCGTCATGTC TTCCGTTGCT GAACGAATTG GTGTCTCCAA AGCTGACCTT TTGGATCCCA CTTCGTCGGA TGCCGCTGTC AAGCAGGCCC ATGCTGAAAC GCATGTAATC CAAGAGACAA AAGCTTATTT CAAAGCAAAC GGTGTAAATT TGGACGCTTT CAAGCAACGA GAGCGCGGAA 
ATCTCGCCAT TTTAGTAAAG AACTTCTCGT ATGGCACAAA AACCGAAGAC CTTCGCAAAT TATTTGAACC CTTCGGACAA ATCACCAGGC TGTTGATGCC GCCGAGCGGC ACAATAGCTA TTGTGGCATT CGCGCGACCG GATGAAGCGC AGAAGGCTTT CAAAAGCCTA GCGTACCGAA AACTTGGTGA CTCGATCCTC TTTTTGGAGA AGGCACCAAA AGACCTTTTT GAGGCGGATG TTCCACCTCA GAACCCACTG CCAGAAACCA AAGCCGTCTC GCAAGGTTTC TCAACCGCAG ATACTTTTGC TGCGGATGAA GGTGACGAAG AAGTTATGGC TACTGCAACG TTGTTCATCA AGAACCTTAA CTTCTCCACA ACTAATCAAT CTTTGATAGA AGCGTTCAGG CCGCTGGATG GCTTTGTTTC CGCAAGGATA AAAACCAAAC CTGATCCCAA AAACCCTGGT CAGACACTGA GCATGGGCTT TGGTTTCGCC GATTTCAAAA CCAAGGCCCA GGCCCAGGCT GCTCTCGCTG TGATGAATGG CTACACGCTT GACCGGCACA CATTGGTAGT AAGGGCATCT CATAAGGGCA TGGATGCAGC CGAGGAAAGA AGAAAGGAGG ACACTGCGAA GAAGATCGCT GCACGACGTA CCAAGATCAT CATTAAGAAC TTGCCGTTCC AGGCCACGAA GAAAGACGTC AGGTCGCTTT TCGGGGCTTA TGGCCAGCTC CGGTCTGTGC GGGTGCCCAA GAAGTTCGAC CGGTCTGCTC GGGGTTTTGG TTTCGCTGAT TTTGTAAGCG CTCGAGAAGC GGAAAATGCC ATGGATGCCT TGAAGAACAC ACACCTTCTC GGCAGGCGAT TGGTATTGGA ATTCGCGAAC GAGGAGGCCA TAGATGCCGA AGAGGAGATT CAAAGAATCG AAAAGAAGGT AGGGGAGCAG CTGGACAGGG TCAAGCTCCA AAAGCTCACT GGAGCTGGGC GCAAAAAGTT CACTGTAGGG GCCCAGGACG ACGAGAGCTA GAGTCCCTTC TCCCTGTCGA TAGGTGCAAA TCCAGGTCGA CTTTCGCGAT ACTATCCCTT GCTTTCCGGA TAGAAGCAGC GCACACGTTC CTGTCAGTAC CTGCATACAT ACGCGTCCTG CAAAGTGCCG TCCACTCGTG CATCCGAACC TTAGATACCC TGTATCGCAT TACTATTACT GTCTTTCTGG GATCTCTCTC GATGTAAGCG TCTCCTTAAG TCCTATACAA CTAGGTACTC GCACCCTGGA TATTCCAACT CCTAGGGCT
|
Protein sequence | MESTRVFVSG LPPTLTNDQL KKHFETRFHV TDAHVLPKRR IGFVGFKSSE AAQQAVSYFN KTYMRMSKIS VDIAKPIDAE PAHRKDSRTA QPDDALGNNL KRKRDGDTIK DSKTQEYLSL LQQPSKTRTW ANDDQLPDPD ETDSHAQEQE QPFDVDDQEE LTYAQRKKAK LGQDANESSH VPVVAGYQPT TDESDGQPSP EKHEEELEDP QKDQAPVSDS DWLRSKTSRL LGLLDEDEQE TFASPAAATN PTPIINSNVE KPEAESPEKP AESDLTKAPT AAEVDTNIEN IRISARLFVR NLSYETKESE LEPVFSPFGK IEEIHVAFDT RFTTSKGFAY VQYADPDAAV EAYRNLDGKI FQGRLLHILP ASQKKTYKLD EHELSKLPLK KQKQIKRKQE AASSTFSWNS LYMNADAVMS SVAERIGVSK ADLLDPTSSD AAVKQAHAET HVIQETKAYF KANGVNLDAF KQRERGNLAI LVKNFSYGTK TEDLRKLFEP FGQITRLLMP PSGTIAIVAF ARPDEAQKAF KSLAYRKLGD SILFLEKAPK DLFEADVPPQ NPLPETKAVS QGFSTADTFA ADEGDEEVMA TATLFIKNLN FSTTNQSLIE AFRPLDGFVS ARIKTKPDPK NPGQTLSMGF GFADFKTKAQ AQAALAVMNG YTLDRHTLVV RASHKGMDAA EERRKEDTAK KIAARRTKII IKNLPFQATK KDVRSLFGAY GQLRSVRVPK KFDRSARGFG FADFVSAREA ENAMDALKNT HLLGRRLVLE FANEEAIDAE EEIQRIEKKV GEQLDRVKLQ KLTGAGRKKF TVGAQDDES