Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2238 |
Symbol | |
ID | 3831284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2336397 |
End bp | 2337566 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637830158 |
Product | hypothetical protein |
Protein accession | YP_431068 |
Protein GI | 83591059 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0102712 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGATC TGCCAGCCAC CTTATATACA GGGGAACTGT TCTCTAATAC TGCTCCAGTA ATCATTCCGA GAAAGTATGA ATATCTCCCT GGAATTTGTG CTTTCTGTAT GTCCGGCGAA TTCACCAAAG CTTTGCGAGC GATTAATCCT AATGTTGCAG TTGAAAATGG TTATGTTAGT AAAATCCATT GGGATCCAGG CAAGTGGCTA TATTCCGATA CAACAATACC TTTGCCTAAA GCTTATGCAG ATGCACCAAC CCAATGGATC TTTAAGGGGA CAATAACATC GTCCATTGCT CCACTCCATG TCGCTATAGC CCGACTTTTA GGTTATCAAT GGCCCGAACA TGAACTGGAT GAATTAGATG AATTGAGCAA TTCAGACGGC ATAGTGTGCA TCCCTGCCGT TCGCGGAGAA GACCCGGCTG CCGAACGGCT GCGGGCGTTG CTCGCCGCCG CATATGGGAA TGACTGGTCG CCAACAAAAG AGCAGGAACT TATCGCAGCA ACGGGTTCAG AAGCAAAGGA TCTTGACGAG TGGCTTCGCA ACGATTTCTT TGAACAGCAC TGTAAGCTAT TCCACCACCG GCCCTTCATC TGGCATATCT GGGACGGGCG CCGGCGCGAC GGTTTCCATG CCTTGGTCAA CTACCATAAG CTTGCCGAGG GCAATGGCAA AGGCAGGCAG GTTTTAGAAA GCCTTACTTA TAGTTATCTC GGTGAGTGGA TTACCCGGCA GAAAGATGGG GTGAAGCGGG GCGAAGGTGG TGCTGAGGAT CGACTGGCAG CGGCATTGGA ATTGCAAAAA CGCCTTATTG CCATCCTCGA AGGTGAACCT CCATTCGACA TTTTCGTTCG CTGGAAGCCT ATCGAAAAGC AGCCCATCGG CTGGGAACCG GATATCAACG ACGGCGTGCG CATCAACATC CGGCCGTTTA TGGCTTCTGA CATTCCCGGC GGCCGGAAAG GTGCCGGCAT CCTCCGGTGG AAGCCCAATA TTAGCTGGGG CAAAGACCGT GGCAAAGAGC CTGTGCGCCC ACAAGAACAG TTTCCCTGGT TATGGAAGAA TGGTAAGTTC ACCGGCGATC GGGTCAATGA TGTGCATTTA ACCAGTGAAG AAAAAAGAAA AGCACGTGAG ACAATGAACA GGAAGACGAG GAGTAAATAG
|
Protein sequence | MRDLPATLYT GELFSNTAPV IIPRKYEYLP GICAFCMSGE FTKALRAINP NVAVENGYVS KIHWDPGKWL YSDTTIPLPK AYADAPTQWI FKGTITSSIA PLHVAIARLL GYQWPEHELD ELDELSNSDG IVCIPAVRGE DPAAERLRAL LAAAYGNDWS PTKEQELIAA TGSEAKDLDE WLRNDFFEQH CKLFHHRPFI WHIWDGRRRD GFHALVNYHK LAEGNGKGRQ VLESLTYSYL GEWITRQKDG VKRGEGGAED RLAAALELQK RLIAILEGEP PFDIFVRWKP IEKQPIGWEP DINDGVRINI RPFMASDIPG GRKGAGILRW KPNISWGKDR GKEPVRPQEQ FPWLWKNGKF TGDRVNDVHL TSEEKRKARE TMNRKTRSK
|
| |