Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2341 |
Symbol | |
ID | 3832059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2458296 |
End bp | 2459960 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637830264 |
Product | hypothetical protein |
Protein accession | YP_431170 |
Protein GI | 83591161 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.920789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGAAG TCTATCTGGT TCTATGTGGA ATAATCGATA AAAAATGTAG CTGGTTACCG GCTACTTGGG GAGGGTGTGG ATTGGAAAGA AGGTTAATTA GACGCGGACC TCTAAGCAAG AGATTACTAA TGCTCCTCAT CTTGACTATA CTTGCCGGCA GCTTTGCCTT TTCAGGAAAT GCCTTAGCTA ACGACAATGA AGAAAATGAA CCTGAGAAAA TTAATAATGA GGTAGCGGCG GAGGTCAGGA ATATCAGTTA TCAACTTGAT AAAGGTGCGA TGGATCCGGG TACGGCAATC AGCCTTTTAA TTAGCGAGAT AGATGCACTT GAGGAAATTG ACAAAGTCGA TATTGAAAGT GCCACCCTGG ACGAGGTAAA AAAAGCGGTG TCACTAATCG TTCAAGAAAT GCCTTTACTT CCGAATTCTC TTCTAAAGGT TTGGTTTACA GGCAGCAAGG CTGTGGTAGA TATAGCTGAA AACGCTATTC CCTACCACCT AGGAAGAGTG ATGGATGACG CTAAAAAGCT GGAAGAACAT CTAGATAAGC TCGGGTTAAC GGAACAGGCG CAAGACCTAG AAAACGCCAG GCCCTATAAC AATTTACTCT TAAAAATACC TGAAGTGGTC GCTAAAAAGG AAATCCAAGT AAATGTGCCT GTAAGTGTTG CCAAAGACCT TATTGCCAGG AACTTGAATT TAAAAATTCA AGGTGATGGT ATTTCTTTTT TAATGCCTGC AAAAGATTTT GATTTTAGTG ACGGAACTGC TAGAGTAGCT ATTATCTACA AAGAAATGGA GAGCAAGGAC CTCCCTAACC TTGACGCGAA CTTGACTCTC CATTTTGCCA GCAGAATATA TGCTGTCAAG GCTTATCGGC TGGCAGTTAA CGGGGCTTCG TCACCGGCAG ATTTTAGCCA GCGCGTTCGG ATGGCCCTGT CATTTGATCT CGGTAAAGAC AGCGATCCTG CGCGCTTAGG TGCTTTCCGA TATGACGAAA CGAAAAAAAG ATGGATATAT ATAAAGGGTG TTAATGCTAC TGCTGGCATT ATTGAATTAC ATCTTGAAAA TGAAGGTATA TATGCCGTCA TCGATTATCA AAAGACCTTT GAAGACATTC AAAAACACTG GGCTAAAGCG GATATTGAAG CCATGGCCGC CAGGCAGATA GCAATAGGGG TTACTACCAG TCTATTTATG CCTGATGAAC CCCTTAGCAG GGCCCAATTT ACCGCTTTAT TATTAAGATC TCTTAACAAA AAAGAAATAA CTCCGACTAC GCCTACCTTT AAAGACGTGA ACTCAGGCGA CTGGTTTTAT GGTGCCGTTG AAGCCGCTTA CCGGGCGGGA CTGGTTTCGG GTAAGGAAAA AGATATCTTT GCTCCGCGAG ATAATATCAC CAGGGAGGAA ATGGTCGTTA TGCTGGCCCG GGCTCTAAAG GCTGCGGGCT ACCATGGTAC GGGTAATGTG GGCAGCCTTG AGTGCTTCAG AGACTATGCA GTTATTTCCG ATTGGGCCAA GGGTGCCTTT GCTTCCTGCC AGGACGCAGG GATCAATTTG ACTCTACCAG ATGGCAACTT GCACCCCCAT GACGATGCTA CCCGGGCCCA GGCCATGGTA ATGTTAAAGG GATTTTCTGA TGCGATCCAA ACTCTGGACA GATAG
|
Protein sequence | MGEVYLVLCG IIDKKCSWLP ATWGGCGLER RLIRRGPLSK RLLMLLILTI LAGSFAFSGN ALANDNEENE PEKINNEVAA EVRNISYQLD KGAMDPGTAI SLLISEIDAL EEIDKVDIES ATLDEVKKAV SLIVQEMPLL PNSLLKVWFT GSKAVVDIAE NAIPYHLGRV MDDAKKLEEH LDKLGLTEQA QDLENARPYN NLLLKIPEVV AKKEIQVNVP VSVAKDLIAR NLNLKIQGDG ISFLMPAKDF DFSDGTARVA IIYKEMESKD LPNLDANLTL HFASRIYAVK AYRLAVNGAS SPADFSQRVR MALSFDLGKD SDPARLGAFR YDETKKRWIY IKGVNATAGI IELHLENEGI YAVIDYQKTF EDIQKHWAKA DIEAMAARQI AIGVTTSLFM PDEPLSRAQF TALLLRSLNK KEITPTTPTF KDVNSGDWFY GAVEAAYRAG LVSGKEKDIF APRDNITREE MVVMLARALK AAGYHGTGNV GSLECFRDYA VISDWAKGAF ASCQDAGINL TLPDGNLHPH DDATRAQAMV MLKGFSDAIQ TLDR
|
| |