Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0020 |
Symbol | |
ID | 3831893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 20314 |
End bp | 21891 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637827947 |
Product | D-3-phosphoglycerate dehydrogenase |
Protein accession | YP_428903 |
Protein GI | 83588894 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases |
TIGRFAM ID | [TIGR01327] D-3-phosphoglycerate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.173691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000002432 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTGTTT TAGCTCTGGA CGGCGTTGAC GCGCGGGGCC TGGCCGCTCT CCGGGAGGCC GGGCTGGAAG TAACGACCTC CGGTAAGATG GAGGAGGAAG AATTGAAAGA GGCTATTCGC GACTGCGAAG CCCTGATCGT CCGCAGCGGC ACGAGGGTGA CAGCAGCAGC CATTAATGCC GCTAAAAAGT TAAAGATTAT CGCCCGTGCC GGGGTCGGTA CCGACAACAT CGATGTAGCG GCGGCCACTG AAAGGGGTAT TGTAGTGGTC AATGCCCCTG AGGGCAATAC CATTGCTGCC GCTGAACACA CCATAGCCAT GATGCTGGCC CTGGCCCGTA ATATCCCCCA GGCCAGCGCC GCCCTGAAAC AGGGCCGCTG GGAAAAGAAA AAATTTGTAG GCGTGGAACT GCGGGGAAAA ACCCTGGGGA TAATCGGCCT GGGCAAGATT GGCCGGGAGG TGGCCCGTCG CGCCCGGGGC CTGGAAATGA AGGTGGTGGC CTTTGATCCC TATGTAGATT CGGAACAGGC GGCCCGTCTG GAAGTCGAGT TGGTGCCCCT GGAAACCCTT CTGGCCGGGG CTGATTTTGT AACCGTGCAT CTGCCCCTCA CCAAAGACAC CCGCCACCTC CTGGACCGGG AGAAGCTGGG GCTCATGAAG CAGGGAGCGC GGGTTTTAAA TGTCGCCAGG GGGGGTATTA TTGACGAAGG AGCCCTCTAT GAAGCCCTGA AGGCGGGCCA CCTGGCGGGG GCGGCCCTGG ATGTCTTTGA GGAAGAACCC CTGGGGCAGA GCCCCCTGCT GGAACTGGAA AATGTCATTG TGACCCCGCA CCTGGGGGCT TCCACCCGGG AGGCCCAGGT GGCAGTGGCG GTGGAGGTGG CCGGTGACGT TATCCGTTGC CTCCAGGGTG AACCGGTGCT CAATGCCGTC AATATACCGG TGGTTCGGGG GCATCTGGCG GAGGTCCTCC ACCCCTACCT GCAGCTGGCG GAGAAGCTGG GCAGTTTCCT CTCCCAGCTG ATGGAGAGCC CCATCCTCAC GGCAGAAATA TGTTTTAACG GCGAGCTGGC CGGCTACGAC CTGGCCCCCC TTACCAGTTC CTTTTTAAAG GGGCTCTTAC GGCCCCTCCT GGCTGAAGCC GTCAATTACG TGAACGCCCC CCTGGTGGCC AAAAAACGGG GTATCCGTAT CCGAGAGAAA AAGAGCCCGG AGATGGAGTA CTTTGCCAAC CTGATCGGCG TTCAGGTCCA GGGTCGGCGG GAGTCCCACC GCCTGGCCGG GACCGTTAAC CAGGCTGGGG AACCCCGGTT GGTAAACCTG GACGGTTACA GCGTGGACAC TATCCCGGCC GGCCATCTTC TGGTGATACC TCACCTGGAC CGGCCCCGGA TTATCGGCCC GGTGGCCCTG GCTATCGGTG ACCACGGCGT CAATATCGCG GCGATGCAGG TGGGCCGGCG CGAGCGGGGC GGCCAGGCCG TTATGCTAAT CAGCGTCGAT TCCGAAGTCC CCCGGGCCGC CCTGGACGCC ATTCGCCAGG TGGACGGCGT CCTGGACGTG CGTTATATCT CCCTTTAG
|
Protein sequence | MRVLALDGVD ARGLAALREA GLEVTTSGKM EEEELKEAIR DCEALIVRSG TRVTAAAINA AKKLKIIARA GVGTDNIDVA AATERGIVVV NAPEGNTIAA AEHTIAMMLA LARNIPQASA ALKQGRWEKK KFVGVELRGK TLGIIGLGKI GREVARRARG LEMKVVAFDP YVDSEQAARL EVELVPLETL LAGADFVTVH LPLTKDTRHL LDREKLGLMK QGARVLNVAR GGIIDEGALY EALKAGHLAG AALDVFEEEP LGQSPLLELE NVIVTPHLGA STREAQVAVA VEVAGDVIRC LQGEPVLNAV NIPVVRGHLA EVLHPYLQLA EKLGSFLSQL MESPILTAEI CFNGELAGYD LAPLTSSFLK GLLRPLLAEA VNYVNAPLVA KKRGIRIREK KSPEMEYFAN LIGVQVQGRR ESHRLAGTVN QAGEPRLVNL DGYSVDTIPA GHLLVIPHLD RPRIIGPVAL AIGDHGVNIA AMQVGRRERG GQAVMLISVD SEVPRAALDA IRQVDGVLDV RYISL
|
| |