Gene Moth_0020 details


Gene Information

Locus tag: Moth_0020
Symbol:
ID: 3831893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 20314
End bp: 21891
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 62%
IMG OID: 637827947
Product: D-3-phosphoglycerate dehydrogenase
Protein accession: YP_428903
Protein GI: 83588894
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID: [TIGR01327] D-3-phosphoglycerate dehydrogenase
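
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not translated; both are assumptions about the record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 20314
end_bp = 21891
gene_length_bp = 1578
protein_length_aa = 525

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that yields no residue.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, codon_count, expected_protein_aa)  # 1578 526 525
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.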


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.173691
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000002432
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGTGTTT TAGCTCTGGA CGGCGTTGAC GCGCGGGGCC TGGCCGCTCT CCGGGAGGCC 
GGGCTGGAAG TAACGACCTC CGGTAAGATG GAGGAGGAAG AATTGAAAGA GGCTATTCGC
GACTGCGAAG CCCTGATCGT CCGCAGCGGC ACGAGGGTGA CAGCAGCAGC CATTAATGCC
GCTAAAAAGT TAAAGATTAT CGCCCGTGCC GGGGTCGGTA CCGACAACAT CGATGTAGCG
GCGGCCACTG AAAGGGGTAT TGTAGTGGTC AATGCCCCTG AGGGCAATAC CATTGCTGCC
GCTGAACACA CCATAGCCAT GATGCTGGCC CTGGCCCGTA ATATCCCCCA GGCCAGCGCC
GCCCTGAAAC AGGGCCGCTG GGAAAAGAAA AAATTTGTAG GCGTGGAACT GCGGGGAAAA
ACCCTGGGGA TAATCGGCCT GGGCAAGATT GGCCGGGAGG TGGCCCGTCG CGCCCGGGGC
CTGGAAATGA AGGTGGTGGC CTTTGATCCC TATGTAGATT CGGAACAGGC GGCCCGTCTG
GAAGTCGAGT TGGTGCCCCT GGAAACCCTT CTGGCCGGGG CTGATTTTGT AACCGTGCAT
CTGCCCCTCA CCAAAGACAC CCGCCACCTC CTGGACCGGG AGAAGCTGGG GCTCATGAAG
CAGGGAGCGC GGGTTTTAAA TGTCGCCAGG GGGGGTATTA TTGACGAAGG AGCCCTCTAT
GAAGCCCTGA AGGCGGGCCA CCTGGCGGGG GCGGCCCTGG ATGTCTTTGA GGAAGAACCC
CTGGGGCAGA GCCCCCTGCT GGAACTGGAA AATGTCATTG TGACCCCGCA CCTGGGGGCT
TCCACCCGGG AGGCCCAGGT GGCAGTGGCG GTGGAGGTGG CCGGTGACGT TATCCGTTGC
CTCCAGGGTG AACCGGTGCT CAATGCCGTC AATATACCGG TGGTTCGGGG GCATCTGGCG
GAGGTCCTCC ACCCCTACCT GCAGCTGGCG GAGAAGCTGG GCAGTTTCCT CTCCCAGCTG
ATGGAGAGCC CCATCCTCAC GGCAGAAATA TGTTTTAACG GCGAGCTGGC CGGCTACGAC
CTGGCCCCCC TTACCAGTTC CTTTTTAAAG GGGCTCTTAC GGCCCCTCCT GGCTGAAGCC
GTCAATTACG TGAACGCCCC CCTGGTGGCC AAAAAACGGG GTATCCGTAT CCGAGAGAAA
AAGAGCCCGG AGATGGAGTA CTTTGCCAAC CTGATCGGCG TTCAGGTCCA GGGTCGGCGG
GAGTCCCACC GCCTGGCCGG GACCGTTAAC CAGGCTGGGG AACCCCGGTT GGTAAACCTG
GACGGTTACA GCGTGGACAC TATCCCGGCC GGCCATCTTC TGGTGATACC TCACCTGGAC
CGGCCCCGGA TTATCGGCCC GGTGGCCCTG GCTATCGGTG ACCACGGCGT CAATATCGCG
GCGATGCAGG TGGGCCGGCG CGAGCGGGGC GGCCAGGCCG TTATGCTAAT CAGCGTCGAT
TCCGAAGTCC CCCGGGCCGC CCTGGACGCC ATTCGCCAGG TGGACGGCGT CCTGGACGTG
CGTTATATCT CCCTTTAG
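
The gene sequence is printed in blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of one way to strip the spacing and compute a GC fraction that can be compared with the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace and upper-casing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCGTGTTT TAGCTCTGGA CGGCGTTGAC GCGCGGGGCC TGGCCGCTCT CCGGGAGGCC"
seq = clean_sequence(first_row)

print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `clean_sequence` in place of `first_row` gives the whole-gene value.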
 
Protein sequence
MRVLALDGVD ARGLAALREA GLEVTTSGKM EEEELKEAIR DCEALIVRSG TRVTAAAINA 
AKKLKIIARA GVGTDNIDVA AATERGIVVV NAPEGNTIAA AEHTIAMMLA LARNIPQASA
ALKQGRWEKK KFVGVELRGK TLGIIGLGKI GREVARRARG LEMKVVAFDP YVDSEQAARL
EVELVPLETL LAGADFVTVH LPLTKDTRHL LDREKLGLMK QGARVLNVAR GGIIDEGALY
EALKAGHLAG AALDVFEEEP LGQSPLLELE NVIVTPHLGA STREAQVAVA VEVAGDVIRC
LQGEPVLNAV NIPVVRGHLA EVLHPYLQLA EKLGSFLSQL MESPILTAEI CFNGELAGYD
LAPLTSSFLK GLLRPLLAEA VNYVNAPLVA KKRGIRIREK KSPEMEYFAN LIGVQVQGRR
ESHRLAGTVN QAGEPRLVNL DGYSVDTIPA GHLLVIPHLD RPRIIGPVAL AIGDHGVNIA
AMQVGRRERG GQAVMLISVD SEVPRAALDA IRQVDGVLDV RYISL
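
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on an external library; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts, so the standard assignments are used. `translate` is a hypothetical helper name, and the code assumes the input contains only A, C, G and T (any other symbol raises a `KeyError`).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA sequence, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCGTGTTT TAGCTCTGGA CGGCGTTGAC GCGCGGGGCC TGGCCGCTCT CCGGGAGGCC"
last_row = "CGTTATATCT CCCTTTAG"

print(translate(first_row))  # MRVLALDGVDARGLAALREA
print(translate(last_row))   # RYISL
```

The two outputs correspond to the first twenty and the last five residues of the protein sequence listed above.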