Gene Moth_2404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2404 
Symbol 
ID3830771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2524133 
End bp2524993 
Gene Length861 bp 
Protein Length286 aa 
Translation table11 
GC content62% 
IMG OID637830323 
Productfructose-1,6-bisphosphate aldolase 
Protein accessionYP_431229 
Protein GI83591220 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0191] Fructose/tagatose bisphosphate aldolase 
TIGRFAM ID[TIGR00167] ketose-bisphosphate aldolases
[TIGR01859] fructose-1,6-bisphosphate aldolase, class II, various bacterial and amitochondriate protist 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTAG TTACCTTGGC CGAAGTTCTG CGGGAAGCCG ATACCGGCGG CTACGCCGTA 
GGTGCCTTCA ACTGCAACAA CATGGAAATC GTCCAGGCCA TTATCAACGC TGCTGTTACC
GCGCAAGCCC CGGTCATCAT CCAGGCCAGC CAGGGGGCCA TCAAGTACGC GGGGCTGGAG
TACATAACCT CCCTGGTACG GACGGCGGCC AGTCAAGCGC CAGTACCGGT GGTCCTCCAC
CTGGACCACG GCACGGACTT TGAGCAGGTC TTGCGTTGTC TGCGGGCCGG CTTTACCTCG
GTGATGATCG ACGGCTCCAA ATACCCCCTG GAAGAAAACA TCGCCCTCAC CAGGAAGGTA
GTAGAGATTG CCCACGCCAT GGGTGCCTCG GTGGAGGGGG AGCTGGGACG CATCGGCGGT
ACCGAGGAAC AAATCAAGGT CTCGGAGAGG GAAGCCACCA TGACCGACCC GGAGGAGGCC
CAGCGGTTCG CTCGGGAGAC CGGGGTTGAT GCCCTGGCTG TTGCCATCGG CACGGCCCAC
GGCCGCTACC ACGGCACTCC CAGACTGGAT TTTGAGCGCC TGGCGACCAT CGACCGCCTG
GTACCGACGC CTATAGTCCT CCACGGTTCT TCCGGGGTGC CTGATGACGA TATCCGCCGC
GCCGTCGAAC TGGGCGTCCG TAAAATCAAC ATCGACACCG ATATCCGCAT CGCCTTTATC
GAGGCCACCC GCACCGCCCT GGACGCCAAT CCCGACGAGA TCGACCCGCG GAAGGTCCTG
GGACCGGCGA GGGATGCCGC CAGCAAGGTC ATCAGCCATA AGATGCAGGT CTTCGGCTGC
GCCGGCAGGG TAAAAAAATA G
 
Protein sequence
MPLVTLAEVL READTGGYAV GAFNCNNMEI VQAIINAAVT AQAPVIIQAS QGAIKYAGLE 
YITSLVRTAA SQAPVPVVLH LDHGTDFEQV LRCLRAGFTS VMIDGSKYPL EENIALTRKV
VEIAHAMGAS VEGELGRIGG TEEQIKVSER EATMTDPEEA QRFARETGVD ALAVAIGTAH
GRYHGTPRLD FERLATIDRL VPTPIVLHGS SGVPDDDIRR AVELGVRKIN IDTDIRIAFI
EATRTALDAN PDEIDPRKVL GPARDAASKV ISHKMQVFGC AGRVKK