Gene Moth_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0404 
Symbol 
ID3832344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp408558 
End bp409703 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content64% 
IMG OID637828341 
Productglycerate kinase 
Protein accessionYP_429281 
Protein GI83589272 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1929] Glycerate kinase 
TIGRFAM ID[TIGR00045] glycerate kinase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0467091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00777473 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGTCGTA TTGTTATTGC CCCCGACTCT TTCAAAGAGA GTTTATCGGC CCCGGAAGTC 
GCCGCGGCCA TAGCTCAAGG TATCCATCGG GTCCTGCCGG AGGTGGAAAC CGTCAACGTG
CCCATGGCCG ACGGTGGTGA AGGCCTGACA GCCACCCTGG TAGCTGCTAC CGGCGGCCGG
GAGATGACCG CCACCGTCAC CGGCCCCCTG GGGGAACCGG TCCAGGCCTC CTGGGGTATC
CTGGGGGACG GTATCACGGC CGTAGTGGAG ATGGCCCAGG CTTCCGGCCT GCCCCTGGTA
CCCCGGGAAA AACGCAATCC CCTGGTTACC ACCACCTATG GTACAGGTGA ACTCATCCAC
CAGGCCCTGG AGGCAGGTTG CCGGCGGCTG ATTGTGGGCA TCGGCGGAAG CGCTACCAAT
GACGGCGGTG CCGGCATGGC CCGGGCCCTG GGGGTAAAGC TGCTGGATGC AGAGGGCGCT
GACATCCCCC CCGGGGCTGG AGGACTGGAA CGCCTGGAGC GCATCGATAT CCAGGGCCTG
GACCCGAGGG TGAAGGAGGT AGAAATCCTG GTAGCTTGCG ATGTTGACAA CCCCCTGTGC
GGGCCCCGGG GCGCCTCGGC CGTCTACGGC CCCCAGAAAG GAGCCACGCC GGAGATGATT
CCCCGCCTGG ATGCCGCCCT GGCCCGCCTG GCGGATATCG TTGCCAGGGA CCTGAAGGTG
GATGTTAGAG AACTGCCCGG CGCCGGTGCC GCCGGAGGCC TGGGTGCCGG CCTGGTAGCC
TTCCTGGGGG CCACCCTGCG CCGTGGTATT GAACTGGTCA TAGAAGCTGT GAATCTTGAC
GGTATCCTGG CAGCCGGCGC CGACCTGGTC ATCACCGGCG AAGGGGAAAT CAACCGCCAG
ACGGCTTACG GGAAGGTTCC GGCCGGAGTG GCCGGTGTGG CCGCCAAATA TGGTATCCCG
GTAGTCGCCC TGGTGGGCTC CATCGGCGAA GGAGCCAGTG CCGTCTATGA TCATGGCATC
CAGGGTTTCA TGAGTATTGT CCCCCGGCCG GTACCTTTGA GTTACTGCCT GGAGAATGCC
GCCTCCCTGC TGGCCGATGC CGCCGAGCGC TTAATGCGCT TGCTAAGTAT AAACATTAAA
AAATGA
 
Protein sequence
MRRIVIAPDS FKESLSAPEV AAAIAQGIHR VLPEVETVNV PMADGGEGLT ATLVAATGGR 
EMTATVTGPL GEPVQASWGI LGDGITAVVE MAQASGLPLV PREKRNPLVT TTYGTGELIH
QALEAGCRRL IVGIGGSATN DGGAGMARAL GVKLLDAEGA DIPPGAGGLE RLERIDIQGL
DPRVKEVEIL VACDVDNPLC GPRGASAVYG PQKGATPEMI PRLDAALARL ADIVARDLKV
DVRELPGAGA AGGLGAGLVA FLGATLRRGI ELVIEAVNLD GILAAGADLV ITGEGEINRQ
TAYGKVPAGV AGVAAKYGIP VVALVGSIGE GASAVYDHGI QGFMSIVPRP VPLSYCLENA
ASLLADAAER LMRLLSINIK K