Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0404 |
Symbol | |
ID | 3832344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 408558 |
End bp | 409703 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637828341 |
Product | glycerate kinase |
Protein accession | YP_429281 |
Protein GI | 83589272 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1929] Glycerate kinase |
TIGRFAM ID | [TIGR00045] glycerate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0467091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00777473 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGTCGTA TTGTTATTGC CCCCGACTCT TTCAAAGAGA GTTTATCGGC CCCGGAAGTC GCCGCGGCCA TAGCTCAAGG TATCCATCGG GTCCTGCCGG AGGTGGAAAC CGTCAACGTG CCCATGGCCG ACGGTGGTGA AGGCCTGACA GCCACCCTGG TAGCTGCTAC CGGCGGCCGG GAGATGACCG CCACCGTCAC CGGCCCCCTG GGGGAACCGG TCCAGGCCTC CTGGGGTATC CTGGGGGACG GTATCACGGC CGTAGTGGAG ATGGCCCAGG CTTCCGGCCT GCCCCTGGTA CCCCGGGAAA AACGCAATCC CCTGGTTACC ACCACCTATG GTACAGGTGA ACTCATCCAC CAGGCCCTGG AGGCAGGTTG CCGGCGGCTG ATTGTGGGCA TCGGCGGAAG CGCTACCAAT GACGGCGGTG CCGGCATGGC CCGGGCCCTG GGGGTAAAGC TGCTGGATGC AGAGGGCGCT GACATCCCCC CCGGGGCTGG AGGACTGGAA CGCCTGGAGC GCATCGATAT CCAGGGCCTG GACCCGAGGG TGAAGGAGGT AGAAATCCTG GTAGCTTGCG ATGTTGACAA CCCCCTGTGC GGGCCCCGGG GCGCCTCGGC CGTCTACGGC CCCCAGAAAG GAGCCACGCC GGAGATGATT CCCCGCCTGG ATGCCGCCCT GGCCCGCCTG GCGGATATCG TTGCCAGGGA CCTGAAGGTG GATGTTAGAG AACTGCCCGG CGCCGGTGCC GCCGGAGGCC TGGGTGCCGG CCTGGTAGCC TTCCTGGGGG CCACCCTGCG CCGTGGTATT GAACTGGTCA TAGAAGCTGT GAATCTTGAC GGTATCCTGG CAGCCGGCGC CGACCTGGTC ATCACCGGCG AAGGGGAAAT CAACCGCCAG ACGGCTTACG GGAAGGTTCC GGCCGGAGTG GCCGGTGTGG CCGCCAAATA TGGTATCCCG GTAGTCGCCC TGGTGGGCTC CATCGGCGAA GGAGCCAGTG CCGTCTATGA TCATGGCATC CAGGGTTTCA TGAGTATTGT CCCCCGGCCG GTACCTTTGA GTTACTGCCT GGAGAATGCC GCCTCCCTGC TGGCCGATGC CGCCGAGCGC TTAATGCGCT TGCTAAGTAT AAACATTAAA AAATGA
|
Protein sequence | MRRIVIAPDS FKESLSAPEV AAAIAQGIHR VLPEVETVNV PMADGGEGLT ATLVAATGGR EMTATVTGPL GEPVQASWGI LGDGITAVVE MAQASGLPLV PREKRNPLVT TTYGTGELIH QALEAGCRRL IVGIGGSATN DGGAGMARAL GVKLLDAEGA DIPPGAGGLE RLERIDIQGL DPRVKEVEIL VACDVDNPLC GPRGASAVYG PQKGATPEMI PRLDAALARL ADIVARDLKV DVRELPGAGA AGGLGAGLVA FLGATLRRGI ELVIEAVNLD GILAAGADLV ITGEGEINRQ TAYGKVPAGV AGVAAKYGIP VVALVGSIGE GASAVYDHGI QGFMSIVPRP VPLSYCLENA ASLLADAAER LMRLLSINIK K
|
| |