Gene Mext_2817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2817 
Symbol 
ID5831920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3155069 
End bp3156403 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content72% 
IMG OID641368618 
Product2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase 
Protein accessionYP_001640278 
Protein GI163852235 
COG category[I] Lipid transport and metabolism 
COG ID[COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.288222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.315391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACC AAGCCGCGCG GCCGCCCGGG CAGGAGCCGG GGAATAAATC GGCGGCGGCG 
GTCGTGGTCG CGGCCGGCAA GGGGCTGCGT GTCGGCGGCG ACTTACCCAA GCAATACCGC
CGCGTCGGCG GCCGGGCCGT CCTGACGCGG ACGCTTGCGG CGCTGGCGCA ATCGCCCCGC
ATCACCCGCA TCCAGCCGGT GATCGCGCCG GATGCGCAGG ACTTCTATCG CGAATGCCTC
GCCGATCTCG CGCCTGCCCA TCGTGAAAAG CTCGCCGAGC CGGTGCCGGG CGGGGCGACG
CGCCAGCAAT CGGTGGCGGC CGGGCTCGAA GGGCTCGCCC GCTTAGGCGC GCCCGATCTC
GTGCTCGTCC ACGACGCGGC GCGGCCCTTC GTGGACGAGG CGCTGATCGC CCGCGCGATC
GCGGCCGGCT CCGAGCACGG CGCATCGGTG CCGGGCATCG CGGTCTCCGA CACGATCAAG
CTCGTGGAGG AGATCGCGCC GGGCATCGGC CGCGTCCACG AGACCCCGGC GCGTGAAAAT
CTCCGCGCGG TGCAGACGCC GCAGAGCTTC CGTTTCGGCC TGCTTCTCGA CGCGCATCGC
CGAGCCGTCG CCGAGGGCCG CGACGGCTTC ACCGATGACG GGGCGCTCGC TGAATGGGCC
GGGCTGCCGG TCGTGGTGTT CGAGGGCGAC GCCCGCAACC GCAAGATCAC TCAGGCTGCC
GACCTGATCG AGGCCGACCG GGCATTCTCC GGACGGGCTT TCTCTGAACC TGCGGCCGCG
ATATCGGATG ACACCATGAC CACTTACGTA ACCCGCCTCG GCACCGGCTT CGACGTCCAC
GCCTTCACGG AGGGCGACCA TGTCTGGCTC GGCGGCGTGA AGATCCCCGC CGACCGCGGC
GTGCTCGCCC ATTCCGACGG CGACGTGGCG CTGCACGCCC TCACCGACGC GCTGCTCGGC
GCCATCGCCG ACGGCGACAT CGGCACGCAC TTCCCGCCCT CGGACGAGAA GTGGCGCGGC
GCGGCCTCCG ATCAGTTCCT GGCCCATGCC TGCGAATTGG TGCGGGCGCG CGGCGGCAAG
ATCGACCATC TCGACATCAC GGTGCTGGCG GAAGCCCCGC GCATCGGCCA GCACCGCGAG
GCGATCCGCG CGCGTATCGC CGCGATCGCC GGCGTGCCGC TGTCCTCGGT GTCGATCAAG
GCGACCACGA CCGAAAAGCT TGGCTTCGTC GGTCGCGCCG AGGGCCTCGC TGCCCAGGCC
GCCGCGACGG TGCGGCTGCC GGAGGTCTGC GCGGAGCTGG AGACCGAGGC GGAGACCAAC
GAGCGCCGTT CGTGA
 
Protein sequence
MSDQAARPPG QEPGNKSAAA VVVAAGKGLR VGGDLPKQYR RVGGRAVLTR TLAALAQSPR 
ITRIQPVIAP DAQDFYRECL ADLAPAHREK LAEPVPGGAT RQQSVAAGLE GLARLGAPDL
VLVHDAARPF VDEALIARAI AAGSEHGASV PGIAVSDTIK LVEEIAPGIG RVHETPAREN
LRAVQTPQSF RFGLLLDAHR RAVAEGRDGF TDDGALAEWA GLPVVVFEGD ARNRKITQAA
DLIEADRAFS GRAFSEPAAA ISDDTMTTYV TRLGTGFDVH AFTEGDHVWL GGVKIPADRG
VLAHSDGDVA LHALTDALLG AIADGDIGTH FPPSDEKWRG AASDQFLAHA CELVRARGGK
IDHLDITVLA EAPRIGQHRE AIRARIAAIA GVPLSSVSIK ATTTEKLGFV GRAEGLAAQA
AATVRLPEVC AELETEAETN ERRS