Gene Msil_0526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0526 
Symbol 
ID7091587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp584211 
End bp585425 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content67% 
IMG OID643463856 
Product2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase 
Protein accessionYP_002360860 
Protein GI217976713 
COG category[I] Lipid transport and metabolism 
COG ID[COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTC CCGCCCCTCC GCCCGCCAAA ATCGCCATCG TTGTCGTCGC GGCCGGCCGC 
GGGTCGCGCG CCGGCGACGG GGCGCCAAAG CAATATCGGC AAGTGGCCGG CCTAACCGTC
CTCGCCCATA GCCTCAACGC GCTGGGCCGC GCCGCGCCGG ACGCGATCAT CGCGCCCGTC
ATCCATGCCG ACGATTTCGA TCTTTATAGC GAGGCGATCG CCGGCCTCGA TCCGGCGGCC
CGAGGCCAGT TGACGGCGCC CGTCTGGGGC GGCGCGACCC GGCAAGACAG CGTGCGCGCC
GGGCTGGAGG CTCTTGCCGC CGACCCTACG AACCGCCCGA ATATCGTTCT GATTCATGAT
GCTGCACGAA TCTTCGCAAG CGAAACATTA ATCATGCGCG CCATTGAGGC GGCAAGAGAA
TATGGCGCCG CGATCCCCGG AATCGCGGTC ACCGATACGA TCAAGGAGAT CGATGTCGAG
GCCTGCATCG CCGCCACCCC ACCGCGCGCG CGTCTGCGGG CGGTGCAGAC GCCCCAGGCC
TTCGACTTCA GCCTGATCCT CGACGCGCAC CGCAAAGCCG CCGCGGCCGG AGCCGCCGAT
TTGACCGATG ACGCCGCCAT CGCCGAATGG GCCGGGCATC GCGTTTTTGT GTTCAAGGGG
GACGCCGACA ATATGAAGAT TACGAGCGCT GAGGATCTCG CCGCCGCCGA AGGCAGGCTC
ATTCGCGATC TCGCCGACAT CAGAACGGGG CAAGGCTATG ACGTGCACGC TTTTGGCGCC
GGCGACCATA TCTGGCTCGG CGGCGTAGAG ATGTCGCATG ATCATGGCCT TGTCGGCCAT
TCTGACGCCG ACGTGTTAAG CCACGCCGTC ACCGATGCCC TGCTGGGGGC TCTCGCCGAC
GGCGACATCG GCAGCCATTT TCCGCCCTCC GATCCGCAAT GGCGCGGCGC GGCTTCAAAG
ATTTTCCTCG CGGCGGCGGC GGCGCGGGTG CGCGCGCGCG GCGGCATGAT CGCCCATATC
GACGCGACCG TGGTCTGCGA ACGGCCGAAG ATCGGGCCGC ATCGCGACGC CATTCGCGCC
AGCCTCGCGG CGATCGTCGG CGTTTCCCTC GACCGCGTCG CCGTGAAGGC GACGACCAGC
GAGCGCCTCG GCTTTACGGG GCGGGAAGAG GGGATCGCCG CCTTCGCAAT CGCCACCGTA
CGTTTGCCGC TCTGA
 
Protein sequence
MKFPAPPPAK IAIVVVAAGR GSRAGDGAPK QYRQVAGLTV LAHSLNALGR AAPDAIIAPV 
IHADDFDLYS EAIAGLDPAA RGQLTAPVWG GATRQDSVRA GLEALAADPT NRPNIVLIHD
AARIFASETL IMRAIEAARE YGAAIPGIAV TDTIKEIDVE ACIAATPPRA RLRAVQTPQA
FDFSLILDAH RKAAAAGAAD LTDDAAIAEW AGHRVFVFKG DADNMKITSA EDLAAAEGRL
IRDLADIRTG QGYDVHAFGA GDHIWLGGVE MSHDHGLVGH SDADVLSHAV TDALLGALAD
GDIGSHFPPS DPQWRGAASK IFLAAAAARV RARGGMIAHI DATVVCERPK IGPHRDAIRA
SLAAIVGVSL DRVAVKATTS ERLGFTGREE GIAAFAIATV RLPL