Gene TM1040_1364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1364 
Symbol 
ID4076381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1458902 
End bp1460050 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content62% 
IMG OID638006674 
Product2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase 
Protein accessionYP_613359 
Protein GI99081205 
COG category[I] Lipid transport and metabolism 
COG ID[COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCG CCGTATTGAT CGTTGCCGCC GGCAAAGGCA CCCGCGCAGG CGGTGGTCTC 
GCCAAGCAAT GGCGCCCACT GGCCGGGCGA CTGGTCATCG ACTGGACGAT CGAGGCCTTT
CAACGTGCGG GGTGCGGCAC CATCATGGTT GTGCGCGATC CCGACAATGA GCACGCGATC
GAGGCGCTTG CGCCCTACCC TGAATTATTG CTCGCAGATG GCGGTCCCTC GCGGTCTGAA
TCCGTGCGTA ACGGTTTGAT TGCGCTCCAA GAGATCGGTG TCGAACGCGT TCTTATTCAT
GACGCGGCGC GTCCATGTGT GTGTCCTCAG GTGATCCAAC AGGTGCTCGA CGCACTTGAT
GACACGCCTG CTGCCGCGCC AGGACTTGCG GTGACAGATG CGCTTTGGAC CGGGGCCGAT
GGCCATGTCA CAGGCACGCA GGACCGAAGC GCGCTCTTTG CGGCGCAAAC GCCGCAAGGC
TTTCATTTTG ACGCGATCCT TGCGGCGCAT ATGCGCCACG ACGGCACCGC AGCGGATGAT
GTCGAGGTTG CCCGTCAAGC GGGGCTCGCG GTCCGTATCA CGCCGGGTGA CGTCAATAAT
ATCAAGATCA CCCGGCCCGA AGATTTCTCC CGCGCCGAGC ACATATTGAG GAGCACCATG
GACAACATTC CTGACATCAG GCTTGGAAAT GGCTATGACG TTCACCGGTT CGGACCCGGG
GATCATGTCA TGCTCTGTGG GGTTCAAGTG CCGCATGAGC GCGGTCTGCA AGGCCATTCC
GATGCGGATG TGGGCATGCA CGCGGTCACC GACGCACTCT ACGGGGCGAT GGCAGAGGGC
GACATCGGCC GCCACTTCCC GCCAAGCGAC CCTCAGTGGA AAGGCGCGGC GTCGGACATC
TTCCTGCGCC ATGCGGTCGA ATTGGCACGC TCCAAAGGGT TCACCATCAA TAACGTGGAT
TGCACCCTCG TCTGTGAATA CCCCAAAGTC GGCCCCCACG CAGAGGCGAT GCGCGCCCGG
ATGGCAGAGA TCATGGGCAT GGATATGGGA CGCCTCTCGA TCAAGGCGAC AACTTCAGAG
CGGCTTGGGT TCACCGGTCG CAAAGAAGGC ATCGCGGCAC TGGCGACAGC AACATTGGTG
CGGGCATGA
 
Protein sequence
MTLAVLIVAA GKGTRAGGGL AKQWRPLAGR LVIDWTIEAF QRAGCGTIMV VRDPDNEHAI 
EALAPYPELL LADGGPSRSE SVRNGLIALQ EIGVERVLIH DAARPCVCPQ VIQQVLDALD
DTPAAAPGLA VTDALWTGAD GHVTGTQDRS ALFAAQTPQG FHFDAILAAH MRHDGTAADD
VEVARQAGLA VRITPGDVNN IKITRPEDFS RAEHILRSTM DNIPDIRLGN GYDVHRFGPG
DHVMLCGVQV PHERGLQGHS DADVGMHAVT DALYGAMAEG DIGRHFPPSD PQWKGAASDI
FLRHAVELAR SKGFTINNVD CTLVCEYPKV GPHAEAMRAR MAEIMGMDMG RLSIKATTSE
RLGFTGRKEG IAALATATLV RA