Gene Smed_1087 details


Gene Information

Locus tag: Smed_1087
Symbol: ispDF
ID: 5321933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1154196
End bp: 1155410
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 65%
IMG OID: 640790028
Product: bifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein
Protein accession: YP_001326773
Protein GI: 150396306
COG category: [I] Lipid transport and metabolism
COG ID: [COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
        [COG1211] 4-diphosphocytidyl-2-methyl-D-erythritol synthase
TIGRFAM ID: [TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
            [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase
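
The coordinate and length fields in this record agree with each other, and the arithmetic can be sketched in a few lines of Python. This is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminating stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


print(gene_length(1154196, 1155410))  # 1215
print(protein_length(1215))           # 404
```

Both values match the Gene Length and Protein Length fields above.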


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCCG AAGAACAGTT CTCCTGCGGT GTCATTGTCG TCGCGGCCGG CCGCGGCGAG 
CGCGCCGGTC AGTCGAGCGA GGGGCCGAAG CAATATCGCA CGGTCGGCGA TCGCCCGGTT
ATCACTCATA CGCTTGACGT TTTTGCGACA TGGGACGGAA CGGGACCTGT CGTCGTCGTG
ATTCACCCCG AGGATGAAGA GCTCTTCGCG TCTGCCCGCA AGCGGATGGG CCATATGCTG
GACCTTACGG TGGTACATGG CGGGGCGACT CGCCAGCTGT CGGTTCTGGC CGGACTGCAA
GCGATCGCCG GCGCCGGGGT AAAACATGTC ATGATCCACG ACGCAGTGCG CCCGTTCTTC
GACCATGCCC TTCTCGATCG CTGTCGGGCG GCGCTGCGGA ACGGCGCGGG GGCGGTCCTG
CCGGCCGTTG CGGTCGCCGA CACCCTGAAA CGCGCGCAAG CCGGCGGCCT CGTCGCCGAA
ACCGTACCGA GAACCGATCT CCATGCCGCC CAGACGCCGC AGTGCTTCCG CCTCGAGGCG
ATCCTCTCCG CTCACCGGCA AGCGGCGGCA AGCGGCCAAG CGGATTTTAC CGACGACGCC
TCAATCGCCG AATGGGCCGG TATTCCGGTC CATCTGGTGG AAGGCTCGCC GGACAACTTC
AAGTTGACGC TCCGGAGGGA CCTGTCGATG GCAGATGAGA AACTGACGCG CATGGCAATC
CCTGATGTGC GTACCGGAAA CGGATATGAC GTGCATCAAC TCGTCGAGGG TGACGGCGTC
ACGCTCTGCG GCGTGTTCAT TCCCCATGAT CGCAAGCTCT CGGGTCATTC CGACGCGGAT
GTAGCGCTCC ATGCCTTGAC GGACGCCCTG CTTGCCACCT GCGGCGCCGG CGACATCGGC
GACCACTTCC CGCCGTCCGA TCCGCGGTGG AAGGGCGCGC CTTCGCACAT TTTCCTCGAA
CATGCCGCAC GGATCGTCCG CGAGCGTGGC GGCACTATCA CCCATGCGGA TATATCGCTG
ATCGCCGAGG CGCCCAAGGT CGGTCCGCAC CGTCAGCAGA TGCGGGAAAG CCTGTCGGCC
ATGCTCGCGA TTGCGATCGA CCGTTGCTCC GTCAAGGCGA CCACCAACGA GAAACTCGGC
TTCGTCGGCC GCAACGAAGG CATTGCGGCA ATCGCCACGG CGACGGTCGT GTATGCATCA
GGGGGTGATG CGTGA
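
The GC content field can be recomputed from a sequence printed in this blocked layout. The following is a minimal sketch, assuming the sequence is pasted as text with blocks of ten bases separated by spaces and line breaks. The helper name `gc_fraction` is hypothetical, and only the first row of the gene is used here to keep the example short.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above; paste the full sequence to
# compare the result with the GC content field of the record.
first_row = "ATGCAAGCCG AAGAACAGTT CTCCTGCGGT GTCATTGTCG TCGCGGCCGG CCGCGGCGAG"
print(f"{gc_fraction(first_row):.1%}")
```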
 
Protein sequence
MQAEEQFSCG VIVVAAGRGE RAGQSSEGPK QYRTVGDRPV ITHTLDVFAT WDGTGPVVVV 
IHPEDEELFA SARKRMGHML DLTVVHGGAT RQLSVLAGLQ AIAGAGVKHV MIHDAVRPFF
DHALLDRCRA ALRNGAGAVL PAVAVADTLK RAQAGGLVAE TVPRTDLHAA QTPQCFRLEA
ILSAHRQAAA SGQADFTDDA SIAEWAGIPV HLVEGSPDNF KLTLRRDLSM ADEKLTRMAI
PDVRTGNGYD VHQLVEGDGV TLCGVFIPHD RKLSGHSDAD VALHALTDAL LATCGAGDIG
DHFPPSDPRW KGAPSHIFLE HAARIVRERG GTITHADISL IAEAPKVGPH RQQMRESLSA
MLAIAIDRCS VKATTNEKLG FVGRNEGIAA IATATVVYAS GGDA
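
The protein sequence can be checked against the gene sequence by translating it codon by codon. Below is a minimal sketch in plain Python, not the method used by the database. It relies on the fact that translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Since this gene starts with ATG, that difference does not matter here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    Characters other than A, C, G, T raise a KeyError in this simple sketch.
    """
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence from this record.
first_row = "ATGCAAGCCG AAGAACAGTT CTCCTGCGGT GTCATTGTCG TCGCGGCCGG CCGCGGCGAG"
print(translate(first_row))  # MQAEEQFSCGVIVVAAGRGE
```

The output matches the first twenty residues of the protein sequence above. Translating the final row of the gene (`GGGGGTGATG CGTGA`) gives `GGDA` followed by the TGA stop codon, which matches the end of the protein.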