Gene Sare_1679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1679 
Symbol 
ID5704576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1936463 
End bp1938016 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content71% 
IMG OID641271183 
Product4-diphosphocytidyl-2C-methyl-D-erythritol synthase 
Protein accessionYP_001536558 
Protein GI159037305 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.325361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.131768 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCACG AGCAGAGCGC CGTAGCGTCG GCTGGTCCCC CGCCCTGGCG GCCCAGCCGG 
ACCGTGGCCG TGATCTTGGC CGGCGGTACC GGCACCCGGC TCGGCCTCGG CATCCCCAAA
CAGCTGCTCA AGGTCGCCGG TAAGCCGATC ATCGAGCACA CCCTCGCCGT CTTCGAGGCC
GCGCCCGAGA TCGACGAGAT CATCGTGCTG ATGGCGGCCG GGCACGTCCC CGACGTCGAG
CGGATCGTGC GACAGGCCGG GTTCGGCAAG GTCTCGGCCG TCGTCGAGGG CGGGCAGACC
CGCAACGCCA CCACCCGCCT CGCGCTGGAC GTGATCGGTC CCGACGACTG CAACATCCTC
TTCCACGACG CGGTGCGTCC GCTGATCAGC GGACGGGTCA TTCGGGAGTG CGTGAACGCG
CTCTGGACGT ACTCGGCGGT CGATGTGGCC ATCCCGTCGG CGGACACGAT CATCCAGGTG
GACGAGGACG ACCGCATCAC CGACATCCCG GTCCGCGCCC GGCTACGCCG GGGCCAGACC
CCTCAGGCGT TCCGCTCCGG CACCATCCGC GAGGCATACC GGAAAGCGGC GGGTGACCCG
GACTTCGCGG CGACCGACGA CTGCGGCGTG GTGCTGCGCT ACCTGCCCGG CACGCCGATC
AAGGTGATCG ACGGCACCGA CGAGAACATC AAGGTCACCC ATCCGCTCGA CGTGCACCTG
GCGGACAAGC TGTTCCAACT CGCCGCGGCC AAGGCACACC GACTGGCCGA CCACCGGGGC
TACGCCGAGG AGCTGACCGG CCGCACGATC GTCGTCTTCG GCGGCAGCCA CGGCATCGGC
CAGGAGCTGA CCGAGTTGGC CCGGCACTAC GGCGCGCGGG TCTTCCCGGT CAGCCGCTCG
AGCACCGGCA CCCACGTGGA GCGGGCCGAG GACGTCGAGG CCGCGCTCCA GTCCGCCTTC
GCCGCCACCG GCCGGATCGA CCATGTGGTC GTCACCGCGG GGCTGTTGGA GAAGGGCATG
CTCGCCGACA TGGACGCCGG CACCGTGGAC CAGCTCCTCC AGGTGAACTT CGTCGGCCCG
GTGACGGTAG CCCGCGCGGC CCTGTCGTAC CTGCGGCGGA CCCAGGGCCA ACTGCTGCTG
TACACCTCCA GCTCGTACAC CCGGGGCCGG GCCCGGTACG CGCTGTACTC GGCCACCAAG
GCGGCCCTGG TGAACCTCAC CCAGGCCCTT GCCGACGAGT GGGCCGAGGC CGGCGTGCGG
GTCAACTGCG TCAACCCGGA GCGCACCGCC ACACCGATGC GTACCCGGGC GTTCGGTCCG
GAACCGGCGC ACACCCTGCT CACCGCGGAG GCGGTGGCCC GGGCATCGCT GGACGTGCTG
CTCTCCGGGC TCACCGGCCA GGTCATCGAT GTGCGTCGCG ACGACGAGGA GGAGGCAGAG
TCGTCGGCTG TCCTGCCGGC GCAGGCCGGG CGGGAGCGCA ACGAGCGCCC GACCGACCAA
GTGGACGCGG GCCACGGCAC GGATCGGGTG GAGGCGGGCC ACGGTGCGCG GTGA
 
Protein sequence
MTHEQSAVAS AGPPPWRPSR TVAVILAGGT GTRLGLGIPK QLLKVAGKPI IEHTLAVFEA 
APEIDEIIVL MAAGHVPDVE RIVRQAGFGK VSAVVEGGQT RNATTRLALD VIGPDDCNIL
FHDAVRPLIS GRVIRECVNA LWTYSAVDVA IPSADTIIQV DEDDRITDIP VRARLRRGQT
PQAFRSGTIR EAYRKAAGDP DFAATDDCGV VLRYLPGTPI KVIDGTDENI KVTHPLDVHL
ADKLFQLAAA KAHRLADHRG YAEELTGRTI VVFGGSHGIG QELTELARHY GARVFPVSRS
STGTHVERAE DVEAALQSAF AATGRIDHVV VTAGLLEKGM LADMDAGTVD QLLQVNFVGP
VTVARAALSY LRRTQGQLLL YTSSSYTRGR ARYALYSATK AALVNLTQAL ADEWAEAGVR
VNCVNPERTA TPMRTRAFGP EPAHTLLTAE AVARASLDVL LSGLTGQVID VRRDDEEEAE
SSAVLPAQAG RERNERPTDQ VDAGHGTDRV EAGHGAR