Gene Strop_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1684 
Symbol 
ID5058143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1925125 
End bp1926678 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content70% 
IMG OID640473956 
Product4-diphosphocytidyl-2C-methyl-D-erythritol synthase 
Protein accessionYP_001158526 
Protein GI145594229 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.228213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCACG AGCAGACCGC CGTAGCGTCG GCCGGCGCCC AGCCCTGGCG GCCCACCCGG 
ACCGTCGCGG TGATCCTGGC CGGTGGTACC GGCACCCGAC TCGGCCTCGG TATCCCCAAA
CAGCTACTCA AGGTCGCCGG CAAACCGATC ATCGAGCACA CCCTCGCTGT CTTCGAGGCC
GCGCCCGAGA TTGACGAGAT CATCGTGCTG ATGGCGGCCG GGCACGTCCC GGACGTCGAA
CAGATCGTGC AGCAGGCCGG GTTCGGCAAG GTCTCGGCCG TCGTCGAGGG CGGGCAGACC
CGCAACGCCA CCACCCGCCT CGCGCTGGAG ATGATCGGTC CGGACGACTG CAACATCCTG
TTCCACGACG CGGTACGTCC GCTGACTAGC GGGCGCATCA TCCGGGAGTG CGTGAACGCG
CTCTGGACGT ACTCGGCGGT TGATGTGGCC ATCCCGTCAG CGGACACGAT CATCCAGGTG
GACGAGGACG ACTGCATCAC CGACATCCCG GTCCGCTCTC GGCTACGCCG GGGTCAGACC
CCCCAGGCCT TCCGCTCCGG TACCATCCGC GCGGCGTACC AGGAAGCGGC GGGTGACCCG
GACTTCGCCG CGACCGACGA CTGCGGCGTG GTGCTGCGCT ACCTGCCCGG CACGCCGATC
AAGGTGATCG ACGGCACCGA CGAGAACATC AAGGTCACCC ACCCGCTCGA TGTGCACCTG
GCGGACAAGC TGTTTCAACT CGCCGCCGCC AAGGCGCCCC GGTTAGCCGA CCACCGGGAC
TACATCGAGG AGCTGACCGG CCGCACGATC GTCGTCTTCG GCGGCAGCCA CGGCATCGGC
CAGGAGCTGA CCGCACTGGC CCGGCGCTAC GGCGCCCGGG TCTTCCCGGT CAGCCGCTCG
AGCACCGGCA CCCACGTGGA GCGGGCCGAG GACGTCGAGG CCGCACTCCA GACCGCCTTC
GCCGCGACCG GCCGGATCGA CCATGTGGTC GTCACCGCCG GGCTGTTGGA GAAGGGCATG
CTCGCCGACA TGGACGCCGG CACCGTGGAC CAACTGCTCC AGGTGAACTT CGTCGGCCCG
GTGACGGTAG CCCGCGCGGC ACTACCGTAC CTACGGCGGA CCCAGGGCCA GCTGCTGCTC
TACACCTCCA GCTCGTACAC CCGGGGCCGA GCCCGGTACG CGCTCTACTC GGCCACCAAG
GCAGCCCTGG TGAACCTCAC CCAGGCCCTC GCCGACGAGT GGGCCGAGGC CGACGTACGG
GTCAACTGCG TCAACCCGGA GCGCACCGCC ACGCCGATGC GTACCCGGGC GTTCGGCCCA
GAACCGGCGC ACACCCTACT CACCGCGGCG GCGGTGGCGC GGGCCTCGCT GGACGTACTG
CTCTCCGGGC TCACCGGGCA GGTGATCGAC GTACGCCGTG CGGACGACGA GGAGGCGGAG
TCGATGGCTG TCCTGCCGAC GCAGGCCGGA GGGGAGCGCG CCGAGCGCCC CGCCGATCAG
TCGGGGGCGG GTCGTGGTCC GGACCAGGTG GAGGCGAACC ACGGTGCGCG GTGA
 
Protein sequence
MTHEQTAVAS AGAQPWRPTR TVAVILAGGT GTRLGLGIPK QLLKVAGKPI IEHTLAVFEA 
APEIDEIIVL MAAGHVPDVE QIVQQAGFGK VSAVVEGGQT RNATTRLALE MIGPDDCNIL
FHDAVRPLTS GRIIRECVNA LWTYSAVDVA IPSADTIIQV DEDDCITDIP VRSRLRRGQT
PQAFRSGTIR AAYQEAAGDP DFAATDDCGV VLRYLPGTPI KVIDGTDENI KVTHPLDVHL
ADKLFQLAAA KAPRLADHRD YIEELTGRTI VVFGGSHGIG QELTALARRY GARVFPVSRS
STGTHVERAE DVEAALQTAF AATGRIDHVV VTAGLLEKGM LADMDAGTVD QLLQVNFVGP
VTVARAALPY LRRTQGQLLL YTSSSYTRGR ARYALYSATK AALVNLTQAL ADEWAEADVR
VNCVNPERTA TPMRTRAFGP EPAHTLLTAA AVARASLDVL LSGLTGQVID VRRADDEEAE
SMAVLPTQAG GERAERPADQ SGAGRGPDQV EANHGAR