Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1679 |
Symbol | |
ID | 5704576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1936463 |
End bp | 1938016 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271183 |
Product | 4-diphosphocytidyl-2C-methyl-D-erythritol synthase |
Protein accession | YP_001536558 |
Protein GI | 159037305 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase |
TIGRFAM ID | [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.325361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.131768 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACG AGCAGAGCGC CGTAGCGTCG GCTGGTCCCC CGCCCTGGCG GCCCAGCCGG ACCGTGGCCG TGATCTTGGC CGGCGGTACC GGCACCCGGC TCGGCCTCGG CATCCCCAAA CAGCTGCTCA AGGTCGCCGG TAAGCCGATC ATCGAGCACA CCCTCGCCGT CTTCGAGGCC GCGCCCGAGA TCGACGAGAT CATCGTGCTG ATGGCGGCCG GGCACGTCCC CGACGTCGAG CGGATCGTGC GACAGGCCGG GTTCGGCAAG GTCTCGGCCG TCGTCGAGGG CGGGCAGACC CGCAACGCCA CCACCCGCCT CGCGCTGGAC GTGATCGGTC CCGACGACTG CAACATCCTC TTCCACGACG CGGTGCGTCC GCTGATCAGC GGACGGGTCA TTCGGGAGTG CGTGAACGCG CTCTGGACGT ACTCGGCGGT CGATGTGGCC ATCCCGTCGG CGGACACGAT CATCCAGGTG GACGAGGACG ACCGCATCAC CGACATCCCG GTCCGCGCCC GGCTACGCCG GGGCCAGACC CCTCAGGCGT TCCGCTCCGG CACCATCCGC GAGGCATACC GGAAAGCGGC GGGTGACCCG GACTTCGCGG CGACCGACGA CTGCGGCGTG GTGCTGCGCT ACCTGCCCGG CACGCCGATC AAGGTGATCG ACGGCACCGA CGAGAACATC AAGGTCACCC ATCCGCTCGA CGTGCACCTG GCGGACAAGC TGTTCCAACT CGCCGCGGCC AAGGCACACC GACTGGCCGA CCACCGGGGC TACGCCGAGG AGCTGACCGG CCGCACGATC GTCGTCTTCG GCGGCAGCCA CGGCATCGGC CAGGAGCTGA CCGAGTTGGC CCGGCACTAC GGCGCGCGGG TCTTCCCGGT CAGCCGCTCG AGCACCGGCA CCCACGTGGA GCGGGCCGAG GACGTCGAGG CCGCGCTCCA GTCCGCCTTC GCCGCCACCG GCCGGATCGA CCATGTGGTC GTCACCGCGG GGCTGTTGGA GAAGGGCATG CTCGCCGACA TGGACGCCGG CACCGTGGAC CAGCTCCTCC AGGTGAACTT CGTCGGCCCG GTGACGGTAG CCCGCGCGGC CCTGTCGTAC CTGCGGCGGA CCCAGGGCCA ACTGCTGCTG TACACCTCCA GCTCGTACAC CCGGGGCCGG GCCCGGTACG CGCTGTACTC GGCCACCAAG GCGGCCCTGG TGAACCTCAC CCAGGCCCTT GCCGACGAGT GGGCCGAGGC CGGCGTGCGG GTCAACTGCG TCAACCCGGA GCGCACCGCC ACACCGATGC GTACCCGGGC GTTCGGTCCG GAACCGGCGC ACACCCTGCT CACCGCGGAG GCGGTGGCCC GGGCATCGCT GGACGTGCTG CTCTCCGGGC TCACCGGCCA GGTCATCGAT GTGCGTCGCG ACGACGAGGA GGAGGCAGAG TCGTCGGCTG TCCTGCCGGC GCAGGCCGGG CGGGAGCGCA ACGAGCGCCC GACCGACCAA GTGGACGCGG GCCACGGCAC GGATCGGGTG GAGGCGGGCC ACGGTGCGCG GTGA
|
Protein sequence | MTHEQSAVAS AGPPPWRPSR TVAVILAGGT GTRLGLGIPK QLLKVAGKPI IEHTLAVFEA APEIDEIIVL MAAGHVPDVE RIVRQAGFGK VSAVVEGGQT RNATTRLALD VIGPDDCNIL FHDAVRPLIS GRVIRECVNA LWTYSAVDVA IPSADTIIQV DEDDRITDIP VRARLRRGQT PQAFRSGTIR EAYRKAAGDP DFAATDDCGV VLRYLPGTPI KVIDGTDENI KVTHPLDVHL ADKLFQLAAA KAHRLADHRG YAEELTGRTI VVFGGSHGIG QELTELARHY GARVFPVSRS STGTHVERAE DVEAALQSAF AATGRIDHVV VTAGLLEKGM LADMDAGTVD QLLQVNFVGP VTVARAALSY LRRTQGQLLL YTSSSYTRGR ARYALYSATK AALVNLTQAL ADEWAEAGVR VNCVNPERTA TPMRTRAFGP EPAHTLLTAE AVARASLDVL LSGLTGQVID VRRDDEEEAE SSAVLPAQAG RERNERPTDQ VDAGHGTDRV EAGHGAR
|
| |