Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_1684 |
Symbol | |
ID | 5058143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 1925125 |
End bp | 1926678 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640473956 |
Product | 4-diphosphocytidyl-2C-methyl-D-erythritol synthase |
Protein accession | YP_001158526 |
Protein GI | 145594229 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase |
TIGRFAM ID | [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.228213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACG AGCAGACCGC CGTAGCGTCG GCCGGCGCCC AGCCCTGGCG GCCCACCCGG ACCGTCGCGG TGATCCTGGC CGGTGGTACC GGCACCCGAC TCGGCCTCGG TATCCCCAAA CAGCTACTCA AGGTCGCCGG CAAACCGATC ATCGAGCACA CCCTCGCTGT CTTCGAGGCC GCGCCCGAGA TTGACGAGAT CATCGTGCTG ATGGCGGCCG GGCACGTCCC GGACGTCGAA CAGATCGTGC AGCAGGCCGG GTTCGGCAAG GTCTCGGCCG TCGTCGAGGG CGGGCAGACC CGCAACGCCA CCACCCGCCT CGCGCTGGAG ATGATCGGTC CGGACGACTG CAACATCCTG TTCCACGACG CGGTACGTCC GCTGACTAGC GGGCGCATCA TCCGGGAGTG CGTGAACGCG CTCTGGACGT ACTCGGCGGT TGATGTGGCC ATCCCGTCAG CGGACACGAT CATCCAGGTG GACGAGGACG ACTGCATCAC CGACATCCCG GTCCGCTCTC GGCTACGCCG GGGTCAGACC CCCCAGGCCT TCCGCTCCGG TACCATCCGC GCGGCGTACC AGGAAGCGGC GGGTGACCCG GACTTCGCCG CGACCGACGA CTGCGGCGTG GTGCTGCGCT ACCTGCCCGG CACGCCGATC AAGGTGATCG ACGGCACCGA CGAGAACATC AAGGTCACCC ACCCGCTCGA TGTGCACCTG GCGGACAAGC TGTTTCAACT CGCCGCCGCC AAGGCGCCCC GGTTAGCCGA CCACCGGGAC TACATCGAGG AGCTGACCGG CCGCACGATC GTCGTCTTCG GCGGCAGCCA CGGCATCGGC CAGGAGCTGA CCGCACTGGC CCGGCGCTAC GGCGCCCGGG TCTTCCCGGT CAGCCGCTCG AGCACCGGCA CCCACGTGGA GCGGGCCGAG GACGTCGAGG CCGCACTCCA GACCGCCTTC GCCGCGACCG GCCGGATCGA CCATGTGGTC GTCACCGCCG GGCTGTTGGA GAAGGGCATG CTCGCCGACA TGGACGCCGG CACCGTGGAC CAACTGCTCC AGGTGAACTT CGTCGGCCCG GTGACGGTAG CCCGCGCGGC ACTACCGTAC CTACGGCGGA CCCAGGGCCA GCTGCTGCTC TACACCTCCA GCTCGTACAC CCGGGGCCGA GCCCGGTACG CGCTCTACTC GGCCACCAAG GCAGCCCTGG TGAACCTCAC CCAGGCCCTC GCCGACGAGT GGGCCGAGGC CGACGTACGG GTCAACTGCG TCAACCCGGA GCGCACCGCC ACGCCGATGC GTACCCGGGC GTTCGGCCCA GAACCGGCGC ACACCCTACT CACCGCGGCG GCGGTGGCGC GGGCCTCGCT GGACGTACTG CTCTCCGGGC TCACCGGGCA GGTGATCGAC GTACGCCGTG CGGACGACGA GGAGGCGGAG TCGATGGCTG TCCTGCCGAC GCAGGCCGGA GGGGAGCGCG CCGAGCGCCC CGCCGATCAG TCGGGGGCGG GTCGTGGTCC GGACCAGGTG GAGGCGAACC ACGGTGCGCG GTGA
|
Protein sequence | MTHEQTAVAS AGAQPWRPTR TVAVILAGGT GTRLGLGIPK QLLKVAGKPI IEHTLAVFEA APEIDEIIVL MAAGHVPDVE QIVQQAGFGK VSAVVEGGQT RNATTRLALE MIGPDDCNIL FHDAVRPLTS GRIIRECVNA LWTYSAVDVA IPSADTIIQV DEDDCITDIP VRSRLRRGQT PQAFRSGTIR AAYQEAAGDP DFAATDDCGV VLRYLPGTPI KVIDGTDENI KVTHPLDVHL ADKLFQLAAA KAPRLADHRD YIEELTGRTI VVFGGSHGIG QELTALARRY GARVFPVSRS STGTHVERAE DVEAALQTAF AATGRIDHVV VTAGLLEKGM LADMDAGTVD QLLQVNFVGP VTVARAALPY LRRTQGQLLL YTSSSYTRGR ARYALYSATK AALVNLTQAL ADEWAEADVR VNCVNPERTA TPMRTRAFGP EPAHTLLTAA AVARASLDVL LSGLTGQVID VRRADDEEAE SMAVLPTQAG GERAERPADQ SGAGRGPDQV EANHGAR
|
| |