Gene Information |
Locus tag | Csal_1525 |
Symbol | |
ID | 4029227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1737249 |
End bp | 1738145 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637966714 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_573577 |
Protein GI | 92113649 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
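The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1737249, 1738145)
print(length)                  # 897
print(protein_length(length))  # 298
```

Both results match the Gene Length (897 bp) and Protein Length (298 aa) rows.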
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.324742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGCC TCAGCCTGCC GGCGCCCGCC AAGCTCAATC GCATGCTGCA TATCGTGGGC CGCCGTGCCG ACGGCTACCA CGAGCTGCAG ACACTGTTCC AGTTTCTCGA CCGGAGCGAC ACGCTGCACT TCAGCCCCCG CGCTGACGGC GCCATTCACC TGGCCCCGGC CATCGCGGAC GTCGATCACG ACGCCAACCT CATCGTGCGC GCCGCCCGTC TGCTGCAGCA CGCAAGCGGC ACGCATCAGG GCGTGGACAT TCACCTCGAC AAACGGCTCC CCATGGGAGG CGGCCTGGGC GGCGGCAGCT CCGACGCGGC CACCACCCTG CTCGCGCTCG ACCGCTTGTG GTCGCTCGAC CTGGGGCTGC CCCGTCTCGC CGAGCTGGGT CTCACGCTCG GCGCCGACGT GCCGGTCTTC GTGCGCGGGC ACAGCGCCTG GGCCGAAGGT ATCGGCGAAC GCCTTACGCC AGTGACGCTC GACACCCCCT GGTTCGTGGT CATACACCCG GGCGAGGAAA TCGCCACGCC CGCCGTGTTC GGGCACCCCG AATTGACACG CGACACCCCG CCGATTAGTA TGGCGCGCGC ACTGCGAGGG GGAGCCGAAC AGGGGCGCGC CTGGCGCAAT GACTGCGAAG CCGTGGTGCG ACGCCTCTCG CCGGACGTCG CACATGCGCT CGACTGGTTA TCGGCCTTCG GTCCGGCCAT GCTCACGGGA ACGGGCAGTT GCCTTTTCTG TCCCTTGACG AGCGAACGGC AAGCCGATAG GATTTTGCGC CGTGTTGGCA GCCACTGGCA TGCGTTCAAG GCACGCGGCT GCAACACTTC TCCCCTCCAT GACGCTCTGG GCATTCACGA CGAATGGTCA CCCATGAGCC AATACGGGGA CGCCTGA
|
Protein sequence | MQRLSLPAPA KLNRMLHIVG RRADGYHELQ TLFQFLDRSD TLHFSPRADG AIHLAPAIAD VDHDANLIVR AARLLQHASG THQGVDIHLD KRLPMGGGLG GGSSDAATTL LALDRLWSLD LGLPRLAELG LTLGADVPVF VRGHSAWAEG IGERLTPVTL DTPWFVVIHP GEEIATPAVF GHPELTRDTP PISMARALRG GAEQGRAWRN DCEAVVRRLS PDVAHALDWL SAFGPAMLTG TGSCLFCPLT SERQADRILR RVGSHWHAFK ARGCNTSPLH DALGIHDEWS PMSQYGDA
|
| |
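The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be joined back into a contiguous sequence and its GC fraction computed is shown below. The helper names are hypothetical, and only the first three blocks of the gene sequence are used for brevity, so the printed fraction is not the whole-gene GC content.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three display blocks of the gene sequence from this record.
first_blocks = "ATGCAACGCC TCAGCCTGCC GGCGCCCGCC"
seq = clean_sequence(first_blocks)
print(len(seq))  # 30
print(gc_fraction(seq))
```

Passing the full Gene sequence field through the same two functions would give a value to compare against the GC content row.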