Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1884 |
Symbol | |
ID | 4571226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2180274 |
End bp | 2181158 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639766466 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_912324 |
Protein GI | 119357680 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCATC TTTCAGTAAA ATCATTTGCC AAAATCAATC TCGGCCTTCT CATAACCGGT AAACGCAAGG ACGGCTATCA TACGCTTGAG ACGATTTTTG CGCCAATCAA CTGGTATGAT ACCATCGGAT TTTCCGATTC CGACGTGATT TCAATGAGCT GCTCAAACAT CGATCTTCCT GTTGACGACA ATAATCTCTG CATCAGGGCG GCAAGAGCCT TGCAGCAGTC AGCCTCTTGT TCGAAGGGCG CAGCCATGAA TCTTCAGAAA GTGGTGCCTT TCGGTGCAGG GCTTGGCGGC GGGAGCAGCG ATGCCGCAAC GGTTCTCAGG GTACTCAATG AGCTGTGGAA GATTAATGTC TCTTCCGCCG AGCTGCATGA ACTTGCCGTA AAACTCGGTG CCGATGTACC TTATTTTCTT GAAATGAAGG GGCTGGCGTT TGCCAGAGGT ATTGGTGACG AACTTGAGGA TCTCGGTCTC ACCCTGCCGT TTCATGTGGT GACCGTTTTT CCCGAAGAGC ATATCTCAAC GGTATGGGCC TATAAAAATT TCTATCAAAA ATTCGACCGA CCGGTTCCGG ATCTCAGACT GCTTTTGCAG CGGCTTTGTC TGGATGGCGA CCGTTCCGTT CTCGGGGCTT TTGAAAATGA CTTTGAACCG GCAGTGTTCG ATCACTACCC GAAGGTTCGG GTTGTAAAAG AGAGTTTGCT CGATGCCGGC AGTTTTTATG CCTCTCTTTC CGGGAGCGGT TCAGCGGTGT TCGGTCTGTT CGATACGCTG GAAAATGCAG CTGGCGCGGT GTGCGCCATG CAGCAAAAGG GGTATCGGGT TACGCTTACC CCTCCCGGAT TTTCCATGGA AGCACAGGCC GGGTCAAGAT TATGA
|
Protein sequence | MDHLSVKSFA KINLGLLITG KRKDGYHTLE TIFAPINWYD TIGFSDSDVI SMSCSNIDLP VDDNNLCIRA ARALQQSASC SKGAAMNLQK VVPFGAGLGG GSSDAATVLR VLNELWKINV SSAELHELAV KLGADVPYFL EMKGLAFARG IGDELEDLGL TLPFHVVTVF PEEHISTVWA YKNFYQKFDR PVPDLRLLLQ RLCLDGDRSV LGAFENDFEP AVFDHYPKVR VVKESLLDAG SFYASLSGSG SAVFGLFDTL ENAAGAVCAM QQKGYRVTLT PPGFSMEAQA GSRL
|
| |