Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2587 |
Symbol | |
ID | 4023083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 2898580 |
End bp | 2899776 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637962784 |
Product | 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase |
Protein accession | YP_569717 |
Protein GI | 91977058 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase [COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase |
TIGRFAM ID | [TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.439438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00531739 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGAATC CCCCCCGCAC TGCCGCCATC ATCGTCGCCG CCGGCCGCGG CCTCCGCGCA GGCGCGGGCG GCCCCAAGCA ATACCGAACG CTGGCCGGCC GACCGGTTAT CGCACGTGCT TTGCAGCCGT TTTGCACTCA CCCGGAGGTG TTTGCGGTCC AGCCGGTGAC TAACCCGGAC GACACCGCGA TCTTCAACGA CGCCGTGACA GGCCTGAACT TCCGTCCCGC GGTCGGCGGC GGCGCGACCC GACAGGGCTC GGTTCGCGCC GGGCTGGAGG CGCTGGCGGA GTTGAACCCT GACATCGTGC TGATCCACGA CGCAGCGCGT CCCTTTGTCA CGCCCGATCT GATCTCGCGC GCGATCGTCG CCGCAGGTCA GACCGGCGCC GCGCTGCCGG TCGTCGCGAT CAACGACACC GTCAAGCAGA TCAACGCCGA AGGCTGCGTC GAGGCGACGC CGGACCGCGC GCGGTTGCGA ATCGCGCAGA CCCCGCAGGC GTTCCGCTTC GACGTCATCC TCGATGCGCA TCGCCGCGCC GCGCGCGACG GCCGCGACGA TTTCACCGAC GACGCCGCGA TCGCCGAGTG GGCAGGATTG ACGGTCTCAA CCTTTGAAGG CGATGCTGCC AACATGAAAC TGACGACGCC GGACGATTTC ATCCGCGAAG AGAGTCGCCT CACCGCGCTG CTCGGCGACA TCCGCACGGG CACCGGCTAC GACGTTCACG CCTTCGGCGA CGGCGATCAC GTCTGGCTGT GCGGGCTGAA GGTGCCGCAC AATCGCGGCT TCCTTGCGCA TTCCGATGGT GACGTCGGCT TGCACGCGCT GGTCGACGCG ATCCTCGGCG CGCTGGCGGA CGGCGACATC GGCTCGCACT TCCCGCCGAC CGATCCGCAA TGGAAGGGCG CGGCCTCCGA CAAGTTCTTG AAGTACGCGG TCGAGCGTGT CGCCGCGCGC GGCGGCCGCA TCGCCAATCT CGAAGTGACG ATGATCTGCG AACGGCCGAA GATCGGCCCG CTGCGCGAGG CGATGCGCGC CAGGATCGCC GAGATCACCG GGCTTCCCGT GTCGCGCATC GCGGTCAAGG CCACCACCAG CGAAAGGCTC GGCTTCACCG GGCGCGAGGA AGGCATCGCC GCCACCGCGT CCGCCACCAT CCGGCTGCCC TGGGGCGCCG AAGGACTGGC CGGCTGA
|
Protein sequence | MPNPPRTAAI IVAAGRGLRA GAGGPKQYRT LAGRPVIARA LQPFCTHPEV FAVQPVTNPD DTAIFNDAVT GLNFRPAVGG GATRQGSVRA GLEALAELNP DIVLIHDAAR PFVTPDLISR AIVAAGQTGA ALPVVAINDT VKQINAEGCV EATPDRARLR IAQTPQAFRF DVILDAHRRA ARDGRDDFTD DAAIAEWAGL TVSTFEGDAA NMKLTTPDDF IREESRLTAL LGDIRTGTGY DVHAFGDGDH VWLCGLKVPH NRGFLAHSDG DVGLHALVDA ILGALADGDI GSHFPPTDPQ WKGAASDKFL KYAVERVAAR GGRIANLEVT MICERPKIGP LREAMRARIA EITGLPVSRI AVKATTSERL GFTGREEGIA ATASATIRLP WGAEGLAG
|
| |