Gene RPB_2885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2885 
Symbol 
ID3910679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3285487 
End bp3286692 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content70% 
IMG OID637884786 
Product2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 
Protein accessionYP_486498 
Protein GI86750002 
COG category[I] Lipid transport and metabolism 
COG ID[COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.451878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAC CTCCCCGTAC CGCCGCCATC ATCGTCGCCG CCGGCCGCGG CCTCCGCGCT 
GGCGCGGGCG GCCCCAAACA ATACCGCACG CTCGCCGGAC GGCCGGTGAT TGCCAGGGCT
TTGCAGCCGT TTTGTACGCA CCCGGAGGTC TTTGCGGTCC AGCCCGTGAC CAATCCGGAC
GACACCGCGA CGTTCAACGA GGCGGTCGCC GGGCTCGATT TCCGACCTGC GGTCGGTGGC
GGCGCGACCC GGCAGGCGTC GGTTCGCGCC GGGCTGGAAG CGCTGGCCGA ACTGAACCCC
GACATCGTGC TGATCCACGA CGCGGCGCGT CCCTTTGTCA CGCCGGATCT GATCTCGCGT
GCGATCGTCG CGGCCGGCCA GACCGGCGCG GCGCTGCCGG TGATCGCGGT CAACGACACC
GTCAAGCAGG TCGATGCCGA GGGCTGCGTC GTGGCGACGC CGGACCGCGC GCAATTGCGG
ATCGCGCAGA CGCCGCAGGC CTTCCGTTTC GACGTGATTC TCGACGCGCA CCGCCGCGCC
GCGCGCGATG GTCGCGATGA CTTCACCGAC GACGCCGCGA TCGCCGAATG GGCGGGATTG
ACGGTGTCCA CGTTCGAGGG CGATGCTGCC AACATGAAAC TGACCACGCC GGAAGACTTC
TCGCGCGAAG AGAGCCGCCT CACCGCGGCG CTCGGCGACA TCCGCACCGG CACCGGCTAC
GACGTCCACG CCTTCGGCGA CGGCGATCAC GTCTGGCTGT GCGGGCTGAA GGTGCCGCAC
AATCGCGGCT TCCTGGCGCA TTCCGACGGC GACGTCGGAC TGCACGCGCT GGTCGACGCC
ATCCTCGGCG CGCTGGCCGA CGGCGACATC GGCTCGCACT TCCCGCCGAC CGACCCGCAA
TGGAAGGGCG CGGCCTCCGA CAAGTTCCTG AAATACGCGG TCGACCGCGT CGCCGCGCGC
GGCGGCCGCA TCGCCAATCT CGAAGTGACG ATGATCTGCG AGCGGCCGAA GATCGGCCCG
CTGCGCGACC CGATGCGGCA GCGCATCGCC GAGATCACCG GCGTTCCGGT GTCGCGCGTC
GCCGTGAAGG CGACCACCAG CGAGCGGCTC GGCTTCACCG GCCGCGAGGA AGGCATCGCC
GCCACCGCGT CGGCCACCAT CCGGCTGCCT TGGTCCCCTT GGGGCGCCGA AGGACAGGCC
TCCTGA
 
Protein sequence
MQKPPRTAAI IVAAGRGLRA GAGGPKQYRT LAGRPVIARA LQPFCTHPEV FAVQPVTNPD 
DTATFNEAVA GLDFRPAVGG GATRQASVRA GLEALAELNP DIVLIHDAAR PFVTPDLISR
AIVAAGQTGA ALPVIAVNDT VKQVDAEGCV VATPDRAQLR IAQTPQAFRF DVILDAHRRA
ARDGRDDFTD DAAIAEWAGL TVSTFEGDAA NMKLTTPEDF SREESRLTAA LGDIRTGTGY
DVHAFGDGDH VWLCGLKVPH NRGFLAHSDG DVGLHALVDA ILGALADGDI GSHFPPTDPQ
WKGAASDKFL KYAVDRVAAR GGRIANLEVT MICERPKIGP LRDPMRQRIA EITGVPVSRV
AVKATTSERL GFTGREEGIA ATASATIRLP WSPWGAEGQA S