Gene RPD_2033 details


Gene Information

Locus tag: RPD_2033
Symbol:
ID: 4022515
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2278432
End bp: 2279619
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 66%
IMG OID: 637962226
Product: pyrroloquinoline quinone biosynthesis protein PqqE
Protein accession: YP_569169
Protein GI: 91976510
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR02109] coenzyme PQQ biosynthesis protein E
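
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2278432
end_bp = 2279619
gene_length_bp = 1188
protein_length_aa = 395


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the assumptions above.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2279619 - 2278432 + 1 = 1188 bp, and 1188 / 3 - 1 = 395 aa, which agrees with the listed values.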


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.455844
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCTT CGATCGATCG CCCGACGAGC GCGGTTGATC CAGCCGATGG CGTCGCGGTG 
TTGGAAACCC GCGGCTCGGT CGCGGAGAGC TTCGGCATTC CGCTCGCGGT GCTGCTGGAA
TTGACGCATC GCTGCCCGCT GCAGTGCCCG TATTGCTCGA ACCCGGTGGA GCTGGAGCGC
GGCGGAGCGG AGCTGTCGAC CGACGAATGG AAGCGGGTGT TGAGTGAACT CGCCGCGATC
GGCGTGCTGC AGATTCATTT TTCCGGCGGC GAGCCGACCG CGCGAAAAGA TCTCGACGAG
CTGGTGCGGC ACGCCAGCGA GGTCGGGCTG TACACCAATT TGATCACGTC GGCGGTGCTG
TTGACGCGCG TGCGCCTTGC GGCGCTGGCC GACGCCGGGC TGTGCCACGT CCAGATCAGC
TTTCAAGGCA ACGAGCCGGC CGTCGCCGAT CGCGTCGCGG GTTTCGCCAG CGCTCATGCG
AAGAAAATCG AAGTCGCGCG CTGGACCCGC GAGCTCGATC TGCCGCTCAC CGTCAATGCC
GTGATGCATC GGCAGAACCT TCATCAACTG CCCGACATCA TCGAGATGGC GGTCGCGCTC
GACGCCGACC GGCTCGAAGT CGCCAATGTT CAATATTACG GCTGGGCGCT GAAGAACCGC
GCTGCGCTGA TGCCGACGCT GCAGCAGATC GAGGGCTGCA CGGCGATCGT GGAGACCGCG
CGCGAAAGGC TGAAGGGCGT GCTGGCGATC GACTACGTCA TCCCGGACTA CTACGCGCTG
CGGCCGAAGA AGTGCATGGG CGGCTGGGGC CGGCAGTTCT TCAACATCTC GCCGAGCGGC
AGGGTGCTGC CGTGCCACGC CGCCGAGACC ATCACCGGAC TCGTCTTCGA CTCGGTGCGG
GACGGCAAGT CGATTGCGGA GATCTGGCGC AGCTCCGAAG CGTTCAACCG CTATCGCGGC
ACCGGCTGGA TGCAGGAGCC GTGTGCAAGC TGCGCCTTCA AGGACATCGA TTTCGGCGGC
TGCCGCTGTC AGGCGTTTGC GCTCACCGGC GATGCTGCGG CGACCGATCC GGCCTGCGCA
TTGTCGCCGT TGCACGAGCG CATCTTCAAG ACCGCGGAGG CGGAAGCGGC GCAGGGCAGC
GACCGCTTCC TGTATCGCAA TTTCGCCGGC GGGACGGCGG AGGCGTGA
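
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the listing is used as demonstration input, so the printed figure describes those 60 bases and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence listing above.
first_row = "ATGAGCGCTT CGATCGATCG CCCGACGAGC GCGGTTGATC CAGCCGATGG CGTCGCGGTG"

seq = clean_sequence(first_row)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full listing to `clean_sequence` in place of `first_row` gives a value that can be compared with the GC content field of the record.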
 
Protein sequence
MSASIDRPTS AVDPADGVAV LETRGSVAES FGIPLAVLLE LTHRCPLQCP YCSNPVELER 
GGAELSTDEW KRVLSELAAI GVLQIHFSGG EPTARKDLDE LVRHASEVGL YTNLITSAVL
LTRVRLAALA DAGLCHVQIS FQGNEPAVAD RVAGFASAHA KKIEVARWTR ELDLPLTVNA
VMHRQNLHQL PDIIEMAVAL DADRLEVANV QYYGWALKNR AALMPTLQQI EGCTAIVETA
RERLKGVLAI DYVIPDYYAL RPKKCMGGWG RQFFNISPSG RVLPCHAAET ITGLVFDSVR
DGKSIAEIWR SSEAFNRYRG TGWMQEPCAS CAFKDIDFGG CRCQAFALTG DAAATDPACA
LSPLHERIFK TAEAEAAQGS DRFLYRNFAG GTAEA
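
The protein sequence can be related to the gene sequence by translating codons. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons, so the sketch below builds the standard assignments and does not handle alternative start codons (not needed here, since the gene starts with ATG). The function name `translate` and the stop-at-first-stop behaviour are choices made for this illustration.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a (possibly blocked) DNA listing, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listing above (60 bases, 20 codons).
first_row = "ATGAGCGCTT CGATCGATCG CCCGACGAGC GCGGTTGATC CAGCCGATGG CGTCGCGGTG"
print(translate(first_row))
```

For the first row this yields MSASIDRPTSAVDPADGVAV, which matches the first 20 residues of the protein sequence listed above.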