Gene RPB_4688 details

Gene Information

Locus tag: RPB_4688
Symbol:
ID: 3912506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5304639
End bp: 5305727
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 69%
IMG OID: 637886593
Product: hypothetical protein
Protein accession: YP_488282
Protein GI: 86751786
COG category: [R] General function prediction only
COG ID: [COG5621] Predicted secreted hydrolase
TIGRFAM ID:
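
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon not counted as a residue. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5304639
end_bp = 5305727

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1089 362
```

Under these assumptions the result matches the listed gene length (1089 bp) and protein length (362 aa).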


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCTA GCCACCCGAT CTCACGCCGT GCGGTCGCCG GCGGCCTGCT CGCGCTCAGC 
CTCGGCGGAT CGCGCGCGCT CGCGCAAGGC TTCGCGGGGC TCGGCAGCGA GGCCGGCGAA
TTCGCCCCCG TCGTTCCCGG ACGGGTGCTG AGCTTCCCGG CCGACCACGG CGCCCACCCG
GATTTCCGCA TCGAATGGTG GTACCTGACG GCGAATCTGC AGGGCGCGGA CGGCAAGCCC
TACGGCGTGC AGTGGACGCT GTTCCGGCAG GCGATGACGC CGGGACCGCA GCGCGAAGGC
TGGGCCAATC AGCAGGTCTG GATGGCGCAT GCGGCGCTCT CCAGCGCCGA GACGCATCGC
TTCGCCGAGA AATTTTCCCG CGGCGGCATC GGTCAGGCCG GCGTGTCCGC CGCGCCGTTT
CGCGCCTTCA TCGACGACTG GCAGATGACC GGCGGCGACG CGATGGACGC GGCGACGCTG
TCGCCGCTCG ACGTCACCGC GACCGGTGCG GATTTCGGCT ACCGGCTGCG CCTCACCGCC
GAGCGGCCGC TGGTGCTGCA GGGCGACGCC GGCTATTCGC GCAAATCCGA GCGCGGGCAG
GCCTCGTATT ATTACAGTCA GCCCTATTTC GCCGCGCGCG GGACGCTGAC GTTGGACGGC
CGGCCGATCG AGGTCAGCGG CACCGCCTGG ATGGACCGCG AATTCTCCAG CCAGCCGCTG
GCGTCGGACC AGACCGGCTG GGACTGGTTC TCGCTGCATC TCGCCTCCGG CGAGAAAGTG
ATGCTGTTCC GGCTGCGCCA GAGCGACGGC AACGCCTATT TCGCCGGCAA CTGGATCGGG
CTCGACGGGC GGTCCGAACA GCTCGCGCCG GACGCCATCG TGCTCGATCC GATCGGCTTC
ACCGACACCG CCGGCCGCAA ACTGCCGACA TCCTGGCGTG TCCGTGTGCC GGTGCGCGGT
CTTGCGATCG AGACCGCACC GCTCAACCCG AACGCCTGGA TGGGCACCAG CTTTCCCTAT
TGGGAGGGGC CGATTTCGTT CAGCGGTAGC CAGAGCGGCA ACGGATATCT TGAGATGACC
GGCTATTGA
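
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any calculation such as GC content. The following is a minimal sketch of that cleanup. The helper names `clean_sequence`, `gc_percent`, and `looks_like_cds` are hypothetical and are not part of the source database. The `ATG` start check is a simplification, since translation table 11 also permits alternative start codons.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Loose check: length is a multiple of 3, starts with ATG, ends with a stop codon.

    Simplification: alternative start codons allowed by table 11 are not accepted.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First and last lines of the gene sequence above, used as a short illustration.
first_line = clean_sequence(
    "ATGAGCGCTA GCCACCCGAT CTCACGCCGT GCGGTCGCCG GCGGCCTGCT CGCGCTCAGC "
)
last_line = clean_sequence("GGCTATTGA")
print(len(first_line), len(last_line))  # prints: 60 9
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value to compare against the GC content field in the record.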
 
Protein sequence
MSASHPISRR AVAGGLLALS LGGSRALAQG FAGLGSEAGE FAPVVPGRVL SFPADHGAHP 
DFRIEWWYLT ANLQGADGKP YGVQWTLFRQ AMTPGPQREG WANQQVWMAH AALSSAETHR
FAEKFSRGGI GQAGVSAAPF RAFIDDWQMT GGDAMDAATL SPLDVTATGA DFGYRLRLTA
ERPLVLQGDA GYSRKSERGQ ASYYYSQPYF AARGTLTLDG RPIEVSGTAW MDREFSSQPL
ASDQTGWDWF SLHLASGEKV MLFRLRQSDG NAYFAGNWIG LDGRSEQLAP DAIVLDPIGF
TDTAGRKLPT SWRVRVPVRG LAIETAPLNP NAWMGTSFPY WEGPISFSGS QSGNGYLEMT
GY