Gene RPD_4398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4398 
Symbol 
ID4024923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4866604 
End bp4867692 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content67% 
IMG OID637964607 
Producthypothetical protein 
Protein accessionYP_571515 
Protein GI91978856 
COG category[R] General function prediction only 
COG ID[COG5621] Predicted secreted hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.117298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCTA GAGGCCTGAT CTCACGCCGC GCCTTCGCGG GCGGCTTGCT CGCGCTCGGG 
GCCAGTGGGC AACGCGTGCT GGCACAAGGA TTCGCAGGGC TCGGCAGCGA CGCGGGCGAA
TTCGCGCCGG TCGTGCCCGG GCGGCGGCTT TCGTTTCCCG AGGACCACGG CCCGCATCCG
GATTTCCGCA TCGAATGGTG GTACCTCACC GCAAATCTGA AAGACGCCGA CGGCAAGCCC
TACGGCGTGC AGTGGACGCT GTTCCGTCAG GCGATGACGC CGGGCCCGCA GCGCGAGGGC
TGGGCCAGTC AGCAGATCTG GATGGCGCAT GCGGCGCTCT CCAGCGCCGA GACGCATCGC
TTCGCCGAAA AATTTTCGCG CGGCGGGATT GGGCAGGCCG GCGTTACGGC TGCGCCGTTC
CGCGCCCTGA TCGACGACTG GGCGATGCAG GGCGGCGACG CGATGAAGGC TGCGACGTTG
TCGCCGCTCG ACGTTACCGC ATCAGGCTCG GACTTCAGCT ATCGGCTGCA ATTGACCGCC
GAGCGGCCGC TAGTGCTGCA AGGCGACGCC GGCTTTTCGC GTAAATCCGA CCGCGGCCAG
GCTTCGTATT ACTATAGCCA GCCTTATTTT GCCGCGCGCG GGACGGTGAC GCTCGACGGC
CGGGCGATCG AGGTCAGCGG CACAGCCTGG ATGGACCGCG AATGGTCGAG CCAACCGCTC
GCTTCCGACC AGACCGGCTG GGACTGGTTC TCGCTGCATC TCGCCTCCGG CGAGAAGGTG
ATGCTGTTCC GGCTGCGCCA GAGCGGCGGC CAAGCCTATT TCGCCGGCAA CTGGATCGGG
CTCGACGGCA AATCCGAGCC GCTCGCGCCG GATGCGATCG CGCTCGAACC GATCGGCTTC
ACCGAGACCG CCGGCCGCAG ACTGCCGACG CGCTGGCGCA TCAGCCTGCC CGGCCACGGT
CTGTCGATCG AGACCACGCC GCTGAACCCC AACAGCTGGA TGGGGACCAG CTTCCCATAC
TGGGAGGGAC CGATCTCGTT CAGCGGCAGC CAGGCCGGCA TCGGATATCT TGAGATGACC
GGCTATTGA
 
Protein sequence
MSARGLISRR AFAGGLLALG ASGQRVLAQG FAGLGSDAGE FAPVVPGRRL SFPEDHGPHP 
DFRIEWWYLT ANLKDADGKP YGVQWTLFRQ AMTPGPQREG WASQQIWMAH AALSSAETHR
FAEKFSRGGI GQAGVTAAPF RALIDDWAMQ GGDAMKAATL SPLDVTASGS DFSYRLQLTA
ERPLVLQGDA GFSRKSDRGQ ASYYYSQPYF AARGTVTLDG RAIEVSGTAW MDREWSSQPL
ASDQTGWDWF SLHLASGEKV MLFRLRQSGG QAYFAGNWIG LDGKSEPLAP DAIALEPIGF
TETAGRRLPT RWRISLPGHG LSIETTPLNP NSWMGTSFPY WEGPISFSGS QAGIGYLEMT
GY