Gene RPD_3855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3855 
Symbol 
ID4024371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4293706 
End bp4295724 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content66% 
IMG OID637964059 
Producthydrogenase 4 subunit B 
Protein accessionYP_570977 
Protein GI91978318 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC TCGCGGTCCA GCTGTGGTGT GTCGCCGCGC TGCTTGCGGC TGCGGTCGTC 
GCGGTTCCAC TGAGTCGCTG GCCCGGCCTC TCCACCGCCA TCATCTACAG CGCGACGCTG
ATGGCCTGCG CCATCGCGAC TGCGAGCGCG CTCACCTGGC TGCTCCTGCA TGCCGATGCC
GCCGCCGAGC GCACGCTGCC GATCGGGCTG CCATGGCTCG GCGCGCATCT GCGGCTCGAT
GCGCTGGCGG CGTTCTTCCT GGTCGTCGTC AATCTCGGCG GCGCGCTCGC GAGCCTTTAT
GCGCTGGGCT ACGGCCGCCA CGAACGGGCG CCGCATCGCG TGCTGCCGTT CTATCCGGCA
TTTCTCGCCG GGATGAATCT GGTCGTGCTG GCCGCCGATG CGTTCTCTTA TCTTTTGAGC
TGGGAGTTCA TGTCGCTGGC GTCGTGGGCG CTGGTGATGG CGCATCATCG CGACGGCGGC
AATCGCCGCG CCGGATACGT CTATCTGGTG ATGGCGAGCT TCGGCACGCT GGCGCTGCTG
CTCGCCTTCG GCCTGCTTGC GGGCACCGCC GGAGACTATG ATTTCGCGTC GATCCGGATC
GCTCCGCACT CGCCTTACGT CGCCACGCTT GTGCTGATCC TGATGTTGCT CGGCGCTGGC
TCCAAGGCCG GTCTGGTGCC CCTCCATGTC TGGCTGCCGC TGGCGCATCC GGCGGCGCCG
AGCCACGTCT CGGCGCTGAT GAGCGGCGTG ATGACCAAGG TGGCAATCTA CGGCTTCATC
CGCGTGGTGT TCGATCTGCT CGGGCAACCG GACTGGTCGA CGAGCGGCAT CGTGCTGGCG
CTCGGCGGCA TCACCGCGGC GCTCGGCATT CTCTACGCGC TGATGGAGAA GGACCTGAAG
CGGCTGCTCG CCTACTCGAC GATCGAGAAT GTCGGCATCG TGTTCGTCAG TCTCGGCCTG
GCTCTGGCGT TTCAGGCCAA TGACGACAAG GTGCCGGCGG CGCTGGCGTT CACCGCGGCG
CTGTTTCACA TCTTCAACCA TTCGCTGTTC AAGAGCCTGC TGTTCTTCGG TGCCGGCGCG
GTACTGAGTG CGACCGGGGA GCGCGACATG GACAGACTCG GCGGGCTGAT CCATCGTATG
CCGTTCACCA GCGTGGTGTT CCTGGTCGGC TGCGTCGCGA TTTCGGCGCT GCCCCCGTTC
AACGGCTTTG TTTCGGAATG GCTGATCTTC CAGGCGGTGC TGCAGAGCCC GCAATTGCCG
CAATGGGGCC TGAAGATCCT TGTGCCGGCG GTCGGCGGGC TGATGGCGCT GGCCGCCGCG
CTGGCGGCGG CGTGTTTCGT CAAGGCCTAT GGCGTCACCT TTCTCGGCCG GCCGCGCGGC
GCAGCGACGA TTGCCGCGAA AGAGGTCGAT CGTTTCTCGC TCGCCGCGAT GGCGATCCTC
GCCGCGTTGT GCCTGCTCGC CGGCGTTTTG CCCGGCGTGG TGATGGATGC TCTGGCTCCG
ATCGCAACCA ACATCCTCGG CAGCCGGCTG GCGCCGCAAA GCGGAATGGC GTGGCTCTCG
ATCGTGCCGA TCTCGGGAAA TCACAGTTCC TATAACGGTC TTCTGGTGGC GATGTTCATT
GCGCTGTCGG CGTTGCTGCT GTTCGTCGCG ATCCGGATGT TCTCGTCGGG TGCAGTGCGC
CGCGGACCGG CCTGGGGCTG CGGTTTCACC GATCCGGCCC CGCTCGCTCA ATATTCCGCG
GACGGGTTCG CGCAGCCGAT CCGTCGCGTG TTCGGAACGC TGATCTTCCG CGCCCGCGAT
CACGTCACCA TGCCGCCGCC GGGCGACACA GCGCCGGCGC GGCTCACCAT CGAACTGCAC
GATCAAATCT GGGAACGGAT CTACGCGCCG CTCGCCGGCG GCATCATGAT CGCCTCGGCA
CGTCTCGACA GGCTGCAGCT TCTCACGATC CGTGGTTATC TCAGCCTCGT CTTCGTCACC
CTTGTCACCC TGCTGCTGGT GCTCGCGATA TGGACGTGA
 
Protein sequence
MTELAVQLWC VAALLAAAVV AVPLSRWPGL STAIIYSATL MACAIATASA LTWLLLHADA 
AAERTLPIGL PWLGAHLRLD ALAAFFLVVV NLGGALASLY ALGYGRHERA PHRVLPFYPA
FLAGMNLVVL AADAFSYLLS WEFMSLASWA LVMAHHRDGG NRRAGYVYLV MASFGTLALL
LAFGLLAGTA GDYDFASIRI APHSPYVATL VLILMLLGAG SKAGLVPLHV WLPLAHPAAP
SHVSALMSGV MTKVAIYGFI RVVFDLLGQP DWSTSGIVLA LGGITAALGI LYALMEKDLK
RLLAYSTIEN VGIVFVSLGL ALAFQANDDK VPAALAFTAA LFHIFNHSLF KSLLFFGAGA
VLSATGERDM DRLGGLIHRM PFTSVVFLVG CVAISALPPF NGFVSEWLIF QAVLQSPQLP
QWGLKILVPA VGGLMALAAA LAAACFVKAY GVTFLGRPRG AATIAAKEVD RFSLAAMAIL
AALCLLAGVL PGVVMDALAP IATNILGSRL APQSGMAWLS IVPISGNHSS YNGLLVAMFI
ALSALLLFVA IRMFSSGAVR RGPAWGCGFT DPAPLAQYSA DGFAQPIRRV FGTLIFRARD
HVTMPPPGDT APARLTIELH DQIWERIYAP LAGGIMIASA RLDRLQLLTI RGYLSLVFVT
LVTLLLVLAI WT