Gene RPD_1178 details


Gene Information

Locus tag: RPD_1178
Symbol:
ID: 4021654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1336136
End bp: 1337182
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 68%
IMG OID: 637961370
Product: hydrogenase expression/formation protein HypE
Protein accession: YP_568317
Protein GI: 91975658
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0309] Hydrogenase maturation factor
TIGRFAM ID: [TIGR02124] hydrogenase expression/formation protein HypE
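
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(1336136, 1337182)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of this record.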


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.668048
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACC GCACCCGCAG GCGCAAGCTC GATATCAAAA ACGGTCGGGT CGACCTGTCG 
CACGGCGCCG GCGGCCGCGC CATGGCGCAT CTGATCGGCG AGATTTTCCA TGAGGCGTTC
GACAACGATC TGCTGCGCCG CGGCAACGAT CAGGCGGCGT TCGATGTCGT CGGCGGACGG
ATGGTGATGA CGACGGACGG CTATGTGGTG TCGCCGCTGT TCTTTCCCGG CGGCGACATC
GGCTCGCTGG CGGTGCACGG CACCATCAAC GACGTCGCGA TGGCCGGGGC CAGGCCGCTG
TATCTGTCCG CCGGCTTCAT CATCGAGGAG GGGTTCCCGC TCGCCGATCT GAAGCGCATC
GCCGACAGCA TGGGCGCGGC GTCGCGCGAG GCCGGCGTGC CGATCGTCAC CGGCGATACC
AAGGTGGTCG AACGCGGCAA GGCCGACGGC GTGTTCATCA CCACCACGGG CGTCGGCGTC
GCGCCGGCAG AGCTGGTGCT GTCGTCCGAA TTGGCGTGCC CCGGCGACAA AGTCCTTCTC
TCCGGCTTCA TCGGCGATCA CGGCGTTGCG GTGATGTCGC AGCGGCAGAA CCTCGCCTTC
GAGACGTCGA TCGTGTCGGA TTCCGCCGCA TTGCATGAAC TGGTCGCCGC GATGGTCGCC
GCCGCGCCGC AGGCGCTGCG GGTGATGCGC GATCCGACCC GCGGCGGGCT CGCGGCGACG
TTGAACGAAT TGGCGCAGCA ATCGGCGATC GGCTTCCGGC TCGACGAGGA CGCGGTGCCG
ATCCGGCCCG AGGTCGCGGC GGCCTGCGAG CTGCTCGGGC TCGATCCGCT TTACGTCGCC
AATGAAGGCA AGCTGATCGC CATCGTCGCA TCCGAGGCGG CGGACACCGT CCTCGCGGCG
ATGCGCGCGC ATCCGCTCGG GCGTGATGCC GCGATCATCG GCGAGGTGAT CGCCGACGAT
CATCACTTCG TGCAGATGAC CACGTCATTC GGCGGCGGGC GGATCGTCGA CTGGCTGTCG
GGCGAGCAAT TGCCGCGAAT TTGTTGA
 
Protein sequence
MSDRTRRRKL DIKNGRVDLS HGAGGRAMAH LIGEIFHEAF DNDLLRRGND QAAFDVVGGR 
MVMTTDGYVV SPLFFPGGDI GSLAVHGTIN DVAMAGARPL YLSAGFIIEE GFPLADLKRI
ADSMGAASRE AGVPIVTGDT KVVERGKADG VFITTTGVGV APAELVLSSE LACPGDKVLL
SGFIGDHGVA VMSQRQNLAF ETSIVSDSAA LHELVAAMVA AAPQALRVMR DPTRGGLAAT
LNELAQQSAI GFRLDEDAVP IRPEVAAACE LLGLDPLYVA NEGKLIAIVA SEAADTVLAA
MRAHPLGRDA AIIGEVIADD HHFVQMTTSF GGGRIVDWLS GEQLPRIC
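
The relationship between the two sequence blocks above can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (the two tables differ in permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Whitespace is stripped first because the record prints sequences in space-separated groups of ten.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    s = clean(seq)
    if not s:
        return 0.0
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
fragment = "ATGAGCGACC GCACCCGCAG GCGCAAGCTC"
print(translate(fragment))  # MSDRTRRRKL
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.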