Gene RPC_3758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3758 
Symbol 
ID3969351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4182186 
End bp4183232 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content68% 
IMG OID637926868 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_533612 
Protein GI90425242 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGC GCCAGCGTCA CCGCAAGCTC GATCTCGTGG CCGGCCGGGT CGAGCTGTCG 
CACGGCGCCG GCGGTCGCGC CATGGCGCAG CTGATCGCGG AGGTGTTTCA CGCAGCCCTC
GACAACGACT GGCTGCGCCG CGGCAACGAT CAGTCGGCGT TCGACGTCGA GGCCGGCCGC
ATGGTGATGA CCACCGACGG CTATGTGATC TCGCCGTTGT TTTTTCCCGG CGGCGACATC
GGCTCGCTTT CGGTGCACGG CACCATCAAC GACGTGGCGA TGGCCGGGGC AAAGCCGCTG
TATCTGTCGG CGAGTTTTAT TATCGAAGAG GGGTTTCCGC TCGCGGACTT GAATCGCATC
GCCGACAGCA TGGGGCAAGC GTCGCGCGAG GCTGGTGTGC CGGTGATCAC CGGCGACACC
AAGGTGGTGG AGCGCGGCAA GGCCGACGGC GTGTTTATCT CCACCGCCGG CGTCGGCGTG
CTGCCGCACG GCCTCGAGCT CTCCGCCGAC AAAGCGCGGC CCGGCGACAA GCTGCTGCTG
TCCGGCTCGC TCGGCGATCA CGGCGTCGCG GTGATGTCGC GGCGGCAGAA TCTGGCCTTC
GACACCAACA TCGTGTCGGA CTCCGCGGCG CTGCACGGTC TCGTCGCCGA CATGGTCGCG
GTGGCTGGGG CTAGCCTGCG GGTGATGCGC GATCCGACCC GCGGCGGGCT CGCCGCGACG
CTGAATGAAC TGGCGCAGCA ATCCCGCGTC GGCTTCCGCA TCGACGAGGA CAATCTCCCG
ATCAAGCCGC AGGTCGCCGC CGCCTGCGAA TTGCTGGGCC TCGATCCGCT CTACGTCGCC
AACGAGGGCA AGCTGGTTGC GATCGTCGCA CCCGACGCCG CCGAGGCGGC GCTCTCGGCG
ATGCGCCGGC ATCCGTTGGG GCGCGAGGCT ACTATCATTG GCGAGGCGGT TGAGGACGAG
CATCGCTTCG TGCAGATGAC CACCGCGTTC GGCGGCGGCC GCATCGTCGA TTGGCTGGCG
GGCGATCAAT TGCCACGGAT CTGTTGA
 
Protein sequence
MSARQRHRKL DLVAGRVELS HGAGGRAMAQ LIAEVFHAAL DNDWLRRGND QSAFDVEAGR 
MVMTTDGYVI SPLFFPGGDI GSLSVHGTIN DVAMAGAKPL YLSASFIIEE GFPLADLNRI
ADSMGQASRE AGVPVITGDT KVVERGKADG VFISTAGVGV LPHGLELSAD KARPGDKLLL
SGSLGDHGVA VMSRRQNLAF DTNIVSDSAA LHGLVADMVA VAGASLRVMR DPTRGGLAAT
LNELAQQSRV GFRIDEDNLP IKPQVAAACE LLGLDPLYVA NEGKLVAIVA PDAAEAALSA
MRRHPLGREA TIIGEAVEDE HRFVQMTTAF GGGRIVDWLA GDQLPRIC