Gene RPD_1163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1163 
Symbol 
ID4021639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1323584 
End bp1325377 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content64% 
IMG OID637961355 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_568302 
Protein GI91975643 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.28682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG TCACGACACC CAACGGCTTC AAGCTCGACA ATGGCGGCCG CCGCGTCGTC 
GTCGATCCGG TGACCCGGAT CGAGGGCCAT CTGCGCGTCG AGGTCAATGT CGACAGCGCC
AACGTCATCC GCAACGCGGT GTCCACCGGC ACGATGTGGC GCGGCATCGA GATCATCCTG
AAGGGTCGCG ACCCGCGCGA CGCCTGGGCC TTCACCCAGC GGATCTGCGG CGTCTGCACC
GGCACCCACG CGCTGACCTC GGTGCGCGCG GTGGAGAACG CTCTCGACAT CAAGATCCCG
GAAAACGCCA ACACCATCCG CAACATCATG CAGCTCACGC TGTATGTGCA CGACCATATC
GTCCACTTCT ATCACCTCCA CGCGCTCGAC TGGGTCGACG TGGTGTCGGC GCTTTCGGCC
GATCCGAAGG CGACCTCGGC GCTGGCGCAG AGCATTTCGT CGTGGCCGCT GTCGAGCCCG
GGTTACTTCA AGGATCTGCA GACGAGACTG AAGAAATTCG TCGAGAGCGG GCAGCTCGGC
CCGTTCAAGA ACGGCTATTG GGGCCACCCG GCCTATAAGC TGCCGCCGGA AGCCAATCTG
ATGGCGGTCG CGCATTATCT GGAAGCGCTG GATTTCCAGA AGGAGATGGT GAAGATCCAC
ACCATCTACG GCGGCAAGAA TCCGCATCCG AACTGGCTGG TCGGCGGCGT GCCCTGCGCG
ATCAATCTCG ACGGCGTCGG CGCGGTCGGC GCGATCAACA TGGAGCGGCT CAATCTGGTC
TCCTCGATCA TCGACCAGAT GGTGACGTTC ACCGAGCAGG TCTATCTGCC CGATCTCCAG
GCGATCGCCT CGTTCTACAA GGACTGGACC TTCGGTGGCG GGCTGTCCTC GAAATCGGTG
TTGAGCTACG GCGATATCCC CGAGCACGCC AACGACTATT CGGCGTCGTC ACTGAAGCTG
CCGCGCGGCG CGATCATCAA CGGCAATCTC GCCGAAGTCC TCCCGGTCGA TCTCACCGAT
CCGTCGCAGG TGCAGGAGTT CGTCGCGCAT TCCTGGTACA AATATCCCGA CGAGACCAAG
GGCCTGCATC CGTGGGACGG CGTCACCGAA CCGAACTTCA AGCTCGGTCC CAACGCCAAG
GGCACCAAGA CCCGGATCGA GGCGCTCGAC GAGGACGCCA AATACTCCTG GATCAAATCG
CCGCGCTGGA AAGGCAACGC GGTCGAGGTC GGGCCGCTGG CGCGCTACAT CGTCGGCTAT
GCCCAGAACA AGCCGGAGTT CAAGGAGCCG ACCGACAAGC TGCTGAAGGA TCTCGGCCTG
CCGCTACCGG CGCTGTTCTC GACGCTCGGC CGCACCGCGG CCCGGGCGTT GGAGGCGCAG
TGGGCGGCGC GTCAGCTGCG TTACTTCCAG GACAAGCTGA TCGCATCGCT GAAGGCCGGC
GACAGCGCCA CCGCCAATGT CGACAAATGG AAGCCGGAGA GCTGGGCGAA GGACGTCAAG
GGCGTCGGCT TCACCGAGGC GCCGCGCGGC GCGCTCGGCC ACTGGGTCAA GATCAAGGAC
GGCAAGATCG ACAATTACCA GGCGGTGGTG CCGACCACCT GGAACGGCAG CCCGCGCGAT
CCCAAGGGCG GCATCGGCGC GTTCGAGGCG TCGCTGCTCG ACACCCCGAT GGCCGATCCG
AATCAACCGC TGGAGATTCT GCGGACGCTG CATTCGTTCG ATCCCTGCCT CGCCTGCTCG
ACGCATGTGA TGAGCGAGGA CGGCCAGGAA ATGACCAAGG TCACCGTGCG CTGA
 
Protein sequence
MSNVTTPNGF KLDNGGRRVV VDPVTRIEGH LRVEVNVDSA NVIRNAVSTG TMWRGIEIIL 
KGRDPRDAWA FTQRICGVCT GTHALTSVRA VENALDIKIP ENANTIRNIM QLTLYVHDHI
VHFYHLHALD WVDVVSALSA DPKATSALAQ SISSWPLSSP GYFKDLQTRL KKFVESGQLG
PFKNGYWGHP AYKLPPEANL MAVAHYLEAL DFQKEMVKIH TIYGGKNPHP NWLVGGVPCA
INLDGVGAVG AINMERLNLV SSIIDQMVTF TEQVYLPDLQ AIASFYKDWT FGGGLSSKSV
LSYGDIPEHA NDYSASSLKL PRGAIINGNL AEVLPVDLTD PSQVQEFVAH SWYKYPDETK
GLHPWDGVTE PNFKLGPNAK GTKTRIEALD EDAKYSWIKS PRWKGNAVEV GPLARYIVGY
AQNKPEFKEP TDKLLKDLGL PLPALFSTLG RTAARALEAQ WAARQLRYFQ DKLIASLKAG
DSATANVDKW KPESWAKDVK GVGFTEAPRG ALGHWVKIKD GKIDNYQAVV PTTWNGSPRD
PKGGIGAFEA SLLDTPMADP NQPLEILRTL HSFDPCLACS THVMSEDGQE MTKVTVR