Gene RPC_3771 details


Gene Information

Locus tag: RPC_3771
Symbol:
ID: 3969364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4191116
End bp: 4192909
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 64%
IMG OID: 637926881
Product: nickel-dependent hydrogenase, large subunit
Protein accession: YP_533625
Protein GI: 90425255
COG category: [C] Energy production and conversion
COG ID: [COG0374] Ni,Fe-hydrogenase I large subunit
TIGRFAM ID:
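
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(4191116, 4192909))  # (1794, 597)
```

The result matches the listed Gene Length (1794 bp) and Protein Length (597 aa).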


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.292608
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.910106
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACA TCACCACGCC GAATGGCTTC AAACTCGACA ATGCCGGCCG CCGCGTCGTC 
GTCGATCCGG TGACCCGGAT CGAGGGACAT TTGCGCGTCG AGGTCAATGT CGACGCCAAC
AACGTGATCC GCAACGCGGT GTCGACCGGC ACGATGTGGC GCGGCATCGA AGTGATTTTG
CGCGGCCGCG ATCCGCGCGA CGCCTGGGCC TTCACCGAAC GGATCTGCGG CGTCTGCACC
GGCACCCACG CCTTGACCTC GGTGCGCGCG GTGGAAAACG CGCTCGACAT CAAGATCCCG
GAAAACGCCA ACACCATCCG CAACATCATG CAGCTGACGC TGCAGGTGCA CGACCACATC
GTGCATTTCT ATCATCTGCA CGCGCTGGAC TGGGTCGACA TCGTCTCGGC ACTGTCCGCC
GATCCGAAGG CGACCTCGAC GCTGGCGCAG AGCATTTCGA ACTGGCCGCT GTCGAGTCCG
GGCTACTTCA AGGATCTGCA GACCCGGCTG AAGAAGTTCG TCGAGTCGGG GCAGCTCGGG
CCGTTCAAGA ACGGCTATTG GGGCCATCCG GCCTACAAGT TGCCGCCGGA AGCCAATCTG
ATGGCGGTGG CGCATTATCT GGAGGCGCTC GACTTCCAGA AGGACATGGT GAAGATCCAC
ACCGTCTATG GCGGCAAGAA CCCGCATCCG AACTGGCTGG TCGGCGGCGT GCCCTGCGCG
ATCAATGTCG ACGGCGTGGG CGCGGTCGGT GCCGTCAACA TGGAGCGGCT CAACCTGGTG
TCGTCGATCA TCGACCAGAT GGTGACCTTC ACCGAGCAGG TCTATCTGCC GGACTTGCAG
GCGATCGCCT CGTTCTACAA GGATTGGACC TTCGGCGGCG GGCTGTCGTC GACATCGGTG
ATGAGCTACG GCGACATTCC GGAATACGCC AACGACTATT CGGCGGCGTC GCTGAAAATG
CCGCGCGGCG TGATCCTCAA CGGCAATCTC AACGAGGTGC TACCGGTCGA TCTCGCCGAC
CCCTCGCAGG TGCAGGAGTT CGTCACCCAC TCTTGGTACA AATATCCCGA CGAGACCAAG
GGCCTGCATC CGTGGGACGG TGTCACCGTT CCGCATTATG AGCTGGGGCC GAAGTCGAAG
GGCACCAAGA CCCGGATCGA GGCGGTCGAC GAGGAGGCGA AGTATTCCTG GATCAAGTCG
CCGCGCTGGA AAGGCAACGC GGTCGAGGTC GGGCCGTTGG CGCGCTACAT CGTCGGCTAC
GCGCAGAACA AGCCGGAGTT CAAGGAGCCG ACCGAAAAGC TCCTCAAGGA CCTCGGCCTG
CCGGTGACCG CGCTGTTCTC CACGCTCGGC CGCACCGCGG CGCGCGCGCT GGAGGCGCAA
TGGGCGGCGC ATCAGCTGCG CTACTTCCAG AACAAGCTGA TGGCGTCGCT GAAGGCCGGC
GACAGCGCCA CCGCCAATGT CGAGAAGTGG AGCCCGGAGA CCTGGCCGAA GGAGGTCAAG
GGCGTCGGCT TCACCGAGGC GCCGCGCGGC GCGCTCGGCC ACTGGATCAA GATCAAGGAT
GGCAAGATCG ACAACTACCA GGCGGTGGTG CCGACCACCT GGAACGGCAG TCCGCGCGAT
CCCAAGGGCG GCATCGGCGC CTTCGAGGCG AGCTTGCTCG AAACCCCGAT GGCCGATCCG
AACCAGCCTT TGGAGATCCT GCGCACGCTG CATTCGTTCG ACCCGTGTCT GGCCTGCTCA
ACTCATGTCT TGAGCGAGGA CGGCCAGGAA ATGAGCCGGG TGACGGTGCG CTGA
 
Protein sequence
MTNITTPNGF KLDNAGRRVV VDPVTRIEGH LRVEVNVDAN NVIRNAVSTG TMWRGIEVIL 
RGRDPRDAWA FTERICGVCT GTHALTSVRA VENALDIKIP ENANTIRNIM QLTLQVHDHI
VHFYHLHALD WVDIVSALSA DPKATSTLAQ SISNWPLSSP GYFKDLQTRL KKFVESGQLG
PFKNGYWGHP AYKLPPEANL MAVAHYLEAL DFQKDMVKIH TVYGGKNPHP NWLVGGVPCA
INVDGVGAVG AVNMERLNLV SSIIDQMVTF TEQVYLPDLQ AIASFYKDWT FGGGLSSTSV
MSYGDIPEYA NDYSAASLKM PRGVILNGNL NEVLPVDLAD PSQVQEFVTH SWYKYPDETK
GLHPWDGVTV PHYELGPKSK GTKTRIEAVD EEAKYSWIKS PRWKGNAVEV GPLARYIVGY
AQNKPEFKEP TEKLLKDLGL PVTALFSTLG RTAARALEAQ WAAHQLRYFQ NKLMASLKAG
DSATANVEKW SPETWPKEVK GVGFTEAPRG ALGHWIKIKD GKIDNYQAVV PTTWNGSPRD
PKGGIGAFEA SLLETPMADP NQPLEILRTL HSFDPCLACS THVLSEDGQE MSRVTVR
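
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch that translates DNA codon by codon and computes GC percentage. It is an illustration only, not the database's own pipeline. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons; since this gene begins with ATG, the sketch does not handle alternative starts.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGACCAACA TCACCACGCC GAATGGCTTC"))  # MTNITTPNGF
print(translate("ATGAGCCGGG TGACGGTGCG CTGA"))        # MSRVTVR
```

Both fragments reproduce the corresponding start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.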