Gene RPD_1175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1175 
Symbol 
ID4021651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1332500 
End bp1334791 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content71% 
IMG OID637961367 
Product(NiFe) hydrogenase maturation protein HypF 
Protein accessionYP_568314 
Protein GI91975655 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.311862 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTTG CTCGCGCCAG CGCCGCAGAC CAAGTCGGTC GCGCCCGGGT GCGGGTGCGC 
GGCGCGGTCC AGGGCGTGGG CTTCCGGCCG TTCGTCTACG GGCTGGCGCA GCGCTATGCG
CTCGGCGGCT TCGTCGCCAA CGATGCCGAG GGCGTGCTGA TCGAGGTCGA GGGCGGCTCG
ATTGCGGAAT TCCTCGCCGC GCTGCGCTGC GAGGCGCCGC CGCTGGCGCG GGTCGATTCG
ATCGAGACCG AACAACTGCG CGCGCGCGGC GAGCGCGGCT TCGACATCGC CGAGAGCCGC
GCCGGCCGCG TCACGACGCG GATCGGCGCC GATGCCGCGA CCTGCGAGGC GTGCCTCGAC
GATCTGTTCG ATCCGGCGAG CCGCTTCCAT CTCTACCCGT TCGTCAACTG CACCCATTGC
GGCCCGCGCT ACACGCTGAC GCATCGCCTG CCATATGACC GCGCCAACAC AGCGATGGCC
GGCTTTGCGC TGTGCGCGGA TTGCCGCCGC GACTATCAAG ACCCGCGTGA TCGCCGCTTC
CACGCCGAGC CGATCGCCTG CCCGGCCTGC GGGCCGCGGC TCAGCCATCC GATCGACGAG
ATCGTCGAGC GATTGCGCGC CGGCGGCATC GTCGCGCTGA AGAGCCTCGG CGGTTATCAT
CTGCTGTGCG ATGCGACCAA CGAGGCATCG GTCGCCGAGC TGCGCCGGCG CAAGCGCCGC
GACGCCAAGC CGTTCGCGGT GATGGTCGCG TCCGAGGCTT CGCTCGATCG CGTCGTCGCC
GCCGACGCGG CCGAGCGGGC GCTGCTGCGT TCGGTCGAGC GGCCGATCGT GCTGATGCAA
GATCGCGGCG CGCTGGCGCC GTCGGTGGCG CCGGGCCTGC GCCATGTCGG CGTGATGCTG
CCCTACACGC CGTTGCATCA TCTGCTGTTC CACGCCGCGG CCGGATCGCC GCAGGGCCGT
GGCTGGCAGC GCGCGCCGCT GGACCTCGTG CTGGTCGCGA CCAGCGCGAA TTGCGGCGGC
GATCCGATCG TGATCGACGA TGCGGAACGC AAGCTCGGCG GCATCGCCGA CCTGATCGTC
AGCCACGATC GGGACATCGT GGTGCGCGCC GACGACAGCG TGATGGCGAT CAGCGACGGC
GGGCCGGCGT TCATCCGCCG CGCCCGTGGC TTCACGCCGC GGCCGGTCCG GCTGCCGCGC
GAAATCCCGC CGGTGCTGGC CGTCGGCGGT TACTTGAAAA ATACGATCAC GCTGACGCGC
GGCCGCGAGG CGTTCGTGTC GCAGCATGTC GGCGATCTCG CCACCGCCGA CACCGTCCGC
TTCTTCGAAC AGACGATCGC GCATCTGACC CGGCTGGTCG GCGTCGCGCC GGTCGCGGTG
GCGCATGATC TGCACGCCGA CTTCGCCTCG ACCCGCTTGG CCGAAAGCCT CGGGTTGCGG
CTGATCGCCG TGCAGCATCA TCACGCCCAT GTCGCATCGA TAGCCGCCGA ACACGGCATC
GACGCGCCGC TGCTCGGCCT CGTGCTCGAC GGCCATGGCC AGGGCAGCGA CGGCGGCAAT
TGGGGCGGCG AATTGCTGCG CGTCGACGGC GCACACGTCA CCCGGCTCGG CCATCTCGCG
GCGCTGGCGC TGCCGGGCGG CGACGCCGCC GCGCGCGAGC CGTGGCGGAT GGCGGCGGCG
GCGCTGGCCG CGATCGGACA AAGCGAGGCG ATCACCGCGC GGTTTGCCGA TCAGCCGCGC
GCGCCGGCGC TGGCCGCGAT GCTGGCCAAT CATGGCTGCG CCACCACGAC CAGCGCCGGG
CGGCTGTTCG ACGCCGCCGC CGGGTTGCTC GGCGTTTGTT CGGTTCAGGC CTACGAAGGC
CAAGCCGCGA TGCAGCTCGA GGCGCTGGTG CAAACGCCGC GCGTGCTGAT CGATGGCTGG
CGCATCGAAC GCGATGCGCT CGATCTATCG CCGCTGTTGC GCCATCTCGC GACGCCCGGC
CTCGCTCCTG TCGCCGGCGC CGAACTGTTC CACGGTACGC TGGCGGCGGC GCTGGCGAAT
TGGATCGCAC AAGCGTCGGC GCGAACCGGC CTCACCACGA TCGCGCTCGG CGGTGGTTGC
TTCCTCAACC GCGTTCTCAG CGCCGATCTC GCAGCGCGGT TGCGCGCGTG CGGTCTGACG
CCGCTGCCGG CGCGGCAATT GCCGCCGAAT GACGGCGGCC TCAGCCTCGG TCAGGCCTGG
ATCGCCGGAC AGGCGATCGT GAACGACACA GAGGAGGAGC GCCCATGTGC CTCGCCATTC
CCGCCGAAGT GA
 
Protein sequence
MELARASAAD QVGRARVRVR GAVQGVGFRP FVYGLAQRYA LGGFVANDAE GVLIEVEGGS 
IAEFLAALRC EAPPLARVDS IETEQLRARG ERGFDIAESR AGRVTTRIGA DAATCEACLD
DLFDPASRFH LYPFVNCTHC GPRYTLTHRL PYDRANTAMA GFALCADCRR DYQDPRDRRF
HAEPIACPAC GPRLSHPIDE IVERLRAGGI VALKSLGGYH LLCDATNEAS VAELRRRKRR
DAKPFAVMVA SEASLDRVVA ADAAERALLR SVERPIVLMQ DRGALAPSVA PGLRHVGVML
PYTPLHHLLF HAAAGSPQGR GWQRAPLDLV LVATSANCGG DPIVIDDAER KLGGIADLIV
SHDRDIVVRA DDSVMAISDG GPAFIRRARG FTPRPVRLPR EIPPVLAVGG YLKNTITLTR
GREAFVSQHV GDLATADTVR FFEQTIAHLT RLVGVAPVAV AHDLHADFAS TRLAESLGLR
LIAVQHHHAH VASIAAEHGI DAPLLGLVLD GHGQGSDGGN WGGELLRVDG AHVTRLGHLA
ALALPGGDAA AREPWRMAAA ALAAIGQSEA ITARFADQPR APALAAMLAN HGCATTTSAG
RLFDAAAGLL GVCSVQAYEG QAAMQLEALV QTPRVLIDGW RIERDALDLS PLLRHLATPG
LAPVAGAELF HGTLAAALAN WIAQASARTG LTTIALGGGC FLNRVLSADL AARLRACGLT
PLPARQLPPN DGGLSLGQAW IAGQAIVNDT EEERPCASPF PPK