Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1175 |
Symbol | |
ID | 4021651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1332500 |
End bp | 1334791 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637961367 |
Product | (NiFe) hydrogenase maturation protein HypF |
Protein accession | YP_568314 |
Protein GI | 91975655 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.311862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTTG CTCGCGCCAG CGCCGCAGAC CAAGTCGGTC GCGCCCGGGT GCGGGTGCGC GGCGCGGTCC AGGGCGTGGG CTTCCGGCCG TTCGTCTACG GGCTGGCGCA GCGCTATGCG CTCGGCGGCT TCGTCGCCAA CGATGCCGAG GGCGTGCTGA TCGAGGTCGA GGGCGGCTCG ATTGCGGAAT TCCTCGCCGC GCTGCGCTGC GAGGCGCCGC CGCTGGCGCG GGTCGATTCG ATCGAGACCG AACAACTGCG CGCGCGCGGC GAGCGCGGCT TCGACATCGC CGAGAGCCGC GCCGGCCGCG TCACGACGCG GATCGGCGCC GATGCCGCGA CCTGCGAGGC GTGCCTCGAC GATCTGTTCG ATCCGGCGAG CCGCTTCCAT CTCTACCCGT TCGTCAACTG CACCCATTGC GGCCCGCGCT ACACGCTGAC GCATCGCCTG CCATATGACC GCGCCAACAC AGCGATGGCC GGCTTTGCGC TGTGCGCGGA TTGCCGCCGC GACTATCAAG ACCCGCGTGA TCGCCGCTTC CACGCCGAGC CGATCGCCTG CCCGGCCTGC GGGCCGCGGC TCAGCCATCC GATCGACGAG ATCGTCGAGC GATTGCGCGC CGGCGGCATC GTCGCGCTGA AGAGCCTCGG CGGTTATCAT CTGCTGTGCG ATGCGACCAA CGAGGCATCG GTCGCCGAGC TGCGCCGGCG CAAGCGCCGC GACGCCAAGC CGTTCGCGGT GATGGTCGCG TCCGAGGCTT CGCTCGATCG CGTCGTCGCC GCCGACGCGG CCGAGCGGGC GCTGCTGCGT TCGGTCGAGC GGCCGATCGT GCTGATGCAA GATCGCGGCG CGCTGGCGCC GTCGGTGGCG CCGGGCCTGC GCCATGTCGG CGTGATGCTG CCCTACACGC CGTTGCATCA TCTGCTGTTC CACGCCGCGG CCGGATCGCC GCAGGGCCGT GGCTGGCAGC GCGCGCCGCT GGACCTCGTG CTGGTCGCGA CCAGCGCGAA TTGCGGCGGC GATCCGATCG TGATCGACGA TGCGGAACGC AAGCTCGGCG GCATCGCCGA CCTGATCGTC AGCCACGATC GGGACATCGT GGTGCGCGCC GACGACAGCG TGATGGCGAT CAGCGACGGC GGGCCGGCGT TCATCCGCCG CGCCCGTGGC TTCACGCCGC GGCCGGTCCG GCTGCCGCGC GAAATCCCGC CGGTGCTGGC CGTCGGCGGT TACTTGAAAA ATACGATCAC GCTGACGCGC GGCCGCGAGG CGTTCGTGTC GCAGCATGTC GGCGATCTCG CCACCGCCGA CACCGTCCGC TTCTTCGAAC AGACGATCGC GCATCTGACC CGGCTGGTCG GCGTCGCGCC GGTCGCGGTG GCGCATGATC TGCACGCCGA CTTCGCCTCG ACCCGCTTGG CCGAAAGCCT CGGGTTGCGG CTGATCGCCG TGCAGCATCA TCACGCCCAT GTCGCATCGA TAGCCGCCGA ACACGGCATC GACGCGCCGC TGCTCGGCCT CGTGCTCGAC GGCCATGGCC AGGGCAGCGA CGGCGGCAAT TGGGGCGGCG AATTGCTGCG CGTCGACGGC GCACACGTCA CCCGGCTCGG CCATCTCGCG GCGCTGGCGC TGCCGGGCGG CGACGCCGCC GCGCGCGAGC CGTGGCGGAT GGCGGCGGCG GCGCTGGCCG CGATCGGACA AAGCGAGGCG ATCACCGCGC GGTTTGCCGA TCAGCCGCGC GCGCCGGCGC TGGCCGCGAT GCTGGCCAAT CATGGCTGCG CCACCACGAC CAGCGCCGGG CGGCTGTTCG ACGCCGCCGC CGGGTTGCTC GGCGTTTGTT CGGTTCAGGC CTACGAAGGC CAAGCCGCGA TGCAGCTCGA GGCGCTGGTG CAAACGCCGC GCGTGCTGAT CGATGGCTGG CGCATCGAAC GCGATGCGCT CGATCTATCG CCGCTGTTGC GCCATCTCGC GACGCCCGGC CTCGCTCCTG TCGCCGGCGC CGAACTGTTC CACGGTACGC TGGCGGCGGC GCTGGCGAAT TGGATCGCAC AAGCGTCGGC GCGAACCGGC CTCACCACGA TCGCGCTCGG CGGTGGTTGC TTCCTCAACC GCGTTCTCAG CGCCGATCTC GCAGCGCGGT TGCGCGCGTG CGGTCTGACG CCGCTGCCGG CGCGGCAATT GCCGCCGAAT GACGGCGGCC TCAGCCTCGG TCAGGCCTGG ATCGCCGGAC AGGCGATCGT GAACGACACA GAGGAGGAGC GCCCATGTGC CTCGCCATTC CCGCCGAAGT GA
|
Protein sequence | MELARASAAD QVGRARVRVR GAVQGVGFRP FVYGLAQRYA LGGFVANDAE GVLIEVEGGS IAEFLAALRC EAPPLARVDS IETEQLRARG ERGFDIAESR AGRVTTRIGA DAATCEACLD DLFDPASRFH LYPFVNCTHC GPRYTLTHRL PYDRANTAMA GFALCADCRR DYQDPRDRRF HAEPIACPAC GPRLSHPIDE IVERLRAGGI VALKSLGGYH LLCDATNEAS VAELRRRKRR DAKPFAVMVA SEASLDRVVA ADAAERALLR SVERPIVLMQ DRGALAPSVA PGLRHVGVML PYTPLHHLLF HAAAGSPQGR GWQRAPLDLV LVATSANCGG DPIVIDDAER KLGGIADLIV SHDRDIVVRA DDSVMAISDG GPAFIRRARG FTPRPVRLPR EIPPVLAVGG YLKNTITLTR GREAFVSQHV GDLATADTVR FFEQTIAHLT RLVGVAPVAV AHDLHADFAS TRLAESLGLR LIAVQHHHAH VASIAAEHGI DAPLLGLVLD GHGQGSDGGN WGGELLRVDG AHVTRLGHLA ALALPGGDAA AREPWRMAAA ALAAIGQSEA ITARFADQPR APALAAMLAN HGCATTTSAG RLFDAAAGLL GVCSVQAYEG QAAMQLEALV QTPRVLIDGW RIERDALDLS PLLRHLATPG LAPVAGAELF HGTLAAALAN WIAQASARTG LTTIALGGGC FLNRVLSADL AARLRACGLT PLPARQLPPN DGGLSLGQAW IAGQAIVNDT EEERPCASPF PPK
|
| |