Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2827 |
Symbol | |
ID | 4785400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3008310 |
End bp | 3009404 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640091398 |
Product | hydrogenase (acceptor) |
Protein accession | YP_001022016 |
Protein GI | 124268012 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCT TTTATGAGGT GATGCGCCGC CAAGGCATTT CGCGCCGCAG CTTCCTGAAG TACTGCTCGC TCACGGCCAC CTCGTTGGGG CTGGGTCCGG CCTTCGTGCC GCAGATCGCG CACGCCATGG AGAACAAGCC GCGTACGCCG GTGCTCTGGC TGCACGGTCT GGAATGCACC TGCTGCTCGG AGAGCTTCAT CCGCTCAGCG CACCCGCTCG CGAAGGACGT GGTGCTGTCG ATGATCTCGC TCGACTACGA CGACACGCTG ATGGCCGCCG CAGGCCATCA GGCCGAGGCG ATCCTCGAAG AGATCATGGC CAAGTACAAG GGCAACTACA TCCTGGCGGT GGAGGGCAAT CCTCCGCTGA ACCAGGATGG CATGAGCTGC ATCATCGGCG GCAAGCCCTT CATCGACCAG CTCAAGCACG TGGCCAAGGA CGCCAAGGCC ATCATCTCCT GGGGTTCGTG TGCCTCGTGG GGCTGTGTGC AGGCCGCCAA GCCCAATCCC ACGCAGGCCA CGCCGGTTCA CAAGGTGATC TTCGACAAGC CCATCATCAA GGTGCCGGGC TGCCCGCCGA TCGCCGAGGT GATGACCGGG GTGATCACCT ACATGCTCAC CTTCGACCGG CTGCCCGAGC TCGATCGCCA GGGCCGACCG AAGATGTTCT ACAGCCAGCG CATCCACGAC AAGTGCTACC GCCGGCCGCA CTTCGATGCC GGTCAGTTCG TCGAGCACTT CGACGACGAA TCGGCGCGCC GCGGCTATTG CCTCTACAAG GTCGGCTGCA AGGGCCCGAC CACCTACAAC GCGTGCTCGA CCACGCGCTG GAACGAGGGC ACCAGCTTCC CGATCCAGGC CGGCCACGGT TGCATCGGCT GCTCCGAAGA CGGCTTCTGG GACAAGGGGT CGTTCTACGA CCGCCTCACC AACATCCATC AGTTCGGCAT CGAGGCCAAC GCCGACCAGG TCGGTGGTGC CGCGGCCGGT GTGGTGGGCG CGGCCGCCGT CGCCCACGCC GCCGTGTCGG CGATCAAGCG TGCGACGCAG AAGACCGATT CGAAGACGAA CGAAACCTCC CGCGCCGGGC AGTGA
|
Protein sequence | METFYEVMRR QGISRRSFLK YCSLTATSLG LGPAFVPQIA HAMENKPRTP VLWLHGLECT CCSESFIRSA HPLAKDVVLS MISLDYDDTL MAAAGHQAEA ILEEIMAKYK GNYILAVEGN PPLNQDGMSC IIGGKPFIDQ LKHVAKDAKA IISWGSCASW GCVQAAKPNP TQATPVHKVI FDKPIIKVPG CPPIAEVMTG VITYMLTFDR LPELDRQGRP KMFYSQRIHD KCYRRPHFDA GQFVEHFDDE SARRGYCLYK VGCKGPTTYN ACSTTRWNEG TSFPIQAGHG CIGCSEDGFW DKGSFYDRLT NIHQFGIEAN ADQVGGAAAG VVGAAAVAHA AVSAIKRATQ KTDSKTNETS RAGQ
|
| |