Gene Mpe_A2807 details

Gene Information

Locus tag: Mpe_A2807
Symbol:
ID: 4785057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2990239
End bp: 2991258
Gene length: 1020 bp
Protein length: 339 aa
Translation table: 11
GC content: 69%
IMG OID: 640091378
Product: hydrogenase small chain
Protein accession: YP_001021996
Protein GI: 124267992
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
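
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2990239
END_BP = 2991258
GENE_LENGTH_BP = 1020
PROTEIN_LENGTH_AA = 339


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length_from_cds(computed_gene_length)

print("gene length matches record:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches record:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the computed values agree with the record: the coordinate span gives 1020 bp, and 1020 / 3 codons minus one stop codon gives 339 residues.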


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATGG GCGCTGCGGC GGGATTCAAC GTGCTGTGGC TGCAGTCGGG GGGATGCGGT 
GGCTGCAGCA TGTCGCTGCT GTGCGCCGAC ACGACCGACT TCCACGGCCA GTTGCGCGAC
GCCGGCATCG ACCTGCTGTG GCACCCCTCG CTCTCGATCG AGAGCGGGCA CGAGCTGAGC
ACCATGCTCG ACCGCATCGC CGATGGCCGC CTGCGGCTCG ACGCCCTGTG CATCGAAGGC
TCCCTGCTGC GCGGCCCGCA TGGCAGCGGT CGCTTCCACG TGCTGGCCGG CACCGGCATC
CCGATGATCC ACTGGGTCTC GCGGTTGGCG GCCAGGGCCC GGCACGTGGT GGCGGTCGGC
AGCTGCGCGG CCTGGGGCGG CGTGACCGCC GGTGGCGACA ACCCCACCGA TGCCTGCGGC
CTGCAGTTCG AGGACGACCG TCGCGGTGGC CTGCTCGGTG CCGACTTCCG TTCTGAGAGT
GGCCTGCCGG TGATCAACAT CGCTGGCTGC CCCACGCATC CGAGCTGGGT GATCGACACG
CTGATGGCGC TGGCCGCTGA GAGCTTCACG GCCGGCGACC TCGACCAGCT GGGCCGTCCG
CGCTTCTATG CCGATCAGCT GGTGCACCAC GGCTGCACCC GCAATGAATA CTACGAATTC
AAGGCCAGCG CCGAGAAGCC GTCGGACCTG GGTTGCATGA TGGAGCACAT GGGCTGCAAG
GGCACACAGG TGCATGCGGA CTGCAACACG CGGCTGTGGA ACGGCGAGGG CTCGTGCACC
CGGGGCGGCT ACGCCTGCGT CGCCTGCACC GAGCCGGGCT TCCAGGAACC GGGCCACCCC
TTCCAACAGA CACCCAAGCT AGCCGGCATC CCGATCGGCC TGCCGACCGA CATGCCCAAG
GCCTGGTTCG TCGCGCTTGC GTCGCTGTCG AAGTCGGCGA CGCCCAGGCG CGTGAAGCTC
AATGCCGTGG CCGATCACCT GGTGGTCACG CCGGCGGTGC GCAAGACGCG CCTGAAATGA
 
Protein sequence
MSMGAAAGFN VLWLQSGGCG GCSMSLLCAD TTDFHGQLRD AGIDLLWHPS LSIESGHELS 
TMLDRIADGR LRLDALCIEG SLLRGPHGSG RFHVLAGTGI PMIHWVSRLA ARARHVVAVG
SCAAWGGVTA GGDNPTDACG LQFEDDRRGG LLGADFRSES GLPVINIAGC PTHPSWVIDT
LMALAAESFT AGDLDQLGRP RFYADQLVHH GCTRNEYYEF KASAEKPSDL GCMMEHMGCK
GTQVHADCNT RLWNGEGSCT RGGYACVACT EPGFQEPGHP FQQTPKLAGI PIGLPTDMPK
AWFVALASLS KSATPRRVKL NAVADHLVVT PAVRKTRLK