Gene Mpe_A2432 details


Gene Information

Locus tag: Mpe_A2432
Symbol:
ID: 4784268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2592525
End bp: 2594252
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 67%
IMG OID: 640091002
Product: sulfate thiol esterase SoxB
Protein accession: YP_001021622
Protein GI: 124267618
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
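
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that cross-check, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2592525, 2594252)
print(length_bp)                  # 1728
print(protein_length(length_bp))  # 575
```

Under these assumptions the computed values agree with the listed gene length (1728 bp) and protein length (575 aa).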


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTGT CCAAGCGAGA GTTCCTGCAG GTGCTGGGCG CGGCGTCGGC TGCGGGCCTG 
GGCCTGGCAC GGTACGCCGA CGCCGACGCC GCCACGGCAG AGCGGGGGCT CTACGAGGTG
CCGCGTTTCG GCAACGTGTC GTTGCTGCAC ATGACCGACT GTCATGCGCA ATTGCTGCCC
ATCCACTTCC GCGAGCCGAG CGTCAACCTC GGCGTGGGTG CGATGAGCGG CCAGTTGCCG
CACCGGGTCG GCGAGCACCT GCTCGAGGCC GTCGGGGTGC GGCCCGGCAC GTTGCTCGCG
CATGCCTATA CCTTTCTGGA TTTCGAGACG GCTGCGCGCC GCTACGGCAA GGTGGGCGGC
TTCGCGCACA TGGCGACGCT GGTCAAGCGC CTGAAGGCCA GCCGCCCCGG CGCGCTGCTG
CTCGATGGCG GCGACACCTG GCAGGGCTCG GCCACGTCGC TGTGGACGAA CGGTCAGGAC
ATGGTGGACG CCTGCAAGCT GCTGGGGGTC GACGTGATGA CCGGGCACTG GGAGTTCACT
TACGGCCAGA AGCGTGTGCA GCAGATCGTC GACGAGGACT TCAAGGGTCG GATCGATTTC
GTGGCGCAGA ACGTCAGGAC GACCGATTTC GGCGACGAGG TGTTCAAGCC CTACACGCTG
CGCGATGTCA ACGGCGTGAA GCTGGCGATC GTGGGGCAGG CCTTTCCCTA CACGCCCATC
GCCAACCCGC GCTACATGGT GGCGGACTGG AGCTTCGGGA TCCAGGACGA CAACCTGCAG
AAGGTCGTCG ACGCGGCGCG GGCTGCCGGA GCGCAGGTGG TGGTGGTGCT GTCGCACAAC
GGCATGGACG TCGATCTGAA GATGGCGGGC CGCGTGCGTG GCATCGACGC CATCCTCGGC
GGCCACACGC ACGATGGCAT TCCGGTGCCG GTGGTCGTTG CCAACCCGGG GGGCAAGACT
CTGGTGACCA ATGCCGGCTC GAACACCAAG TTCCTCGGCG TGCTCGATCT CGACGTGAAG
GGCGGTAAGG TCGCCGACTA CCGCTACAAG CTGCTGCCGG TGTTCTCGAA CCAGTTGCCG
GCCGACCCTC AGATGCAGTC GCTGATCGAC AGGATCCGTG CTCCCTACAA GGACAAGCTC
GCCGAGAAAC TGGCCATCAC CGAGGGGCTG CTCTACCGGC GCGGCAACTT CAACGGCAGC
TGGGATCAGC TGCTGTGCGA TGCGTTGATG GAGGTGCAGG GCGCAGAGAT CGCCTTCTCG
CCGGGCTTTC GATGGGGCAC GAGCCTGCTG CCCGGTGACG TGATCACGCG CGAACTCATG
ATGGACCAGG TGGCGACCAC CTACTCCTAT GCAACCGTGA CCGAGATGAC CGGCGAGACG
ATCAAGACCA TCCTCGAGGA TGTCGCCGAC AACCTGTTCA ACCCCGACCC CTACTACCAG
CAGGGCGGTG ACATGGTTCG CGTGGGGGGC CTTGCCTACG CGATCGCACC CGGCGAATCG
ATGGGCAAGC GCATCCAGGA CCTGCGCCTC GCAGGCCGAC CGATCGAGGC GGACAAGCGC
TACCGGGTGG CGGGCTGGGC CCCCGTCGCC GAAGAGGCCC GCAGCGCCGG CAACAAGATG
GTGTGGGACG TGGTCGAATC CTGGCTCCAG GCGAAGGGCC GCGTCACGCC GCGCAGGCTC
AATGCGCCTC GACTGATCGG CGTGGACGGC AATGCCGGCG CGGCTTGA
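
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any calculation. As a rough sketch (the helper names `clean_sequence` and `gc_fraction` are hypothetical, not part of any database tool), the GC fraction could be computed as shown below and compared against the GC content value listed under Gene Information. The demo input is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGAGCCTGT CCAAGCGAGA")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.55
```

Passing the full sequence text (all lines, with spaces and line breaks left in) to `clean_sequence` and then to `gc_fraction` gives the whole-gene value.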
 
Protein sequence
MSLSKREFLQ VLGAASAAGL GLARYADADA ATAERGLYEV PRFGNVSLLH MTDCHAQLLP 
IHFREPSVNL GVGAMSGQLP HRVGEHLLEA VGVRPGTLLA HAYTFLDFET AARRYGKVGG
FAHMATLVKR LKASRPGALL LDGGDTWQGS ATSLWTNGQD MVDACKLLGV DVMTGHWEFT
YGQKRVQQIV DEDFKGRIDF VAQNVRTTDF GDEVFKPYTL RDVNGVKLAI VGQAFPYTPI
ANPRYMVADW SFGIQDDNLQ KVVDAARAAG AQVVVVLSHN GMDVDLKMAG RVRGIDAILG
GHTHDGIPVP VVVANPGGKT LVTNAGSNTK FLGVLDLDVK GGKVADYRYK LLPVFSNQLP
ADPQMQSLID RIRAPYKDKL AEKLAITEGL LYRRGNFNGS WDQLLCDALM EVQGAEIAFS
PGFRWGTSLL PGDVITRELM MDQVATTYSY ATVTEMTGET IKTILEDVAD NLFNPDPYYQ
QGGDMVRVGG LAYAIAPGES MGKRIQDLRL AGRPIEADKR YRVAGWAPVA EEARSAGNKM
VWDVVESWLQ AKGRVTPRRL NAPRLIGVDG NAGAA
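
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below assumes the sense-codon assignments of translation table 11, which match the standard genetic code (the two tables differ in their permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative, and the demo uses only the first 30 bases of the gene.

```python
# Codon table laid out in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGCCTGT CCAAGCGAGA GTTCCTGCAG"))  # MSLSKREFLQ
```

The demo output matches the first 10 residues of the protein sequence listed above. The same function can be applied to the full gene sequence for a complete comparison.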