Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2432 |
Symbol | |
ID | 4784268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2592525 |
End bp | 2594252 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640091002 |
Product | sulfate thiol esterase SoxB |
Protein accession | YP_001021622 |
Protein GI | 124267618 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGT CCAAGCGAGA GTTCCTGCAG GTGCTGGGCG CGGCGTCGGC TGCGGGCCTG GGCCTGGCAC GGTACGCCGA CGCCGACGCC GCCACGGCAG AGCGGGGGCT CTACGAGGTG CCGCGTTTCG GCAACGTGTC GTTGCTGCAC ATGACCGACT GTCATGCGCA ATTGCTGCCC ATCCACTTCC GCGAGCCGAG CGTCAACCTC GGCGTGGGTG CGATGAGCGG CCAGTTGCCG CACCGGGTCG GCGAGCACCT GCTCGAGGCC GTCGGGGTGC GGCCCGGCAC GTTGCTCGCG CATGCCTATA CCTTTCTGGA TTTCGAGACG GCTGCGCGCC GCTACGGCAA GGTGGGCGGC TTCGCGCACA TGGCGACGCT GGTCAAGCGC CTGAAGGCCA GCCGCCCCGG CGCGCTGCTG CTCGATGGCG GCGACACCTG GCAGGGCTCG GCCACGTCGC TGTGGACGAA CGGTCAGGAC ATGGTGGACG CCTGCAAGCT GCTGGGGGTC GACGTGATGA CCGGGCACTG GGAGTTCACT TACGGCCAGA AGCGTGTGCA GCAGATCGTC GACGAGGACT TCAAGGGTCG GATCGATTTC GTGGCGCAGA ACGTCAGGAC GACCGATTTC GGCGACGAGG TGTTCAAGCC CTACACGCTG CGCGATGTCA ACGGCGTGAA GCTGGCGATC GTGGGGCAGG CCTTTCCCTA CACGCCCATC GCCAACCCGC GCTACATGGT GGCGGACTGG AGCTTCGGGA TCCAGGACGA CAACCTGCAG AAGGTCGTCG ACGCGGCGCG GGCTGCCGGA GCGCAGGTGG TGGTGGTGCT GTCGCACAAC GGCATGGACG TCGATCTGAA GATGGCGGGC CGCGTGCGTG GCATCGACGC CATCCTCGGC GGCCACACGC ACGATGGCAT TCCGGTGCCG GTGGTCGTTG CCAACCCGGG GGGCAAGACT CTGGTGACCA ATGCCGGCTC GAACACCAAG TTCCTCGGCG TGCTCGATCT CGACGTGAAG GGCGGTAAGG TCGCCGACTA CCGCTACAAG CTGCTGCCGG TGTTCTCGAA CCAGTTGCCG GCCGACCCTC AGATGCAGTC GCTGATCGAC AGGATCCGTG CTCCCTACAA GGACAAGCTC GCCGAGAAAC TGGCCATCAC CGAGGGGCTG CTCTACCGGC GCGGCAACTT CAACGGCAGC TGGGATCAGC TGCTGTGCGA TGCGTTGATG GAGGTGCAGG GCGCAGAGAT CGCCTTCTCG CCGGGCTTTC GATGGGGCAC GAGCCTGCTG CCCGGTGACG TGATCACGCG CGAACTCATG ATGGACCAGG TGGCGACCAC CTACTCCTAT GCAACCGTGA CCGAGATGAC CGGCGAGACG ATCAAGACCA TCCTCGAGGA TGTCGCCGAC AACCTGTTCA ACCCCGACCC CTACTACCAG CAGGGCGGTG ACATGGTTCG CGTGGGGGGC CTTGCCTACG CGATCGCACC CGGCGAATCG ATGGGCAAGC GCATCCAGGA CCTGCGCCTC GCAGGCCGAC CGATCGAGGC GGACAAGCGC TACCGGGTGG CGGGCTGGGC CCCCGTCGCC GAAGAGGCCC GCAGCGCCGG CAACAAGATG GTGTGGGACG TGGTCGAATC CTGGCTCCAG GCGAAGGGCC GCGTCACGCC GCGCAGGCTC AATGCGCCTC GACTGATCGG CGTGGACGGC AATGCCGGCG CGGCTTGA
|
Protein sequence | MSLSKREFLQ VLGAASAAGL GLARYADADA ATAERGLYEV PRFGNVSLLH MTDCHAQLLP IHFREPSVNL GVGAMSGQLP HRVGEHLLEA VGVRPGTLLA HAYTFLDFET AARRYGKVGG FAHMATLVKR LKASRPGALL LDGGDTWQGS ATSLWTNGQD MVDACKLLGV DVMTGHWEFT YGQKRVQQIV DEDFKGRIDF VAQNVRTTDF GDEVFKPYTL RDVNGVKLAI VGQAFPYTPI ANPRYMVADW SFGIQDDNLQ KVVDAARAAG AQVVVVLSHN GMDVDLKMAG RVRGIDAILG GHTHDGIPVP VVVANPGGKT LVTNAGSNTK FLGVLDLDVK GGKVADYRYK LLPVFSNQLP ADPQMQSLID RIRAPYKDKL AEKLAITEGL LYRRGNFNGS WDQLLCDALM EVQGAEIAFS PGFRWGTSLL PGDVITRELM MDQVATTYSY ATVTEMTGET IKTILEDVAD NLFNPDPYYQ QGGDMVRVGG LAYAIAPGES MGKRIQDLRL AGRPIEADKR YRVAGWAPVA EEARSAGNKM VWDVVESWLQ AKGRVTPRRL NAPRLIGVDG NAGAA
|
| |