Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_B0454 |
Symbol | |
ID | 4787594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | - |
Start bp | 403362 |
End bp | 406271 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640092884 |
Product | TnpA family transposase |
Protein accession | YP_001023462 |
Protein GI | 124262992 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 158 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGGCT GGCAATCACC GTATCTGGGC CAGCGCGAGC TGCCGCGTGA GCTCAGCCAG TTTGAGCTGC AGGCCTTCTT CAGCTTCAGC CCCGCCGAGC GAGAGCTGAT CGCGCGACGC CGCGGCGACG GTCTGCGGCT GGGCCTGGCG CTGCACATCG GCTTCGTGCG CATGACCGGC CGGCCGCTGA ACAGCGTGCG CGTCGTGCCG AGCGTCCTGC TGGCCCACCT GGGCCGCGAA CTCGGCATCG AGACTCCCGA TCTGGCGTCC GTGCGGGCGC TGTATGCACG GGGCCGCACC CTGTTCGATC ACCAACAGCT GGCCTGCGAG TCCCTGGGAT TTGGCTGGAT GACTGAGCAC CAGCGTCGTG CGCTCGTGCG TGTGCTGCGC GATGAAGTCG CGCAGTGCGC CGACCGAGAA CGTCTGATCG TCCACGCCCG GCAATGGCTG TACAGCCACA GGTTGCTGGT CTTGCGCGAG CGCGACATCC GCGGCCTGGT GGCGGCCGCC CTGAGCGAGC TCGAGCGCAC GACGGCCGAG GCGGTTCGGC AGTCGGTTCC CGTCTCGACC CTCAAGCGCT GGAGCGCGGT GCTGGATGCC CCGCGTCCTG ATGGACAACC CTGCCAGTCC TGGTTGTGGA GCGCGCCGGC CAAGCATTCC ACGCGGCAGA TAGCAGAGGT TCTGGAGCGA ATCACTTGCC TGAAGGAGCT CGGCGTCGAC CGCGCGCTGG CCGAGATCAA CAACGTGCTG GTGCGCCGCT ACGCGCGACG CATGGCCTCG CGAGCGCCGT CGGTCAACGC CCGCGTCAAG GAGCCCGCCC GGACCGTCGA GACGGCCTGC TTCCTGCGCT ACTGCTTGCT GACGGCCACC GACCAGTTCA TCCTCATGTT CCAGCGCCGC GTCGCCGATC TCTGGCGCCA ATGTGCCGAC GATGCGGTTG CCCCCATCGA CTGGTCGCGG CAGTACCAGT TGCTGCTGCA GGAGTTGGCC GAGCTGGCCC GGGACGAAGC CGGCACGGCC GAACTGCGCA CGCGCCTGCT CGAACTGGTC GCAGCCCGGC GCGCGCAGAG AACACCGAGC CGGGCCTCGG GCATCCGTCG GCAGTTGATC GCCGCCATCG CCCCGGTGCG CTCGCTGCTG GTGGCCGCTG GGAGCCTGTC GTGGACGGCC ACGGGTGAGC ACCCGGCGCT GCAGGCGCTG GACATCCTGC GCGCGCAGTA CGCAGCCGGC GACAAGACCT TGCCTGTCGA TGTCACCGCT GCACGGCTGG GGGCAGCGTG GCGCCAGGAC ATCGCTGACA CCGACCGGGA GCGGGCGTTC CGGGCACTGG AAGTGGCCAC CCTGTTCGCG CTGCGCCGGG GATTGCGCAA TGGCTCGATC TGGATCGAAC ACTCGCTGAG CTTCCGAGGC CGCGAGCGGC TGTTCATCCC GGACGAGCGC TGGGAGGTTG AGGCCCGGCG CCACTACGCC AGGCTACAGA TGCCGGCCAA GGCAGCGAAC TACCTCGCGC CGCTGCTGGA GCGCGTGCGC GCCGGCGTCG ACGCCGTGGC CACGGCGGTA CGCACCGGCG CGCTGCGTGT CGACGACGAA CTGCACTTGG CTCCGCTGGC GGCCGACGAT GAGGACCCCG AAGTCAGCAG GCTGCGCAGC CGGCTCGATC AGCGCATCGG CGAAGTGCAA CTGCCCGAGG TCATCCTGGC GGTGGACGCC CAGGTGCGCT TCAGCTGGAT CATGCTCGCA CGCGAGCCGC GATCTGGCCA GGAACTGCTG ATGGTCTACG CCGGCATCCT GGCCCACGGC ACCAGCCTGA CTGCGGCCGA GTGCGCACGG ATGATCCCGC AGCTGTCGGC CACCAGTATT CGCCAGGCTA TGCGCTGGGC CGGTGACGAG CGGCGCCTGG CGCTGGCCTG CCAGGCGGTG CTGGAGCACA TGCAACGCCA ACCGATCGCC GCCACCTGGG GCCGAGCGGA TCTGGCTTCG TCAGACATGA TGAGCCTGGA GACCAGCCGC CGGGTCTGGC AGGCACGGCA GGACCCCAGG CGGCAGACGG CCTCGATCGG CATCTACAGC CACGTCAAGG ACCGCTGGGG GATCTTCCAT GCCCAGCCCA TCGTGCTCAA CGAGCGTCAG GCTGGCGCGG CCATCGAGGG CGTGGTGCGG CAGGAGAAGG TCGAGACCAC GCAACTGGCT GTGGACACGC ACGGCTACAC CGACTTCGCC ATGGGCCTGG CTCGGCTGCT GGGCTTCGAT CTGTGTCCCA GGCTCAGGGA GCTCAATCAG CGGCACCTGT TCGTGCCGCG CGGCATGAAG GTGCCCGAGG AGATCGCTGC CGTGTGCGAG GCCACCATCG ATACGGCCCT GATAGAGACC CACTGGGACA GTCTGGTGCA TCTGGCAGCC TCGGTGATCA CGGGCCACGC CAGCGCGGTC ACGGCGTTGG CCAGGTTCGG CTCGGCCGCC CGAGGCGACC CGATCTACGA CGCCGGCGTC CAGTTGGGCA AGCTGCTGCG CACGGCGTTC CTGGCCGACT ACTTCGTCAA CGAGGCGTTC CGCCGCGAGC TGCGACGGGT CCTCAACCGC GGCGAGGCGG TCAATGCGCT GAAGCGGGCG ATCTACACCG GCCGTGTGGG GCCAGCGCAG GCACGGAGAG CCGACGAAAT GCAAGCGGTG GCCGATGCGC TGAGCCTGCT GGCCAACATC GTGATGGCAT GGAACACGGC GCAGATGCAG GTAGTGCTGG ACCGGTGGGC CAATAGGCGG CAGATCGTGC CGGCCGAGCT GACCGGGCGC ATTGCGCCCA CGCGCCTTGA AGGCATCAAC CTGCGGGGAG TCTTCCGCTT CCCGCTGGAG CGCTACGCGG GCCAGATCCT GCCGTCACAA ACTGCAGTGA AAACAGGTGC TGCTGGCTGA
|
Protein sequence | MQGWQSPYLG QRELPRELSQ FELQAFFSFS PAERELIARR RGDGLRLGLA LHIGFVRMTG RPLNSVRVVP SVLLAHLGRE LGIETPDLAS VRALYARGRT LFDHQQLACE SLGFGWMTEH QRRALVRVLR DEVAQCADRE RLIVHARQWL YSHRLLVLRE RDIRGLVAAA LSELERTTAE AVRQSVPVST LKRWSAVLDA PRPDGQPCQS WLWSAPAKHS TRQIAEVLER ITCLKELGVD RALAEINNVL VRRYARRMAS RAPSVNARVK EPARTVETAC FLRYCLLTAT DQFILMFQRR VADLWRQCAD DAVAPIDWSR QYQLLLQELA ELARDEAGTA ELRTRLLELV AARRAQRTPS RASGIRRQLI AAIAPVRSLL VAAGSLSWTA TGEHPALQAL DILRAQYAAG DKTLPVDVTA ARLGAAWRQD IADTDRERAF RALEVATLFA LRRGLRNGSI WIEHSLSFRG RERLFIPDER WEVEARRHYA RLQMPAKAAN YLAPLLERVR AGVDAVATAV RTGALRVDDE LHLAPLAADD EDPEVSRLRS RLDQRIGEVQ LPEVILAVDA QVRFSWIMLA REPRSGQELL MVYAGILAHG TSLTAAECAR MIPQLSATSI RQAMRWAGDE RRLALACQAV LEHMQRQPIA ATWGRADLAS SDMMSLETSR RVWQARQDPR RQTASIGIYS HVKDRWGIFH AQPIVLNERQ AGAAIEGVVR QEKVETTQLA VDTHGYTDFA MGLARLLGFD LCPRLRELNQ RHLFVPRGMK VPEEIAAVCE ATIDTALIET HWDSLVHLAA SVITGHASAV TALARFGSAA RGDPIYDAGV QLGKLLRTAF LADYFVNEAF RRELRRVLNR GEAVNALKRA IYTGRVGPAQ ARRADEMQAV ADALSLLANI VMAWNTAQMQ VVLDRWANRR QIVPAELTGR IAPTRLEGIN LRGVFRFPLE RYAGQILPSQ TAVKTGAAG
|
| |