Gene Information |
Locus tag | Mpe_B0069 |
Symbol | |
ID | 4787672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | - |
Start bp | 58486 |
End bp | 61326 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640092478 |
Product | TnpA family transposase |
Protein accession | YP_001023083 |
Protein GI | 124262613 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
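The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; neither convention is stated in the record, although both agree with the listed values. The function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_length = feature_length(58486, 61326)
print(gene_length)                           # 2841
print(expected_protein_length(gene_length))  # 946
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.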
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.724381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00503767 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTAAGAG CTACGGGATT CAGGGGCAAG GTCAAGCTGA TCGCGCGACG CCGCGGCGAC GGTCTGCGGC TGGGCCTGGC GCTGCACATC GGCTTCGTGC GCATGACCGG CCGGCCGCTG AACAGCGTGC GCGTCGTGCC GAGCGTCCTG CTGGCCCACC TGGGCCGCGA ACTCGGCATC GAGACTCCCG ATCTGGCGTC CGTGCGGGCG CTGTATGCAC GGGGCCGCAC CCTGTTCGAT CACCAACAGC TGGCCTGCGA GTCCCTGGGA TTTGGCTGGA TGACTGAGCA CCAGCGTCGT GCGCTCGTGC GTGTGCTGCG CGATGAAGTC GCGCAGTGCG CCGACCGAGA ACGTCTGATC GTCCACGCCC GGCAATGGCT GTACAGCCAC AGGTTGCTGG TCTTGCGCGA GCGCGACATC CGCGGCCTGG TGGCGGCCGC CCTGAGCGAG CTCGAGCGCA CGACGGCCGA GGCGGTTCGG CAGTCGGTTC CCGTCTCGAC CCTCAAGCGC TGGAGCGCGG TGCTGGATGC CCCGCGTCCT GATGGACAAC CCTGCCAGTC CTGGTTGTGG AGCGCGCCGG CCAAGCATTC CACGCGGCAG ATAGCAGAGG TTCTGGAGCG AATCACTTGC CTGAAGGAGC TCGGCGTCGA CCGCGCGCTG GCCGAGATCA ACAACGTGCT GGTGCGCCGC TACGCGCGAC GCATGGCCTC GCGAGCGCCG TCGGTCAACG CCCGCGTCAA GGAGCCCGCC CGGACCGTCG AGACGGCCTG CTTCCTGCGC TACTGCTTGC TGACGGCCAC CGACCAGTTC ATCCTCATGT TCCAGCGCCG CGTCGCCGAT CTCTGGCGCC AATGTGCCGA CGATGCGGTT GCCCCCATCG ACTGGTCGCG GCAGTACCAG TTGCTGCTGC AGGAGTTGGC CGAGCTGGCC CGGGACGAAG CCGGCACGGC CGAACTGCGC ACGCGCCTGC TCGAACTGGT CGCAGCCCGG CGCGCGCAGA GAACACCGAG CCGGGCCTCG GGCATCCGTC GGCAGTTGAT CGCCGCCATC GCCCCGGTGC GCTCGCTGCT GGTGGCCGCT GGGAGCCTGT CGTGGACGGC CACGGGTGAG CACCCGGCGC TGCAGGCGCT GGACATCCTG CGCGCGCAGT ACGCAGCCGG CGACAAGACC TTGCCTGTCG ATGTCACCGC TGCACGGCTG GGGGCAGCGT GGCGCCAGGA CATCGCTGAC ACCGACCGGG AGCGGGCGTT CCGGGCACTG GAAGTGGCCA CCCTGTTCGC GCTGCGCCGG GGATTGCGCA ATGGCTCGAT CTGGATCGAA CACTCGCTGA GCTTCCGAGG CCGCGAGCGG CTGTTCATCC CGGACGAGCG CTGGGAGGTT GAGGCCCGGC GCCACTACGC CAGGCTACAG ATGCCGGCCA AGGCAGCGAA CTACCTCGCG CCGCTGCTGG AGCGCGTGCG CGCCGGCGTC GACGCCGTGG CCACGGCGGT ACGCACCGGC GCGCTGCGTG TCGACGACGA ACTGCACTTG GCTCCGCTGG CGGCCGACGA TGAGGACCCC GAAGTCAGCA GGCTGCGCAG CCGGCTCGAT CAGCGCATCG GCGAAGTGCA ACTGCCCGAG GTCATCCTGG CGGTGGACGC CCAGGTGCGC TTCAGCTGGA TCATGCTCGC ACGCGAGCCG CGATCTGGCC AGGAACTGCT GATGGTCTAC GCCGGCATCC TGGCCCACGG CACCAGCCTG ACTGCGGCCG AGTGCGCACG GATGATCCCG CAGCTGTCGG CCACCAGTAT TCGCCAGGCT 
ATGCGCTGGG CCGGTGACGA GCGGCGCCTG GCGCTGGCCT GCCAGGCGGT GCTGGAGCAC ATGCAACGCC AACCGATCGC CGCCACCTGG GGCCGAGCGG ATCTGGCTTC GTCAGACATG ATGAGCCTGG AGACCAGCCG CCGGGTCTGG CAGGCACGGC AGGACCCCAG GCGGCAGACG GCCTCGATCG GCATCTACAG CCACGTCAAG GACCGCTGGG GGATCTTCCA TGCCCAGCCC ATCGTGCTCA ACGAGCGTCA GGCTGGCGCG GCCATCGAGG GCGTGGTGCG GCAGGAGAAG GTCGAGACCA CGCAACTGGC TGTGGACACG CACGGCTACA CCGACTTCGC CATGGGCCTG GCTCGGCTGC TGGGCTTCGA TCTGTGTCCC AGGCTCAGGG AGCTCAATCA GCGGCACCTG TTCGTGCCGC GCGGCATGAA GGTGCCCGAG GAGATCGCTG CCGTGTGCGA GGCCACCATC GATACGGCCC TGATAGAGAC CCACTGGGAC AGTCTGGTGC ATCTGGCAGC CTCGGTGATC ACGGGCCACG CCAGCGCGGT CACGGCGTTG GCCAGGTTCG GCTCGGCCGC CCGAGGCGAC CCGATCTACG ACGCCGGCGT CCAGTTGGGC AAGCTGCTGC GCACGGCGTT CCTGGCCGAC TACTTCGTCA ACGAGGCGTT CCGCCGCGAG CTGCGACGGG TCCTCAACCG CGGCGAGGCG GTCAATGCGC TGAAGCGGGC GATCTACACC GGCCGTGTGG GGCCAGCGCA GGCACGGAGA GCCGACGAAA TGCAAGCGGT GGCCGATGCG CTGAGCCTGC TGGCCAACAT CGTGATGGCA TGGAACACGG CGCAGATGCA GGTAGTGCTG GACCGGTGGG CCAATAGGCG GCAGATCGTG CCGGCCGAGC TGACCGGGCG CATTGCGCCC ACGCGCCTTG AAGGCATCAA CCTGCGGGGA GTCTTCCGCT TCCCGCTGGA GCGCTACGCG GGCCAGATCC TGCCGTCACA AACTGCAGTG AAAACAGGTG CTGCTGGCTG A
|
Protein sequence | MLRATGFRGK VKLIARRRGD GLRLGLALHI GFVRMTGRPL NSVRVVPSVL LAHLGRELGI ETPDLASVRA LYARGRTLFD HQQLACESLG FGWMTEHQRR ALVRVLRDEV AQCADRERLI VHARQWLYSH RLLVLRERDI RGLVAAALSE LERTTAEAVR QSVPVSTLKR WSAVLDAPRP DGQPCQSWLW SAPAKHSTRQ IAEVLERITC LKELGVDRAL AEINNVLVRR YARRMASRAP SVNARVKEPA RTVETACFLR YCLLTATDQF ILMFQRRVAD LWRQCADDAV APIDWSRQYQ LLLQELAELA RDEAGTAELR TRLLELVAAR RAQRTPSRAS GIRRQLIAAI APVRSLLVAA GSLSWTATGE HPALQALDIL RAQYAAGDKT LPVDVTAARL GAAWRQDIAD TDRERAFRAL EVATLFALRR GLRNGSIWIE HSLSFRGRER LFIPDERWEV EARRHYARLQ MPAKAANYLA PLLERVRAGV DAVATAVRTG ALRVDDELHL APLAADDEDP EVSRLRSRLD QRIGEVQLPE VILAVDAQVR FSWIMLAREP RSGQELLMVY AGILAHGTSL TAAECARMIP QLSATSIRQA MRWAGDERRL ALACQAVLEH MQRQPIAATW GRADLASSDM MSLETSRRVW QARQDPRRQT ASIGIYSHVK DRWGIFHAQP IVLNERQAGA AIEGVVRQEK VETTQLAVDT HGYTDFAMGL ARLLGFDLCP RLRELNQRHL FVPRGMKVPE EIAAVCEATI DTALIETHWD SLVHLAASVI TGHASAVTAL ARFGSAARGD PIYDAGVQLG KLLRTAFLAD YFVNEAFRRE LRRVLNRGEA VNALKRAIYT GRVGPAQARR ADEMQAVADA LSLLANIVMA WNTAQMQVVL DRWANRRQIV PAELTGRIAP TRLEGINLRG VFRFPLERYA GQILPSQTAV KTGAAG
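The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch, not the database's own method. It uses the standard codon assignments, which translation table 11 shares with table 1 for amino acids; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG. `translate` is a hypothetical helper.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence listed above.
print(translate("ATGCTAAGAG CTACGGGATT CAGGGGCAAG"))  # MLRATGFRGK
print(translate("AAAACAGGTG CTGCTGGCTG A"))           # KTGAAG
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence.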
|
| |