Gene Mpe_B0454 details


Gene Information

Locus tag: Mpe_B0454
Symbol:
ID: 4787594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008826
Strand:
Start bp: 403362
End bp: 406271
Gene Length: 2910 bp
Protein Length: 969 aa
Translation table: 11
GC content: 69%
IMG OID: 640092884
Product: TnpA family transposase
Protein accession: YP_001023462
Protein GI: 124262992
COG category: [L] Replication, recombination and repair
COG ID: [COG4644] Transposase and inactivated derivatives, TnpA family
TIGRFAM ID:
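
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 403362
end_bp = 406271
protein_length_aa = 969

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(gene_length_bp)       # 2910
print(expected_protein_aa)  # 969
```

Under these assumptions the computed values agree with the listed Gene Length (2910 bp) and Protein Length (969 aa).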


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 158
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGGCT GGCAATCACC GTATCTGGGC CAGCGCGAGC TGCCGCGTGA GCTCAGCCAG 
TTTGAGCTGC AGGCCTTCTT CAGCTTCAGC CCCGCCGAGC GAGAGCTGAT CGCGCGACGC
CGCGGCGACG GTCTGCGGCT GGGCCTGGCG CTGCACATCG GCTTCGTGCG CATGACCGGC
CGGCCGCTGA ACAGCGTGCG CGTCGTGCCG AGCGTCCTGC TGGCCCACCT GGGCCGCGAA
CTCGGCATCG AGACTCCCGA TCTGGCGTCC GTGCGGGCGC TGTATGCACG GGGCCGCACC
CTGTTCGATC ACCAACAGCT GGCCTGCGAG TCCCTGGGAT TTGGCTGGAT GACTGAGCAC
CAGCGTCGTG CGCTCGTGCG TGTGCTGCGC GATGAAGTCG CGCAGTGCGC CGACCGAGAA
CGTCTGATCG TCCACGCCCG GCAATGGCTG TACAGCCACA GGTTGCTGGT CTTGCGCGAG
CGCGACATCC GCGGCCTGGT GGCGGCCGCC CTGAGCGAGC TCGAGCGCAC GACGGCCGAG
GCGGTTCGGC AGTCGGTTCC CGTCTCGACC CTCAAGCGCT GGAGCGCGGT GCTGGATGCC
CCGCGTCCTG ATGGACAACC CTGCCAGTCC TGGTTGTGGA GCGCGCCGGC CAAGCATTCC
ACGCGGCAGA TAGCAGAGGT TCTGGAGCGA ATCACTTGCC TGAAGGAGCT CGGCGTCGAC
CGCGCGCTGG CCGAGATCAA CAACGTGCTG GTGCGCCGCT ACGCGCGACG CATGGCCTCG
CGAGCGCCGT CGGTCAACGC CCGCGTCAAG GAGCCCGCCC GGACCGTCGA GACGGCCTGC
TTCCTGCGCT ACTGCTTGCT GACGGCCACC GACCAGTTCA TCCTCATGTT CCAGCGCCGC
GTCGCCGATC TCTGGCGCCA ATGTGCCGAC GATGCGGTTG CCCCCATCGA CTGGTCGCGG
CAGTACCAGT TGCTGCTGCA GGAGTTGGCC GAGCTGGCCC GGGACGAAGC CGGCACGGCC
GAACTGCGCA CGCGCCTGCT CGAACTGGTC GCAGCCCGGC GCGCGCAGAG AACACCGAGC
CGGGCCTCGG GCATCCGTCG GCAGTTGATC GCCGCCATCG CCCCGGTGCG CTCGCTGCTG
GTGGCCGCTG GGAGCCTGTC GTGGACGGCC ACGGGTGAGC ACCCGGCGCT GCAGGCGCTG
GACATCCTGC GCGCGCAGTA CGCAGCCGGC GACAAGACCT TGCCTGTCGA TGTCACCGCT
GCACGGCTGG GGGCAGCGTG GCGCCAGGAC ATCGCTGACA CCGACCGGGA GCGGGCGTTC
CGGGCACTGG AAGTGGCCAC CCTGTTCGCG CTGCGCCGGG GATTGCGCAA TGGCTCGATC
TGGATCGAAC ACTCGCTGAG CTTCCGAGGC CGCGAGCGGC TGTTCATCCC GGACGAGCGC
TGGGAGGTTG AGGCCCGGCG CCACTACGCC AGGCTACAGA TGCCGGCCAA GGCAGCGAAC
TACCTCGCGC CGCTGCTGGA GCGCGTGCGC GCCGGCGTCG ACGCCGTGGC CACGGCGGTA
CGCACCGGCG CGCTGCGTGT CGACGACGAA CTGCACTTGG CTCCGCTGGC GGCCGACGAT
GAGGACCCCG AAGTCAGCAG GCTGCGCAGC CGGCTCGATC AGCGCATCGG CGAAGTGCAA
CTGCCCGAGG TCATCCTGGC GGTGGACGCC CAGGTGCGCT TCAGCTGGAT CATGCTCGCA
CGCGAGCCGC GATCTGGCCA GGAACTGCTG ATGGTCTACG CCGGCATCCT GGCCCACGGC
ACCAGCCTGA CTGCGGCCGA GTGCGCACGG ATGATCCCGC AGCTGTCGGC CACCAGTATT
CGCCAGGCTA TGCGCTGGGC CGGTGACGAG CGGCGCCTGG CGCTGGCCTG CCAGGCGGTG
CTGGAGCACA TGCAACGCCA ACCGATCGCC GCCACCTGGG GCCGAGCGGA TCTGGCTTCG
TCAGACATGA TGAGCCTGGA GACCAGCCGC CGGGTCTGGC AGGCACGGCA GGACCCCAGG
CGGCAGACGG CCTCGATCGG CATCTACAGC CACGTCAAGG ACCGCTGGGG GATCTTCCAT
GCCCAGCCCA TCGTGCTCAA CGAGCGTCAG GCTGGCGCGG CCATCGAGGG CGTGGTGCGG
CAGGAGAAGG TCGAGACCAC GCAACTGGCT GTGGACACGC ACGGCTACAC CGACTTCGCC
ATGGGCCTGG CTCGGCTGCT GGGCTTCGAT CTGTGTCCCA GGCTCAGGGA GCTCAATCAG
CGGCACCTGT TCGTGCCGCG CGGCATGAAG GTGCCCGAGG AGATCGCTGC CGTGTGCGAG
GCCACCATCG ATACGGCCCT GATAGAGACC CACTGGGACA GTCTGGTGCA TCTGGCAGCC
TCGGTGATCA CGGGCCACGC CAGCGCGGTC ACGGCGTTGG CCAGGTTCGG CTCGGCCGCC
CGAGGCGACC CGATCTACGA CGCCGGCGTC CAGTTGGGCA AGCTGCTGCG CACGGCGTTC
CTGGCCGACT ACTTCGTCAA CGAGGCGTTC CGCCGCGAGC TGCGACGGGT CCTCAACCGC
GGCGAGGCGG TCAATGCGCT GAAGCGGGCG ATCTACACCG GCCGTGTGGG GCCAGCGCAG
GCACGGAGAG CCGACGAAAT GCAAGCGGTG GCCGATGCGC TGAGCCTGCT GGCCAACATC
GTGATGGCAT GGAACACGGC GCAGATGCAG GTAGTGCTGG ACCGGTGGGC CAATAGGCGG
CAGATCGTGC CGGCCGAGCT GACCGGGCGC ATTGCGCCCA CGCGCCTTGA AGGCATCAAC
CTGCGGGGAG TCTTCCGCTT CCCGCTGGAG CGCTACGCGG GCCAGATCCT GCCGTCACAA
ACTGCAGTGA AAACAGGTGC TGCTGGCTGA
 
Protein sequence
MQGWQSPYLG QRELPRELSQ FELQAFFSFS PAERELIARR RGDGLRLGLA LHIGFVRMTG 
RPLNSVRVVP SVLLAHLGRE LGIETPDLAS VRALYARGRT LFDHQQLACE SLGFGWMTEH
QRRALVRVLR DEVAQCADRE RLIVHARQWL YSHRLLVLRE RDIRGLVAAA LSELERTTAE
AVRQSVPVST LKRWSAVLDA PRPDGQPCQS WLWSAPAKHS TRQIAEVLER ITCLKELGVD
RALAEINNVL VRRYARRMAS RAPSVNARVK EPARTVETAC FLRYCLLTAT DQFILMFQRR
VADLWRQCAD DAVAPIDWSR QYQLLLQELA ELARDEAGTA ELRTRLLELV AARRAQRTPS
RASGIRRQLI AAIAPVRSLL VAAGSLSWTA TGEHPALQAL DILRAQYAAG DKTLPVDVTA
ARLGAAWRQD IADTDRERAF RALEVATLFA LRRGLRNGSI WIEHSLSFRG RERLFIPDER
WEVEARRHYA RLQMPAKAAN YLAPLLERVR AGVDAVATAV RTGALRVDDE LHLAPLAADD
EDPEVSRLRS RLDQRIGEVQ LPEVILAVDA QVRFSWIMLA REPRSGQELL MVYAGILAHG
TSLTAAECAR MIPQLSATSI RQAMRWAGDE RRLALACQAV LEHMQRQPIA ATWGRADLAS
SDMMSLETSR RVWQARQDPR RQTASIGIYS HVKDRWGIFH AQPIVLNERQ AGAAIEGVVR
QEKVETTQLA VDTHGYTDFA MGLARLLGFD LCPRLRELNQ RHLFVPRGMK VPEEIAAVCE
ATIDTALIET HWDSLVHLAA SVITGHASAV TALARFGSAA RGDPIYDAGV QLGKLLRTAF
LADYFVNEAF RRELRRVLNR GEAVNALKRA IYTGRVGPAQ ARRADEMQAV ADALSLLANI
VMAWNTAQMQ VVLDRWANRR QIVPAELTGR IAPTRLEGIN LRGVFRFPLE RYAGQILPSQ
TAVKTGAAG
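
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The sketch below is an illustration, not the method used to produce this record. It assumes the gene sequence is shown in the coding orientation, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code. The helper name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, with the first, second and third base
# each cycling through T, C, A, G (third base fastest). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Stop at the last complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First and last lines of the gene sequence above.
first_line = "ATGCAAGGCT GGCAATCACC GTATCTGGGC CAGCGCGAGC TGCCGCGTGA GCTCAGCCAG"
last_line = "ACTGCAGTGA AAACAGGTGC TGCTGGCTGA"

print(translate(first_line))  # MQGWQSPYLGQRELPRELSQ
print(translate(last_line))   # TAVKTGAAG*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 9 residues followed by the stop codon TGA.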