Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3346 |
Symbol | |
ID | 4786387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3551139 |
End bp | 3556928 |
Gene Length | 5790 bp |
Protein Length | 1929 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640091919 |
Product | UvrA family protein |
Protein accession | YP_001022534 |
Protein GI | 124268530 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.973275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGGCC CGATCCCTAG AATCTTCCGC TCACCCCCCG GAACCCCCAT GCCCAACGGT CAGCGCCCTC CCGGCGTGAT CCGCATTCGC GGTGCACGCC AGCACAACCT GAAGAACCTC GACCTGGACC TCCACACGGG AGAACTGACG GTCGTCACCG GGCCCAGCGG CTCGGGCAAG TCCAGCCTGG TGTTCGACAC GCTGTACGCC GAGGGCCAGC GCCGCTACGT CGAGACCTTC AGCGCCTATG CGCGCCAGTT CCTCGACCGC ATGGACCGCC CGGCGGTCGA CCATGTCGAG GGCGTGCCAC CGGCCATCGC GATCGACCAG ACCAACCCGG TGCGCACCTC GCGCTCCACG GTCGGCACCA TGACCGAGCT GAACGACCAC CTGAAGCTGC TGTTCGCGCG CGCCGCGCAG CTGTTCGACC GGCAGACCGC GAAGCCGGTG CGCGAGGACG ATCCGGAGTC GATCTACCGC GAACTGCAGC AGCGCGCCGC GCCCGGCCAG CGCCTCGTGG TGACTTTCCC GGTCGAGCTG CCGGCCAGCG CGACCGCGGC CGACGTCGAG CAATGGCTCG CCGCCAGCGG CTACACGCGC GTGCAGGCCG AGCGCGAGAT CGCCACGCCC ACCGGGCCGC GCAAGCTGCT CGACGTGGTG GCCGACCGCT TGCGGCTCGA CGGCACCGAG AAATCGCGCG TCGTCGAGGC GCTCGAGGTG TCGCTCAAGC GCGGCGGCGG GCGCGTCAAC GTCTATGCCG TGGCGGGAGA GGGCGAAGAA CAGCTCTGGC GCTTCTCCTC CGGCCTGCAC AGCCCCGACA GCGAGCTGCG CTACAGCCAT CCGCAGCCGT CGATGTTCTC GTTCAACTCG GCCTTCGGCG CTTGCGAGAC CTGCCGCGGC TTCGGCCGCG TGATCGGCGT CGACTGGGGG CTGGTGATGC CGGACCACCG CAAGACGCTG CGCACCGGTG TCGTCAAGAC CATCCAGACG CCGGCCTGGA AGGAGATCCA GGCCGACCTG CTGAAGTACG CCGGCGAGGC CGGCATCCCG CGCGACACCG CCTGGAGCCA GCTCAGCGAC GCGCAGCGCG ACTGGGTGCT CGACGGCACG CCCAACTGGA AGGGGAACTG GAACAAGCAG TGGTACGGCA TCAAGCGCTT CTTCGAGTAC CTGGAGAGCA AGGCCTACAA GATGCACATC CGTGTGCTCC TGTCCAAGTA CCGCAGCTAC ACGCCGTGCC CGGCCTGCGG CGGCGCGCGC CTGAAGCTGG AGTCGCTGCA GTGGCGCATC GGCACGAAGG AGGGGGCCGA TGCGGCCCTG AAGGCGGCCC AGCGCTTCCT GCCGGTCGGC GCGGCCTGGT CGCGCGAACA GCTGGAAGCG CTGCCGGGCC TGAGCCTGCA CGACCTGATG CTGCTGCCGA TCGATCGGCT GCGGCGCTTT TTCGACGAGC TCACCTTGCC GAGCACGATG CTCGACGACG CACTGGAGCT GCTGCTCGAC GAGGTGCGAA CCCGGCTCAA GTACCTGTGC GACGTGGGCA TCGGCTACCT CACGCTCGAC CGCCAGAGCC GCACGCTGAG CGGCGGCGAG GTGCAGCGCA TCAACCTCAC CACCGCGCTC GGCACGTCGC TGGTCAACAC GCTGTTCGTG CTGGACGAGC CGTCGATCGG CCTGCACCCG CGCGACATGA ACCGCATCGT GCAGGCCATG CACCGCCTGC GCGATGCCGG CAACACGCTG GTGGTCGTGG AGCACGACCC GGCCGTGATG CTGGCCGCCG ACCGCGTGCT CGACATGGGC CCGGGCCCCG GCACCCAGGG CGGGCGCGTC GTGTTCGACG GCACGCCCGA CGACCTCAAG CGCGCCGACA CGATGACCGG CGCCTACCTC GGCGCGCGCC GCAGCATCGG CCTGGGTCTC AAGCGGCTGG TGACCGACGG CACGCCGAGG CTCATCGTGG AGGGCGCGCG CGAGCACAAC CTGCGCGGCA TCACGGTCGA GTTCCCGCTG CAACGCCTGG TGGTGGTGAC CGGCGTGTCC GGCTCAGGCA AGAGCACGCT GATCCAGGAC CTGCTGTTCC CCGCGCTGGC GCGCCACTTC GGCAAGGCCA CCGAGACGCC GGGCGCGCAC GACCGCCTGC TCGGCGCCGA CTGGCTGAGC GACGCGGTGT TCGTCGACCA GAGCCCGATC GGCAAGACCG CGCGCTCCAA CCCGGCTAGC TACGTCGGGG CCTTCGACAC GCTGCGCAAC ATCTTCGCCG AGGCACCGAT GGCGCTGCAG CGCGGCTACG GCGCGGGCAT GTTCAGCTTC AATGCCGGCG ACGGCCGCTG CCCGACCTGC GGCGGTTCGG GCTTCGAGCA CGTGGAGATG CAGTTCCTCA GCGACGTCTA CCTGCGCTGC CCCGACTGCG ACGGCAAGCG CTTCCGCGCC GAACTGCTGG AGGTGAAGGT CCAGCGCGGC GGCAAGTCCT TCGACGTGAG CGAGGTGCTG GAGCTGACCA TCGCCGACGC GGTGCGCTAC TTCGCCGACG ATCGCGAGGT GTTGCGCGCG CTGCAGCCTC TGGCCGACGT GGGCCTGGAC TACGTGAAGC TCGGCCAGCC GGTGCCCACG CTGAGCGGCG GCGAGGCGCA GCGCCTGAAG CTCGCCGGCT TCCTGGCCGA GGCCGCAAAG AACGGCAGCA GCTCGCGCCA GGCGGTGGCG AAGAAGGGCA CGCTGTTCCT GTTCGACGAG CCAACCACCG GCCTGCACTT CGACGACATC GCCAAGCTGA TGCGTGCCTT CCGCAAGCTG CTGGAGGCCG GCCATTCGCT GCTGGTGATC GAACACAACC TCGACGTGAT CCGCGCGGCC GACTGGCTGG TCGACCTGGG TCCCGAGGGC GGCGAGGCCG GCGGCGAGCT GGTCTGCTCC GGCACCCCCG ACGACGTGAA GCGGCACCCC ACGTCGCACA CCGGCGCGGC GCTGCGCGAG TACGAGCAGA GCCTCGGGCA GGGCGGCCTC GCGGTCGAGG AGGGGCTGCC GCTGCAGTCG CTGCTGACCG AGGCGCGGCG CGCCCGGCGC GAGCGCGAGG CCGGCGCTGC CGCCCACGCG ATCCAGATCG TCAATGCCCG CGAGCACAAC CTCCAGGGCC TGTCGGTCGA CATCCCGCGC GGGAAGTTCA GCGTGGTCAC CGGTGTCTCC GGTTCGGGCA AGAGCACGCT GGCCTTCGAC ATCCTGTTCA ACGAGGGCCA GCGCCGCTAT CTCGAGAGCC TCAACGCCTA CGCGCGCAGC ATCGTGCAGC CGGCCGGCCG GCCCGAGGTC GACGCGGTGT ACGGCATCCC GCCCACCGTC GCGATCGAGC AGCGCCTGAG CCGCGGTGGC CGCAAGAGCA CCGTCGCCAC CACCACCGAG GTCTATCACT TCCTGCGCCT GCTCTACGTC AAGCTCGGCA CGCAGTACTG TCCCAAATGC ACGGCCGACG CCAGCGTGGC GGTGCGACCG CAGACGCCGG AGTCGATCGC CGCGCAGCTG CTGCGCGACT ACAAGGGCCA GCACATCGGC CTGCTCGCGC CGCTGGTGGT GAACCGCAAG GGCGTCTACA CCGACCTCGC GAAGTGGGCC GCGCAGCGCG GCCACACGCA CCTGCGGGTC GACGGCGAAT TCCTGAAGAC CTCGCCGTGG CCACGCATCG ACCGTTTCAA GGAGCACACG CTGGAGCTGC CGGTCGGCGA CATCGTCGTC ACGCCCGACA ACGAGGCCGA ACTGCGCGCG CTGCTCGCGA AGACGCTGGA GCTCGGCAAG GGCGTGGTGC ACCTGCTGGG CCCGCTCGAC GGCCTGAAGG CGGCGATGGC CGCCGGCGCG CCGACGCACC GCATCGGCCG CGTCAAGGTG TTCTCGACCA AGCGCGCCTG CCCCGCCTGC GGCACCAGCT ACCCCGAGCT GGACCCGCGC ATGTTCTCGT ACAACAGCAA GCACGGTTGG TGCCCCGATT GCGTGGGCAC CGGCCTCACG CTGACGCGCG AGCAGCGCAA GGCCCTCGAC GATTCGGTGC GCGACGACGA CACCGGCGGG CGCGAGATGA CCTTCGCCGA ACCCGAGATC GAGGACGTGG CCGACGCGCC CTGCGGCACC TGCCATGGCG CGCGCTTGAA CCCGGTCTCG CTGGCCGTGC GTTTCGGCCG CGGGGCGGAG GGCGAGCAGT CGATCGCCGC GGTGGCGGCG CGCGCGGTGA ACGACGCGCG CCGCTGGATC GAGACCCAGA AGCTCGATGG CCGCGAGGCC ACGATCGCAC GCGACGTGAT CGCCGAGATC CGCTCGCGGC TGGGCTTCCT CGACCAGGTG GGCCTGGGCT ATCTGACGCT CGATCGCGCC GCGCCCACGC TGAGCGGCGG CGAGGCGCAG CGCATCCGCC TCGCGGCGCA GCTGGGCAGC AATCTGCAGG GCGTGTGCTA CGTGCTCGAT GAGCCGACCA TCGGTCTGCA CCCGCGCGAC AACCGCGTGC TGCTCGATGC GCTGCACACG CTGGGTTCGC AGGGCAACAC GCTGGTGGTG GTGGAGCACG ACGAAGACAC CATCCGCCGC GCCGACCACA TCATCGACAT CGGCCCCGGC GCCGGCAAGC GCGGCGGCCG TCTGGTGGCG CAGGGCACCG CCGCCGACCT GCTGCGCGCC CCCGACTCGC TGACCGGCCG CCTGCTCGAG CACCCGATGC GCCACCCGCT GCAGGCGCGG CGCGCGGTGC CGCCGGGCGT GCCCTGGCTC ACGCTGCGCG GCGCCACGCT GCACAACTTG GCCGGGCTCG ACGTCGGCGT GCCGGTCGGG CGGTTGGTTG CGGTGACGGG CGTTTCCGGC TCGGGCAAGA GCACCGTCGC GCGCGACGTG CTGCTGGTCA ACGTGCACGC GGCGGTGGCG ATGAAGGCCA CGAAGGCCGG TCGCGATGCG TTGGCCGCCG GCAAGCGGCC GGCCTGGGTG GGTTGCAGCC AGCTCGAAGG CTTCGAGCCC ATCGATCGCG TGCTCGAGGT CGACCAGACG CCGATCGGCA AGACGCCGCG TTCCTGCCCG GCCACCTACA TCGGCTTCTG GGACACGATC CGCAAGCTGT TCGCCGAGAC GCTGGAGGCC AAGGCGCGCG GCTATGCCGC CGGCCGCTTC TCGTTCAACA CCGGCGAGGG CCGCTGTCCG GGCTGCGAGG GCCAGGGCCT GCGCACCATC GAGATGGCCT TCCTGCCGGA CGTGAAGGTG CCCTGCGACC TGTGCCACGG CGCGCGCTTC AACCCCGAGA CGCTGGCCGT CACCTGGCGC GGCAAGAGCA TCGGCGACGT GCTGCAGATG GAGGTGGACG AGGCGGTCGA CTTCTTCGCC TCCATGCCCG CCATCGCCCA CCCGCTGCAA TTGCTGAAGG ACGTGGGCCT GGGCTACCTG ACACTGGGTC AGCCCTCGCC CACGCTGAGC GGCGGCGAGG CGCAGCGCAT CAAGCTCGTC ACCGAGCTCA GCAAGGTGCG TGACGACGTG ACGCGGCGCG GCCAGAAGCC GCCCCACACG CTCTACGTGC TGGACGAACC GACGGTGGGC CTGCACATGG CCGACGTCGA GAAGCTGATC CGCGTGCTGC ATCGGCTGGT CGACGGCGGC CACAGCGTGG TGGTGATCGA GCACGATCTC GACGTCATCG CCGAAGCCGA CTGGGTGCTC GACCTCGGTC CCGAGGGCGG TGCCGGCGGT GGCCGTGTGG TGGCGGCGGG CACGCCCGAG CAGGTGGTGG CCGCGGGCAC GCACACCGGC GTGGCGCTGG CGCCGGTGCT GAAGCGGTGA
|
Protein sequence | MTGPIPRIFR SPPGTPMPNG QRPPGVIRIR GARQHNLKNL DLDLHTGELT VVTGPSGSGK SSLVFDTLYA EGQRRYVETF SAYARQFLDR MDRPAVDHVE GVPPAIAIDQ TNPVRTSRST VGTMTELNDH LKLLFARAAQ LFDRQTAKPV REDDPESIYR ELQQRAAPGQ RLVVTFPVEL PASATAADVE QWLAASGYTR VQAEREIATP TGPRKLLDVV ADRLRLDGTE KSRVVEALEV SLKRGGGRVN VYAVAGEGEE QLWRFSSGLH SPDSELRYSH PQPSMFSFNS AFGACETCRG FGRVIGVDWG LVMPDHRKTL RTGVVKTIQT PAWKEIQADL LKYAGEAGIP RDTAWSQLSD AQRDWVLDGT PNWKGNWNKQ WYGIKRFFEY LESKAYKMHI RVLLSKYRSY TPCPACGGAR LKLESLQWRI GTKEGADAAL KAAQRFLPVG AAWSREQLEA LPGLSLHDLM LLPIDRLRRF FDELTLPSTM LDDALELLLD EVRTRLKYLC DVGIGYLTLD RQSRTLSGGE VQRINLTTAL GTSLVNTLFV LDEPSIGLHP RDMNRIVQAM HRLRDAGNTL VVVEHDPAVM LAADRVLDMG PGPGTQGGRV VFDGTPDDLK RADTMTGAYL GARRSIGLGL KRLVTDGTPR LIVEGAREHN LRGITVEFPL QRLVVVTGVS GSGKSTLIQD LLFPALARHF GKATETPGAH DRLLGADWLS DAVFVDQSPI GKTARSNPAS YVGAFDTLRN IFAEAPMALQ RGYGAGMFSF NAGDGRCPTC GGSGFEHVEM QFLSDVYLRC PDCDGKRFRA ELLEVKVQRG GKSFDVSEVL ELTIADAVRY FADDREVLRA LQPLADVGLD YVKLGQPVPT LSGGEAQRLK LAGFLAEAAK NGSSSRQAVA KKGTLFLFDE PTTGLHFDDI AKLMRAFRKL LEAGHSLLVI EHNLDVIRAA DWLVDLGPEG GEAGGELVCS GTPDDVKRHP TSHTGAALRE YEQSLGQGGL AVEEGLPLQS LLTEARRARR EREAGAAAHA IQIVNAREHN LQGLSVDIPR GKFSVVTGVS GSGKSTLAFD ILFNEGQRRY LESLNAYARS IVQPAGRPEV DAVYGIPPTV AIEQRLSRGG RKSTVATTTE VYHFLRLLYV KLGTQYCPKC TADASVAVRP QTPESIAAQL LRDYKGQHIG LLAPLVVNRK GVYTDLAKWA AQRGHTHLRV DGEFLKTSPW PRIDRFKEHT LELPVGDIVV TPDNEAELRA LLAKTLELGK GVVHLLGPLD GLKAAMAAGA PTHRIGRVKV FSTKRACPAC GTSYPELDPR MFSYNSKHGW CPDCVGTGLT LTREQRKALD DSVRDDDTGG REMTFAEPEI EDVADAPCGT CHGARLNPVS LAVRFGRGAE GEQSIAAVAA RAVNDARRWI ETQKLDGREA TIARDVIAEI RSRLGFLDQV GLGYLTLDRA APTLSGGEAQ RIRLAAQLGS NLQGVCYVLD EPTIGLHPRD NRVLLDALHT LGSQGNTLVV VEHDEDTIRR ADHIIDIGPG AGKRGGRLVA QGTAADLLRA PDSLTGRLLE HPMRHPLQAR RAVPPGVPWL TLRGATLHNL AGLDVGVPVG RLVAVTGVSG SGKSTVARDV LLVNVHAAVA MKATKAGRDA LAAGKRPAWV GCSQLEGFEP IDRVLEVDQT PIGKTPRSCP ATYIGFWDTI RKLFAETLEA KARGYAAGRF SFNTGEGRCP GCEGQGLRTI EMAFLPDVKV PCDLCHGARF NPETLAVTWR GKSIGDVLQM EVDEAVDFFA SMPAIAHPLQ LLKDVGLGYL TLGQPSPTLS GGEAQRIKLV TELSKVRDDV TRRGQKPPHT LYVLDEPTVG LHMADVEKLI RVLHRLVDGG HSVVVIEHDL DVIAEADWVL DLGPEGGAGG GRVVAAGTPE QVVAAGTHTG VALAPVLKR
|
| |