Gene Mpe_A0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0103 
Symbol 
ID4784505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp103282 
End bp106332 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content70% 
IMG OID640088650 
Productexcinuclease ABC subunit A 
Protein accessionYP_001019300 
Protein GI124265296 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00106223 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.272791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGACA CGCCCCCACC CGATCGCGCT GCCGCAGCCT CAGCCGCCGA CGGGCACGAG 
GCCGCGCCGC GCCGCCTGCC GGCCGCCGAG CCCGCCGCGA CGATCCGCGT GCGCGGCGCG
CGCACCCACA ACCTGAAGAA CATCGACCTC GACATCCCGC GCCTGCAGCT GGTGGTGATC
ACCGGGCTGT CGGGTTCGGG CAAGTCGAGC CTGGCCTTCG ACACGCTGTA CGCCGAGGGC
CAGCGCCGCT ACGTCGAGAG CCTCAGCGCC TATGCGCGGC AGTTCCTGCA GTTGATGGAC
AAGCCCGACG TCGACGTGAT CGAGGGCCTG TCGCCGGCGA TCAGCATCGA GCAGAAGGCC
ACTTCGCACA ACCCGCGCTC GACCGTCGGC ACCGTCACCG AGATCCACGA CTACCTGCGC
CTGCTGTACG CCCGCGCCGG CACGCCGTTC TGCCCCGACC ACGAACTGCC GCTGCAGGCC
CAGAGCGTGA GCCAGATGGT CGACGCGGTG CTGGCGCTGC CCGAGGACAC GCGGCTGATG
ATCCTGGCGC CGGTGCTGCG CGAGCGCAAG GGCGAGCACG GCGAGCTGTT CGAGGACATG
CAGGCGCGTG GCTACGTGCG TTTCCGCATC GACGGTGCGG TGCACGAGAT CACCGACCTG
CCCAAGCTGA AGAAGAACGA GAAGCACGAC ATCGACGTGG TGATCGACCG GCTCAAGGTG
CGGCCCGAGA TGGGACAGCG CCTGGCCGAG AGCTTCGAGG CCGCGCTGCG TTTCGCCGAC
GGCCGCGCGA TCGCGTGGGA AATGGATTCC GGCCAGGAAC ACCTGTTCTC CGCGAAGTTC
GCCTGCCCGA TCTGCAGCTA CTCGCTGGCC GAGCTGGAGC CGCGGCTGTT CTCGTTCAAC
TCGCCGGTGG GCGCCTGCCC GAGCTGCGAC GGGCTCGGCG AGGTGACGGT GTTCGACCCG
GAGCGCGTGG TCGCCTTCCC GTCGCTCAGC CTCGCCAGCG GCGCGATCAA GGGCTGGGAC
CGGCGCAATG CCTACACGCA CTCGATGCTG GAGAGCGTGG CGCGGCACTA CGGCTTCGAC
ACCGACACGC CGTTCGAGCA GCTCGCGCCC GAGCACCGGC AGGTGCTGCT GCACGGCTCG
GGCGAGACCG AGATCGCCTT CGTCTACGCC AGCGAGGCCG CGAGCGGCAA GAAGCGCATC
GTGAAGCGCT CGCATCCCTT CGAAGGGATC ATCCCTAGCT TCGAGCGACG CTACCGCGAG
ACCGACTCGG TCGCGGTGCG CGAGGACCTG GCGCGCTACC AGGCCGCCAA GCCCTGCCCC
GACTGCCACG GCACGCGGCT GCGGCGCGAG GCGCGCCACG TGAAGCTGGT CGACGTGGCG
AGCGGCACGG CGCGGCCGAT CTACGAGATC GCGCACGCGA CGCTGCGCGA GGCGCAGCAG
GTGTTCGACG GGCTGCGGCT GCGGGGCGCG AAGGCGGAGA TCGCCGACAA GGTGGTGCGC
GAGATCTCGT CGCGGCTCAA GTTCCTGAAC GACGTGGGGC TCAACTACCT GAGCCTGGAC
CGCAGCGCCG ACACGCTGTC GGGCGGCGAG GCGCAGCGCA TCCGCCTGGC CAGCCAGATC
GGCTCCGGCC TGACCGGCGT GATGTACGTG CTCGACGAGC CCAGCATCGG CCTGCACCAG
CGCGACAACG ACCGGCTGAT CGCGACGCTG AAGCACCTGC GCGACATCGG CAACAGCGTG
CTGGTGGTGG AACACGACGA GGACATGATC CGCGCCGCCG ACCACGTGAT CGACATGGGC
CCGGGCGCCG GCGTGCACGG CGGCCGCGTG ATCGCGCAGG GTCGCTTCGC GGACGTGTGC
GCGGCGCCCG AATCGCTGAC CGGCCAGTAC CTCGGCCGCG CGCTGCGCAT CGCGGTGCCG
CGGCAGCGCA CGCCTTGGCG CAGCGACGCC ACGCCGCACA TCGACGCTGT CGCCGCGGCC
GGCCTGCCGC CGCGCAAGGC CGGCAAGAAT GAACCCACGC CCGGCCAGCG GGCCTGGGCC
GAACGCGGTG CGATCACCGC GCTGCGCATC GCCGGCGCGC GCGGCAACAA CCTCAAGGGC
GTGAGCGTCG AGATCCCGGT CGGCCTGCTG ACCTGCGTGA CCGGCGTGTC GGGTTCCGGC
AAGAGCACGC TGGTCAACGA CACGCTGTAC GCGGCGGTGG CGCGCAAGCT CTACCAGAGC
CATCTCGACC CGGCGCCGCA TGACGAGATC GACGGCATCG AGCACTTCGA CAAGGTGATC
AACGTCGACC AGAGCCCGAT CGGCCGCACG CCGCGCTCCA ACCCGGCCAC CTACACCGGC
CTGTTCACGC CGATCCGCGA GCTGTTCGCC GAGGTGCCGG CGGCGCGCGA GCGCGGCTAC
GGACCGGGCC GCTTCTCGTT CAACGTGGCC GGCGGGCGCT GCGAGGCCTG CCAGGGCGAC
GGCGTGCTGA AGGTCGAGAT GCACTTCCTG CCCGACGTCT ACGTGCCCTG CGACGTCTGC
CATGGCAAGC GCTACAACCG CGAGACGCTC GAGGTGCTCT ACAAGGGCAG GAACATCACC
GAGGTGCTGA ACCTCACGGT GGAGGACGCC CACGCCTACT TCAGCGCCGT GCCGAGCATC
GCGCGCAAGC TGCAGACGCT GCTCGACGTG GGCTTGGGCT ACATCACGCT GGGCCAGAGC
GCCACGACGC TGTCGGGCGG CGAGGCGCAG CGGGTCAAGC TCGCGCTCGA GCTGAGCAAG
CGCGACACCG GCCGCACGCT CTACATCCTC GACGAACCGA CCACCGGCCT GCACTTCGCC
GACATCGACC TGCTGCTGAA GGTGCTGCAC CAGCTGCGCG ACGCCGGCAA CACCATCGTC
GTGATCGAGC ACAACCTCGA TGTCATCAAG ACCGCCGACT GGCTGATCGA CATGGGCCCC
GAAGGCGGCG CCGGAGGCGG CCAGGTGGTG GCGGTGGGCA CGCCCGAGGA CGTGGCCGCC
AATCCGGCGA GCCACACCGG GCGCTACCTG CAGCCGCTGC TGGGCGAATA G
 
Protein sequence
MLDTPPPDRA AAASAADGHE AAPRRLPAAE PAATIRVRGA RTHNLKNIDL DIPRLQLVVI 
TGLSGSGKSS LAFDTLYAEG QRRYVESLSA YARQFLQLMD KPDVDVIEGL SPAISIEQKA
TSHNPRSTVG TVTEIHDYLR LLYARAGTPF CPDHELPLQA QSVSQMVDAV LALPEDTRLM
ILAPVLRERK GEHGELFEDM QARGYVRFRI DGAVHEITDL PKLKKNEKHD IDVVIDRLKV
RPEMGQRLAE SFEAALRFAD GRAIAWEMDS GQEHLFSAKF ACPICSYSLA ELEPRLFSFN
SPVGACPSCD GLGEVTVFDP ERVVAFPSLS LASGAIKGWD RRNAYTHSML ESVARHYGFD
TDTPFEQLAP EHRQVLLHGS GETEIAFVYA SEAASGKKRI VKRSHPFEGI IPSFERRYRE
TDSVAVREDL ARYQAAKPCP DCHGTRLRRE ARHVKLVDVA SGTARPIYEI AHATLREAQQ
VFDGLRLRGA KAEIADKVVR EISSRLKFLN DVGLNYLSLD RSADTLSGGE AQRIRLASQI
GSGLTGVMYV LDEPSIGLHQ RDNDRLIATL KHLRDIGNSV LVVEHDEDMI RAADHVIDMG
PGAGVHGGRV IAQGRFADVC AAPESLTGQY LGRALRIAVP RQRTPWRSDA TPHIDAVAAA
GLPPRKAGKN EPTPGQRAWA ERGAITALRI AGARGNNLKG VSVEIPVGLL TCVTGVSGSG
KSTLVNDTLY AAVARKLYQS HLDPAPHDEI DGIEHFDKVI NVDQSPIGRT PRSNPATYTG
LFTPIRELFA EVPAARERGY GPGRFSFNVA GGRCEACQGD GVLKVEMHFL PDVYVPCDVC
HGKRYNRETL EVLYKGRNIT EVLNLTVEDA HAYFSAVPSI ARKLQTLLDV GLGYITLGQS
ATTLSGGEAQ RVKLALELSK RDTGRTLYIL DEPTTGLHFA DIDLLLKVLH QLRDAGNTIV
VIEHNLDVIK TADWLIDMGP EGGAGGGQVV AVGTPEDVAA NPASHTGRYL QPLLGE