Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0103 |
Symbol | |
ID | 4784505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 103282 |
End bp | 106332 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640088650 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001019300 |
Protein GI | 124265296 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00106223 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.272791 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGACA CGCCCCCACC CGATCGCGCT GCCGCAGCCT CAGCCGCCGA CGGGCACGAG GCCGCGCCGC GCCGCCTGCC GGCCGCCGAG CCCGCCGCGA CGATCCGCGT GCGCGGCGCG CGCACCCACA ACCTGAAGAA CATCGACCTC GACATCCCGC GCCTGCAGCT GGTGGTGATC ACCGGGCTGT CGGGTTCGGG CAAGTCGAGC CTGGCCTTCG ACACGCTGTA CGCCGAGGGC CAGCGCCGCT ACGTCGAGAG CCTCAGCGCC TATGCGCGGC AGTTCCTGCA GTTGATGGAC AAGCCCGACG TCGACGTGAT CGAGGGCCTG TCGCCGGCGA TCAGCATCGA GCAGAAGGCC ACTTCGCACA ACCCGCGCTC GACCGTCGGC ACCGTCACCG AGATCCACGA CTACCTGCGC CTGCTGTACG CCCGCGCCGG CACGCCGTTC TGCCCCGACC ACGAACTGCC GCTGCAGGCC CAGAGCGTGA GCCAGATGGT CGACGCGGTG CTGGCGCTGC CCGAGGACAC GCGGCTGATG ATCCTGGCGC CGGTGCTGCG CGAGCGCAAG GGCGAGCACG GCGAGCTGTT CGAGGACATG CAGGCGCGTG GCTACGTGCG TTTCCGCATC GACGGTGCGG TGCACGAGAT CACCGACCTG CCCAAGCTGA AGAAGAACGA GAAGCACGAC ATCGACGTGG TGATCGACCG GCTCAAGGTG CGGCCCGAGA TGGGACAGCG CCTGGCCGAG AGCTTCGAGG CCGCGCTGCG TTTCGCCGAC GGCCGCGCGA TCGCGTGGGA AATGGATTCC GGCCAGGAAC ACCTGTTCTC CGCGAAGTTC GCCTGCCCGA TCTGCAGCTA CTCGCTGGCC GAGCTGGAGC CGCGGCTGTT CTCGTTCAAC TCGCCGGTGG GCGCCTGCCC GAGCTGCGAC GGGCTCGGCG AGGTGACGGT GTTCGACCCG GAGCGCGTGG TCGCCTTCCC GTCGCTCAGC CTCGCCAGCG GCGCGATCAA GGGCTGGGAC CGGCGCAATG CCTACACGCA CTCGATGCTG GAGAGCGTGG CGCGGCACTA CGGCTTCGAC ACCGACACGC CGTTCGAGCA GCTCGCGCCC GAGCACCGGC AGGTGCTGCT GCACGGCTCG GGCGAGACCG AGATCGCCTT CGTCTACGCC AGCGAGGCCG CGAGCGGCAA GAAGCGCATC GTGAAGCGCT CGCATCCCTT CGAAGGGATC ATCCCTAGCT TCGAGCGACG CTACCGCGAG ACCGACTCGG TCGCGGTGCG CGAGGACCTG GCGCGCTACC AGGCCGCCAA GCCCTGCCCC GACTGCCACG GCACGCGGCT GCGGCGCGAG GCGCGCCACG TGAAGCTGGT CGACGTGGCG AGCGGCACGG CGCGGCCGAT CTACGAGATC GCGCACGCGA CGCTGCGCGA GGCGCAGCAG GTGTTCGACG GGCTGCGGCT GCGGGGCGCG AAGGCGGAGA TCGCCGACAA GGTGGTGCGC GAGATCTCGT CGCGGCTCAA GTTCCTGAAC GACGTGGGGC TCAACTACCT GAGCCTGGAC CGCAGCGCCG ACACGCTGTC GGGCGGCGAG GCGCAGCGCA TCCGCCTGGC CAGCCAGATC GGCTCCGGCC TGACCGGCGT GATGTACGTG CTCGACGAGC CCAGCATCGG CCTGCACCAG CGCGACAACG ACCGGCTGAT CGCGACGCTG AAGCACCTGC GCGACATCGG CAACAGCGTG CTGGTGGTGG AACACGACGA GGACATGATC CGCGCCGCCG ACCACGTGAT CGACATGGGC CCGGGCGCCG GCGTGCACGG CGGCCGCGTG ATCGCGCAGG GTCGCTTCGC GGACGTGTGC GCGGCGCCCG AATCGCTGAC CGGCCAGTAC CTCGGCCGCG CGCTGCGCAT CGCGGTGCCG CGGCAGCGCA CGCCTTGGCG CAGCGACGCC ACGCCGCACA TCGACGCTGT CGCCGCGGCC GGCCTGCCGC CGCGCAAGGC CGGCAAGAAT GAACCCACGC CCGGCCAGCG GGCCTGGGCC GAACGCGGTG CGATCACCGC GCTGCGCATC GCCGGCGCGC GCGGCAACAA CCTCAAGGGC GTGAGCGTCG AGATCCCGGT CGGCCTGCTG ACCTGCGTGA CCGGCGTGTC GGGTTCCGGC AAGAGCACGC TGGTCAACGA CACGCTGTAC GCGGCGGTGG CGCGCAAGCT CTACCAGAGC CATCTCGACC CGGCGCCGCA TGACGAGATC GACGGCATCG AGCACTTCGA CAAGGTGATC AACGTCGACC AGAGCCCGAT CGGCCGCACG CCGCGCTCCA ACCCGGCCAC CTACACCGGC CTGTTCACGC CGATCCGCGA GCTGTTCGCC GAGGTGCCGG CGGCGCGCGA GCGCGGCTAC GGACCGGGCC GCTTCTCGTT CAACGTGGCC GGCGGGCGCT GCGAGGCCTG CCAGGGCGAC GGCGTGCTGA AGGTCGAGAT GCACTTCCTG CCCGACGTCT ACGTGCCCTG CGACGTCTGC CATGGCAAGC GCTACAACCG CGAGACGCTC GAGGTGCTCT ACAAGGGCAG GAACATCACC GAGGTGCTGA ACCTCACGGT GGAGGACGCC CACGCCTACT TCAGCGCCGT GCCGAGCATC GCGCGCAAGC TGCAGACGCT GCTCGACGTG GGCTTGGGCT ACATCACGCT GGGCCAGAGC GCCACGACGC TGTCGGGCGG CGAGGCGCAG CGGGTCAAGC TCGCGCTCGA GCTGAGCAAG CGCGACACCG GCCGCACGCT CTACATCCTC GACGAACCGA CCACCGGCCT GCACTTCGCC GACATCGACC TGCTGCTGAA GGTGCTGCAC CAGCTGCGCG ACGCCGGCAA CACCATCGTC GTGATCGAGC ACAACCTCGA TGTCATCAAG ACCGCCGACT GGCTGATCGA CATGGGCCCC GAAGGCGGCG CCGGAGGCGG CCAGGTGGTG GCGGTGGGCA CGCCCGAGGA CGTGGCCGCC AATCCGGCGA GCCACACCGG GCGCTACCTG CAGCCGCTGC TGGGCGAATA G
|
Protein sequence | MLDTPPPDRA AAASAADGHE AAPRRLPAAE PAATIRVRGA RTHNLKNIDL DIPRLQLVVI TGLSGSGKSS LAFDTLYAEG QRRYVESLSA YARQFLQLMD KPDVDVIEGL SPAISIEQKA TSHNPRSTVG TVTEIHDYLR LLYARAGTPF CPDHELPLQA QSVSQMVDAV LALPEDTRLM ILAPVLRERK GEHGELFEDM QARGYVRFRI DGAVHEITDL PKLKKNEKHD IDVVIDRLKV RPEMGQRLAE SFEAALRFAD GRAIAWEMDS GQEHLFSAKF ACPICSYSLA ELEPRLFSFN SPVGACPSCD GLGEVTVFDP ERVVAFPSLS LASGAIKGWD RRNAYTHSML ESVARHYGFD TDTPFEQLAP EHRQVLLHGS GETEIAFVYA SEAASGKKRI VKRSHPFEGI IPSFERRYRE TDSVAVREDL ARYQAAKPCP DCHGTRLRRE ARHVKLVDVA SGTARPIYEI AHATLREAQQ VFDGLRLRGA KAEIADKVVR EISSRLKFLN DVGLNYLSLD RSADTLSGGE AQRIRLASQI GSGLTGVMYV LDEPSIGLHQ RDNDRLIATL KHLRDIGNSV LVVEHDEDMI RAADHVIDMG PGAGVHGGRV IAQGRFADVC AAPESLTGQY LGRALRIAVP RQRTPWRSDA TPHIDAVAAA GLPPRKAGKN EPTPGQRAWA ERGAITALRI AGARGNNLKG VSVEIPVGLL TCVTGVSGSG KSTLVNDTLY AAVARKLYQS HLDPAPHDEI DGIEHFDKVI NVDQSPIGRT PRSNPATYTG LFTPIRELFA EVPAARERGY GPGRFSFNVA GGRCEACQGD GVLKVEMHFL PDVYVPCDVC HGKRYNRETL EVLYKGRNIT EVLNLTVEDA HAYFSAVPSI ARKLQTLLDV GLGYITLGQS ATTLSGGEAQ RVKLALELSK RDTGRTLYIL DEPTTGLHFA DIDLLLKVLH QLRDAGNTIV VIEHNLDVIK TADWLIDMGP EGGAGGGQVV AVGTPEDVAA NPASHTGRYL QPLLGE
|
| |