Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_1430 |
Symbol | |
ID | 4110267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 1553403 |
End bp | 1557974 |
Gene Length | 4572 bp |
Protein Length | 1523 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638030552 |
Product | exonuclease V subunit alpha |
Protein accession | YP_638598 |
Protein GI | 108798401 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member |
TIGRFAM ID | [TIGR02686] conjugative relaxase domain, TrwC/TraI family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.691452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCACGTCA TGGTGATGAC GCTGCATAAG CTGACCGCCG GGGACGGATA CGTGTACCTG GTGCGGCAAG TCGCGGCCGC CGACAGCACC GAACGCGGTC GCTCCCCCTT GGCCGACTAC TACTCGGCCA AGGGCGAATC ACCGGGCCGT TGGACTGGTC GCGGCCTGGC CGCGTTGGCC GACACCGGCG CGCGTGAGGT CAGCGACGAG GTGCGCCAAC AGATGTGGAC CGTCGAAGGC GGATCGGTGG TCACCGAGGA GCAAATGAAG GCCCTGTTTG GGCTGGGTTT GCACCCCAAC GCCGACAGGA TTTCTGACCA TCTGTCCCCG CGACTAAATG TTCGCCCTTC GATTGCGGCC ACCCAGTTGG GCCGCAAATA CGCGGTGCGT GACGAGTCCT CGGAGTTCAC CCGTCGTGTC GGGAAAGCGT TCCGCGCGCA CAACATCGCC GCCGGGCTAC CGGGCGCGGC GACCATCGAT GATGACGTGC GCGCAGGCAT CCGCACCCGC GTCGCCACCG AGATGTTCGC CGAGCAATTC GACCGCGCAC CAGCCGATGC GCGGGAGCTG TCCGGGTTCA TCGCACGCGC CACCCGTGCC CGCACCACAG CGGTGGCCGG CTACGACTTG ACGTTCTCTC CGGTGAAGTC GATTTCGGCG TTGTGGGCAA TTGCCCCGCC CGAGGTTTCC GAGCAGATCG AGGCCGCGCA CGAGGCCGCG GTGGCTGAGG TGTTGGAGTG GCTGCAAGAC AACGCCGCCT TCACTCGAAC GGGCACCAAC GGTGTCGCCC AGGTCGACAC CCAAGGGCTC ATTGCCGCGG TGTTCACCCA TCGCGATTCG CGTGCCGGGG ACCCCGATCT GCACACCCAT GTCGCGATCT CCAACAAGGT GTCTTACCTC GATCACAATG GGGTGCGCCG CTGGCTCGCT TTGGATGGCC AACCCCTGCA CCGGGTCATC GTGGCGGCCT CGGAGATGTA CAACACCCGC ATGGAAGCCC ACACCATCGA CCGCCTCGGT GTGGACTTCG CCGAGACCTC CCGGGGACGC GGGAAACGAA CCGTGCGCGA GATCGTCGGC ATGTCTACGG AATTGATGAC GCGCTGGTCG AGCCGGCGCA CGGCGATCAA GGCCCGCACC GCCGAGTTGG CCAAGGCGTT TCAGCACGAT CACGGCCGCG AACCGACCAC CATCGAGTCC CTCGCGTTGG CCCAACAAGC GACGTTGGAA TCGCGCGAAG CCAAGCACGA ACCGCGCTCG CTGGCCGAAC AGCGCGACAC CTGGCGCCAC CAAGCCGTCG AGGTCCTGGG CCACGACGGA GTAGACCGCA TGCTAGCCAA CGTTCTGACC CCAGAACGCG GCCACACGAC ACCGGAGATC ACCGACGAAT GGGTGGCGGC GCAGGCTGGC GCACTGATTG CCACAGTGTC CGAATCGCGC GCGACCTGGC AGCGTCATCA CGTGCACGCC GAAGCCCTCC GCGTCGTGCG TAACGAGGGT GTCGCACGCG TGTCGCAGCT TGTCGAACGG CTCACCGACA CCGCACTTTC GGAGGCGTTC TCGGTGCCGC ACGCCCGCAC CGCGGACGCC GAGTTGGGCG AGCCTGTCGC ATTGCGTCGC AGCGACGGTT CCAGCGTCTA TAGCCGCCAC GGAACCGCGA CCTACACCAG CCGCGACATC CTCACCGCCG AGCGGCGAAT CCTGGCCGCA GCGCACCAAC TTGATGGCCG TGTCGCGGCC GCGACAGACG TGCGTTTAGC CCTCGCGGAC GCCGCCGCGC ACGGCAAAAA CCTCAACGAC GGCCAAGCCG CTCTGGTAGC CCAAATGGCC CTCGGTGGCC GCCGCGTAGG GCTCGCGTTG GCCCCGGCCG GGGCGGGCAA GACTACCGCC ATGGCCGCCC TGGCGCACGC CTGGCGCAGC TCCGGCGGAC AGGTCCTCGG GCTGGCCCCC ACCGCGGCCG CGGCGATCGT GCTCGGCGAA GACCTGGGCG CAACCACCGA CACTCTCGAG AAGTACGTGC ACTGCACCGA TGAGAACAAC GCCGCGATTT ACGGCACTCC GGACTGGTTC ACCCAGGTTG GCCCCGACAC CCTGATCGTG GTCGACGAGG CCGGGATGGC GTCCACCCCC GGTCTGGACG CCCTGATCAC CCACGCCCTC AGCGCAGGCG CGAGTGTGCG GCTGGTCGGT GATGACGCCC AGCTCGCCTC CATCTCCGCC GGCGGGGTGC TGCGCGACAT CGCCACCGAC ACCGGAGCCC TCACGCTGTC GGAGGTGGTG CGGTTTCACT CCCGCGCCGA AGCCGCCGCC TCGCTGGCGC TGCGCGGTGG GGATCCGGCC GGGATCGGCT TTTACCTCGA CGCCGGCCGC ATCCACATCG GCACCGAACA GACCTCCGCT GACGCCGCCT TCACCGCGTG GCAGGCCGAC CAGGACGCCG GCCGCGACAG CCTGCTGCTG GCGCCCACCA ACGCGATCGT CAACGAACTC AACGTCCGCG CCCGCACCGC ACGGCTGGCC GCGGCGCTCG AGGCAGACCC GCAGGCGACG CCGGGCCGCG AAACCACGCT CTCCGATGGC CTGACCGCCA GCGTCGGCGA TGTCATCCGC ACCCGCCGCA ACAACCGCCA GCTGCCCCTG TCGGGCACCG ATTACGTCCG CAACGGCTAC ACCTTCACCA TCGTCGACAT CGCCACAGAC GGGGCACTGA CGGCGCGCCA CCGCGGCACC GGCCGCGAGA TCACCCTGCC GACGGACTAC GTCACCGACC ACGTGACGCT GGGCTACGCG ACCACCATCG ACGCCGCCCA AGGCTCCACC GCCGGCCACG CCTGCCACGT CGTCGGTGCC GAACACCTGA CCCGCCAGCA CCTCTACGTC GCCCTCACCC GCGGCCGCGC CGAAAACCAC CTCTACCTGT CCACCGCCGA AACCGACCCG CACCGCATCC TCGCCCCGAA AGCCACCCAC CCCGACACCG CCGTCGATGT CCTGTCCCGG ATCGTGGCCC GCGACGGTGC CCAGGTCTCG GCCACCACCG CCGCCCGACA AGCCGCCGAC CCCAGTCGCC GCCTGGCCGC GGCCGCCAAC ATGTTCACCG ACGCCCTCGC CACCGCCGCC GAAAACCAAC TCGGTGAAGG GGCACGTGCA CGCTTGGACG CCCTCGCCGA ATCCGTGCAC CCCGATCTCA CGACCTGCAC CGCATGGCCG GTCCTGCGCC GCAGCCTGGC CATGCGCGCA CTATCCGGCG CCGATCCGGG TCGGCTGCTC ACCGCCGCGT ACGCCCGCGG CGGCCTCGAC GACGCCGCTG ACGCGGCCGC CGTCCTGGAC TACCGCATCG ACCTCACCGA TGGCGACGGC CGCGGCAGCG GGCCGCTGCG CTGGCTGCCC GCTATCCCCA CCGTCCTGGC AGACAACCCG CACTGGGGTG ACTACCTGCA CCGGCGCGAA CACCTCGTCA CCGACCTGGC CGACCAAATC CGCGACCACG CCCGCTCCTG GACCACCGAG ATCGCCCCGC CCTGGGCGCG GCCCTTGATC GCCGCTAACG CCAATCCCAC CCTGACCGCC GAGATCGCCG TGTTCCGGGC CGCCACCGCC GTCACGGCCG CCGACACCCG CCTCACCGGC TCCCCGCAAT ACCCGGTGGC CACCCGCGCC GTGCAAGACC TGCTGCAGCG CCACGGCATC GACGCCATCG CACACAGCCG GCCCGACACC ACACAGTGGC ATGACCTGAT CGACGCGATC GACCCCCGTG TCCGCGCTGA CGAATACTGG CCCACCCTGG CCGCGCAGCT TGCAGAAGCC GCCCCCACCA ACAACATTCC TCAACTACTC GCCGACGCCG CACACCAAGG ACCGCTCCCC GACGAGATGC CCGCCGCCGC CCTCTGGTGG CGCATCACCG AGACGCTGGC CCCCACCGCC GCCGTACCCG AACCACAGGA GTCCGTCGAG CACGCTTCTG ATGCTGCGCT CGTCGAGGTC CTGGCCGCCG CCCACCGCAA CGGCGACCCG CTGCGCCTCC CGGCACCGCA GCCCGATCCT GTCGCCCCCG CACTGACCGC TCCGTATGTC GTGACCACCC TCGCCGCCGA CCACGACACC CCCGAACTGG CCGAAGCGAT CGCGGCGGCC GCCGCCGATG CCGGCCGCAG CTACCTACGC GTGCCCGCCA CACCACAGTC CGACCGCTAC AGCCTGGCCG AGCTCGCCGA CAAGATCACC GACCGCCGGC CACCGCGCGC GGCGCTGGTG GTCGTCGAGG ACGCTGCCGC CGCCGACCCC GCCCAGCTCG CCACGGTGGC CACCGCCTTG GCCGACGCCC ACGGCCGACT CCTGCTCATC GACAACGGTG AGCCCGGACC CGGCCGCCGC CTCCTCGACG GGATGGGACT TCCGTGGAGC GAAAACAGCC GTCCCACAAT CACAATCGAA GACACGGCCC TAGCCGATTC CGCCGACAAC CACCGCGCCG CCGCCGCGAA ACGCTGGCGC ACCCTCACCA ATCGGCATGG GCGCGACCGA GGCCGCGACA GAGACCAGCA ATACGGATTG GACATTGACT GA
|
Protein sequence | MHVMVMTLHK LTAGDGYVYL VRQVAAADST ERGRSPLADY YSAKGESPGR WTGRGLAALA DTGAREVSDE VRQQMWTVEG GSVVTEEQMK ALFGLGLHPN ADRISDHLSP RLNVRPSIAA TQLGRKYAVR DESSEFTRRV GKAFRAHNIA AGLPGAATID DDVRAGIRTR VATEMFAEQF DRAPADAREL SGFIARATRA RTTAVAGYDL TFSPVKSISA LWAIAPPEVS EQIEAAHEAA VAEVLEWLQD NAAFTRTGTN GVAQVDTQGL IAAVFTHRDS RAGDPDLHTH VAISNKVSYL DHNGVRRWLA LDGQPLHRVI VAASEMYNTR MEAHTIDRLG VDFAETSRGR GKRTVREIVG MSTELMTRWS SRRTAIKART AELAKAFQHD HGREPTTIES LALAQQATLE SREAKHEPRS LAEQRDTWRH QAVEVLGHDG VDRMLANVLT PERGHTTPEI TDEWVAAQAG ALIATVSESR ATWQRHHVHA EALRVVRNEG VARVSQLVER LTDTALSEAF SVPHARTADA ELGEPVALRR SDGSSVYSRH GTATYTSRDI LTAERRILAA AHQLDGRVAA ATDVRLALAD AAAHGKNLND GQAALVAQMA LGGRRVGLAL APAGAGKTTA MAALAHAWRS SGGQVLGLAP TAAAAIVLGE DLGATTDTLE KYVHCTDENN AAIYGTPDWF TQVGPDTLIV VDEAGMASTP GLDALITHAL SAGASVRLVG DDAQLASISA GGVLRDIATD TGALTLSEVV RFHSRAEAAA SLALRGGDPA GIGFYLDAGR IHIGTEQTSA DAAFTAWQAD QDAGRDSLLL APTNAIVNEL NVRARTARLA AALEADPQAT PGRETTLSDG LTASVGDVIR TRRNNRQLPL SGTDYVRNGY TFTIVDIATD GALTARHRGT GREITLPTDY VTDHVTLGYA TTIDAAQGST AGHACHVVGA EHLTRQHLYV ALTRGRAENH LYLSTAETDP HRILAPKATH PDTAVDVLSR IVARDGAQVS ATTAARQAAD PSRRLAAAAN MFTDALATAA ENQLGEGARA RLDALAESVH PDLTTCTAWP VLRRSLAMRA LSGADPGRLL TAAYARGGLD DAADAAAVLD YRIDLTDGDG RGSGPLRWLP AIPTVLADNP HWGDYLHRRE HLVTDLADQI RDHARSWTTE IAPPWARPLI AANANPTLTA EIAVFRAATA VTAADTRLTG SPQYPVATRA VQDLLQRHGI DAIAHSRPDT TQWHDLIDAI DPRVRADEYW PTLAAQLAEA APTNNIPQLL ADAAHQGPLP DEMPAAALWW RITETLAPTA AVPEPQESVE HASDAALVEV LAAAHRNGDP LRLPAPQPDP VAPALTAPYV VTTLAADHDT PELAEAIAAA AADAGRSYLR VPATPQSDRY SLAELADKIT DRRPPRAALV VVEDAAAADP AQLATVATAL ADAHGRLLLI DNGEPGPGRR LLDGMGLPWS ENSRPTITIE DTALADSADN HRAAAAKRWR TLTNRHGRDR GRDRDQQYGL DID
|
| |