Gene Information |
Locus tag | Mvan_3734 |
Symbol | |
ID | 4646799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 3968222 |
End bp | 3974080 |
Gene Length | 5859 bp |
Protein Length | 1952 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639807198 |
Product | exonuclease V subunit alpha |
Protein accession | YP_954522 |
Protein GI | 120404693 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member |
TIGRFAM ID | [TIGR02686] conjugative relaxase domain, TrwC/TraI family |
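The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3968222, 3974080)
print(length)                  # 5859
print(protein_length(length))  # 1952
```

Both results agree with the Gene Length and Protein Length values listed in the record.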
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.391856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTGATGA CGCTGCACAA GCTGACCGCC GGGGACGGAT ACCTGTACCT GGTGCGGCAG GTCGCTGCCT CCGACAGCAC CGAGCGGGGC CGCTCCACCC TAGCCGACTA CTACTCCGCC AAGGGGGAAT CGCCGGGACG CTGGATGGGA CGCGGCCTGG CCGCCCTCTC TGACACCGGC CGATGCGAAG TCAGTCCGAC TGCCCGCGAG GAGATTTGGA CCGTCGGGGA AGGATCGGGG GTCAGCGAGG CACAGATGCG CGCCCTCTAC GGCGTGGGTT TGCACCCCAA CGCAGAGCGC ATTCAGACCT ACGTCACCGG CCGCAACATG GGCAACCAGC GCGCGGCCAG CCGCCTGGGC CGCGAGTTCC ATGTCTGTGA CGGCAAGCCG GAGTTCGCAC GCCGTTTGGC AGTGTCGTTC CGCGATCACA ACGCCGAAGT CGGCGCGCAT TGGAACGCCA CCATCGATCC AGAGATTCGC TCAGGCATCC GGACGCGCGT GGCGATGGAG TTGTTCGGCG AGGAGTACGG TCGCCCGCCC GCCGATGACC GCGAACTGTC CGGATTCATC GCCCGCAACA CCCGCGCGAA GACCACCGCT GTGGCCGGCT ACGACCTGAC ATTCTCGCCG GTGAAATCCG TCTCGGCACT GTGGGCTATC GCCCCGTTAC CGGTGTCCGA GCAGATCGAA GCCGCACACG ATGCGGCGGT CGCGGACGTC CTGAAATGGC TGCAAGATCA GGCCGCATTC ACCCGCACCG GCGCCGGTGG TATCGCCCAG GTCGACACCG AAGGGCTCAT CGCCGCGGTG TTCACCCACC GCGATTCCCG CGCCGGGGAT CCCGACCTGC ACACCCACGT CGCGATCTCC AACAAGGTGT CCTATGTCGA CGCCAACGGC GTGCGCCGCT GGCTGGCTCT GGACGGCCAG CCCCTGCACC GTGTCACCGT CGCGGCCTCG GAGTTGTACA ACACCCGACT GGAAGCCCAT TTGATCTCCA GGCTGGGTGT GCGCTTCGCC GAACAGTCCC GTGGGCGGGG CAAACGCCCG GTGCGTGAAA TCGTCGGAAT GTCAGCGAAA TTGATGGAGC GGTGGTCAAG TCGCCGTGCG GCGATCAAGG CCCGCACCGC CGAGCTGGCC AAGCAGTTTC AGGCCGATCA TGGACGCGAG CCAACCAACG TCGAGATCAT CGCGCTGGCC CAGCAGGCAA CCCTGGAATC GCGCGAGGCC AAGCACGAGC CGCGCTCACT CGGCGAGCAG CGCCAGGAGT GGCACGCCCA GGCCGTCGAG GTCCTTGGGG AGCGCGGCGT GAACCAGATT CTGGCCGACA CGTTGGCCGC ACCGCAGACC ACCCGCGAGG CCGTCACGGT CGACGAGGAA TGGATCGCGT CGCGCGCTGG CGAACTCATC GCCACGGTGG CTGAAACCCG CTCCACCTGG CAGCGCCACC ATGTGTGCGC CGAAGCTCTG CGGATGGCTC GCGGCCATGA CGTGGCGCAC GATGTCGCAT TGGTCGAAAG ACTCACCGAT ACCGCCCTCG GAGGGGGCTT CTCGGTGCCG CACGCCCGCG TCGAAGACGC CGAGTTGGGT GAGCCTGTCG CATTGCGTCG CCGCGACGGT GCCAGCGTCT ACCGCCGCCA TGGCGTCGCG CTCTACACCA GCCAGGAGAC ACTGGCCGCC GAGCAGCGAA TCCTGGACGC AGTGCACCGC GGCGACGGGC GTGTCGCAAG CGCCGCGGAC GTCGAAATGG CACTGGCGGA TTCGACGGCG CGAGGGCGCA CACTCAACCC CGGCCAGGCC 
GCACTGGTCA CCGACATGGC CACCGCGGGG CGTCGTGTCG CACTGGCGCT GGCCCCGGCC GGAACCGGCA AGACCACGGC CATGGCAGCG CTGGCGCACG CGTGGCGCAG CTCGGGCGGA CACGTCATCG GATTGGCCCC CACTGCTGAC GCGGCGATCG TGCTTGGCGA GGACCTGGGC GCGACCACCG ACACCCTGGA CAAATACGTG TGGTCGGCCG ATCTCGCCAA GGCCGCGACA TCCTATGTGC CCGACTGGTT CAGCCGTGTC GGACCCGACA CCCTGATCGT GGTCGACGAG GCCGGAAAAG CGGCTACCGC CGGGCTGGAC GCGATGATCC GCGACGCTCT CAAGAAGGGC GCCAGCGTGC GTCTGGTCGG TGATGACGGG CAGCTCTCGT CGATCTCGGC CGGCGGCATT CTGCGTGACA TCGCCGAGGC CACCGATGCC CTCACCTTGA GCGAAGTGGT GCGGTTCAAG TCCCCGGCCG AAGCCGCCGC CGGGCTCGCG CTGCACGACG CCGACCCGGC CGGAATCGGC TTCTACATCG ACCATCACCG CATCCATGTC GGCACCGACG AGACCGCCGC CGATATGGCG TATCAGGCGT GGCGCGCCGA CCTGGCTGCC GGTGGGGATT CGATTCTGCT CGCCCCGACC AACGACGTCA TCAACGAGCT CAACGCCCGC GCCCGACTCG ACCGATTGGC CACCGACCCC GAGGCCGCCA AAGCGCCCAC CGTGGTGCTG GCCGATCAGC TCCACGCCAG CGTCGGAGAC ACCATCCGCA CCCGCAAGAA CAACCGCCGA ATCACGATCG GCCGCAACGA TTTCGTGCGC AACGGCTACC GCTACACCAT CACCGAAGTC CTGCCCGATG GCGGCGTTAA AGCCCGCCAT CTGCGCAGCG GGCGCATCGT GACGCTGCCC GCGGACTACA TCGCAGAACA CGTCACGCTG GGCTATGCGG CCACCATCGA CTCAGCACAG GGTTTGACCG CCGGCGGCCG CGACACCAGG GGCACCTGCC ACATCGTCGG CTCCGACATG CTGACCCGCC AGCACCTTTA CGTCGCGTTG ACCCGCGGCA CCGACGAAAA CCACCTCTAC CTGTCCACGG CCGAGGGCGA CCCGCACCGG CTGCTGTCCC CCAAGGCCAC CCACCCCGAC ACCGCGGTCG ACGTGCTGAC CAGGATCTTG GCCCGCGACG GCGCCCAGGT CTCGGCGACC ACCGCCGCCC GCCAAGCCGC CGATGCGGCC ATGCGTCTGC AGGCGGCGGC CGATATGTAC TACGACGCGC TCGGTGCCGC CGCCGAGAAC CGTCTCGGCG TCGGTGCCCG TGATCGTCTC GATGCCCTTG CCGACGCGGT GATCCCGCAG TTGAGCCAGC GTGAGGCCTG GCCAGTGCTG CGCCGCAACC TGTCCGTCCT GGCCCTGCGT GGGGCGGACC CGCGCGAGCT GCTGACCGAG GCGCTGGCCA AGGGCAGCGT CGCCGACGCA GCGGACCCCG CCGCGGTGCT CGATCACCGC ATCGACCCCG CCGGCACGCA TTCCTCCGGC ATCGGGGTGC TGCGCTGGCT GCCCGCCGTG CCCGAGGCGC TGGCCGACGA TCCGCAATGG GGCGCCTATC TGGCCCGCCG CGAGAAGCTG GTCGAGGACC TGGCCGAAGA GATCCGCGAA CGCGCACGCG GCTGGACCAA TGCCACCGCA CCCGCCTGGG CGCGCCCCCT GATCACGGTA AATCCGGTGC TGACCGCTGA AATCGCGGTG TTCCGCGCCG CCACCGGGGT AGCCGAAGCC GACACCCGCA 
TCACCGGCGC TCGCCAGTTC CCGGTGCGCA CGCGCGGTGT GCAGGCAGCG CTGCAGCGCC AGGCCGCCGC CGACATCGGC CGCCACAGCG CCGACACCTC CCGCTGGAAC GAACTCATCG ACGCCATCGA CCCGCGCCTA CGCTCGGATG CCTACTGGCC ACAGCTGGCC GACCAGCTGG TCAAGGCCGC CCGCATCACT CCGCAACTGC GTCAGATCAT TACGACCGCA GCACGGCAAG GACCGCTGCC CGACGAGCTG CCCGCGGCAG CGCTGTGGTG GCGCATTTCC GGTGCGCTGT CCCCGACCGC GACCCTGGCC ACCACCCATT CGCGGCTACG CCCGCCCTGG GTCACCGATA TCGACGTCGT GTTCGGAACC GCGCTCGCCG AAATCATCAC CTCCGATCCC GCCTGGCCTG GGCTGGTCGC CGCGATCAGC GCGGCCGACC CGCAGAAGTG GACACCGCGG GATCTGCTGC ACGTCGCCGC CGAGCAGCTC GCCGACGCCA CCGACGAGGA CCATCTGATC CCGCCCGGCG ATTACGCCCG GCTCATCACC TACACCGTCG ATGCCTTCAC CCATCGCCTA CAAGCCCGCC TCGGCGTGGA CTTCGCGGAC CTACCCACGC CCGAAGACCC ACCGATCGAT CCCGCCGAGG AAGCACTGTT CCCGCCCGAC CCGCAAGATC CCTACGCGCA CGTCGAAGAG CACTCGCCCT TCGACGACTA CTTCGACGTC GCCCCGCCCG AAGACGTCGA CTCCTTCGAA TACAGGGCAG AAGAGCATGC CGGTCTGCAA TTTGAAGACC TCTCCGCAAA TCGCCCTACA CCCGAATTGG GCATCACTAT GGAGGTATTC CTTGCGCGGC TCACCGAGTA CCGGAATGTC TGCGACGAAA TCAAAACTCT GGCCGCCGAT ATCCGCGCCG GAAATGGCTC GGCGCTGCGC GCCGCCGCCG ACGATCTGGT GCGCATGCGC CACCAGGTCG ACGCCGACCG CCCCTACAGC CACGCCGTCA CCGACGTGAT GGAGCAATGG TCCGACGCTG ATGCCCGCTA CAACGACACC CTGCGCCTCA TCGAACATTC CCGCAGCCAG CTCGATGTCC TGCTGGCCAC CCCCGACGCC GAGGAACTCG ACATCATCTC GGCCCGCCAA CAGATCGCGT TCTACACCGA CCTGCTCCCC GACCAGCCGC CTTCATTGCA GTTCCAGCAG GCACTCGCCG ACGCCCAGGC TGCCCGCGCT GCGGCCGCCG GCGGCGCCGA CAAGATCGTC ACCGAGCGCG ACATCGCCGC CGCCCGCGGC GCCGCCGAAC GCGCCGACAT CGCCGCGCTC AACGCGCTGC GGGTGCGGCG CCCGGTGCTG CGCCGCGAGC TCGAACGAGC CGAGCGCGAC ATTGCCACCG CATTCGCGGC CGCACAGACC TCCACCTCCG ACACCCTCGA ACAACTCCTT GAGTCCGCAC GCTCGGAAGT CGCACTACTC CACGTCGCCG GCCACCTCGA CCCCGAGCAC ACTCCCCTGC TCATCCCCGA CGCCGCACTG TCCGGCCTCG AACCCCAGAC CGCCGACCGC CTCAAATCCG TGGCCGCACA GCCCTATCGG CTCGCCGTCA TCCGCGCCGA CATCACCGAC CGCGAAACCG TCGCAGCCCT CTACACCCTG CGCAACGCCG CCAGCGCCGA AGACCGCAAG GTGCTCTGGC TCTCGCCCAC CGAGGCCAGG TCGACCCCGG CCCACGACGC CGAGCTGGCT GACACCATCA CCACCATCGA GCACGCCCGC CACCAGGTCC GTGAGCAGCA 
GTGGGCACTG CCGCGCGGAG CCATCATCGT CATCGATGAC CCCGCGGCCG CCGAGCCCGA CCAGCTCGTT GACCTCGCCC GCCACGCTGC GGCCGCAGAC GCCCGACTCA TCCTCCTCGA CCCCGGCACC CGCCGCGGGC CCAGCTCGTC CGCGGTGCAG CTCCTCACGC AATCGCTGCC CTGGAACAAC GCACTCGCGA AAACCCCGTC AGCGCCCAAA GATCCGCTCC TGGCGTCGAC GCCGGCGGTC ACGCTGGCCG ACCGCCTCGG CCGCAAGCGC CTCAGCGAAC CATGGCAGCA ACTGCTCACC CAATACGACA CCGCGACCCG CGCGGTCCGC TCCGCACAGC GTCGTCAACT CGCCCGGGGA TGGCGCAGCA CCGACCTCGA GGCCTCAAAG GACCGCGACC GCACCCTGGG CGCCGGGATC GACGACTAA
|
Protein sequence | MVMTLHKLTA GDGYLYLVRQ VAASDSTERG RSTLADYYSA KGESPGRWMG RGLAALSDTG RCEVSPTARE EIWTVGEGSG VSEAQMRALY GVGLHPNAER IQTYVTGRNM GNQRAASRLG REFHVCDGKP EFARRLAVSF RDHNAEVGAH WNATIDPEIR SGIRTRVAME LFGEEYGRPP ADDRELSGFI ARNTRAKTTA VAGYDLTFSP VKSVSALWAI APLPVSEQIE AAHDAAVADV LKWLQDQAAF TRTGAGGIAQ VDTEGLIAAV FTHRDSRAGD PDLHTHVAIS NKVSYVDANG VRRWLALDGQ PLHRVTVAAS ELYNTRLEAH LISRLGVRFA EQSRGRGKRP VREIVGMSAK LMERWSSRRA AIKARTAELA KQFQADHGRE PTNVEIIALA QQATLESREA KHEPRSLGEQ RQEWHAQAVE VLGERGVNQI LADTLAAPQT TREAVTVDEE WIASRAGELI ATVAETRSTW QRHHVCAEAL RMARGHDVAH DVALVERLTD TALGGGFSVP HARVEDAELG EPVALRRRDG ASVYRRHGVA LYTSQETLAA EQRILDAVHR GDGRVASAAD VEMALADSTA RGRTLNPGQA ALVTDMATAG RRVALALAPA GTGKTTAMAA LAHAWRSSGG HVIGLAPTAD AAIVLGEDLG ATTDTLDKYV WSADLAKAAT SYVPDWFSRV GPDTLIVVDE AGKAATAGLD AMIRDALKKG ASVRLVGDDG QLSSISAGGI LRDIAEATDA LTLSEVVRFK SPAEAAAGLA LHDADPAGIG FYIDHHRIHV GTDETAADMA YQAWRADLAA GGDSILLAPT NDVINELNAR ARLDRLATDP EAAKAPTVVL ADQLHASVGD TIRTRKNNRR ITIGRNDFVR NGYRYTITEV LPDGGVKARH LRSGRIVTLP ADYIAEHVTL GYAATIDSAQ GLTAGGRDTR GTCHIVGSDM LTRQHLYVAL TRGTDENHLY LSTAEGDPHR LLSPKATHPD TAVDVLTRIL ARDGAQVSAT TAARQAADAA MRLQAAADMY YDALGAAAEN RLGVGARDRL DALADAVIPQ LSQREAWPVL RRNLSVLALR GADPRELLTE ALAKGSVADA ADPAAVLDHR IDPAGTHSSG IGVLRWLPAV PEALADDPQW GAYLARREKL VEDLAEEIRE RARGWTNATA PAWARPLITV NPVLTAEIAV FRAATGVAEA DTRITGARQF PVRTRGVQAA LQRQAAADIG RHSADTSRWN ELIDAIDPRL RSDAYWPQLA DQLVKAARIT PQLRQIITTA ARQGPLPDEL PAAALWWRIS GALSPTATLA TTHSRLRPPW VTDIDVVFGT ALAEIITSDP AWPGLVAAIS AADPQKWTPR DLLHVAAEQL ADATDEDHLI PPGDYARLIT YTVDAFTHRL QARLGVDFAD LPTPEDPPID PAEEALFPPD PQDPYAHVEE HSPFDDYFDV APPEDVDSFE YRAEEHAGLQ FEDLSANRPT PELGITMEVF LARLTEYRNV CDEIKTLAAD IRAGNGSALR AAADDLVRMR HQVDADRPYS HAVTDVMEQW SDADARYNDT LRLIEHSRSQ LDVLLATPDA EELDIISARQ QIAFYTDLLP DQPPSLQFQQ ALADAQAARA AAAGGADKIV TERDIAAARG AAERADIAAL NALRVRRPVL RRELERAERD IATAFAAAQT STSDTLEQLL ESARSEVALL HVAGHLDPEH TPLLIPDAAL SGLEPQTADR LKSVAAQPYR LAVIRADITD RETVAALYTL RNAASAEDRK VLWLSPTEAR STPAHDAELA DTITTIEHAR 
HQVREQQWAL PRGAIIVIDD PAAAEPDQLV DLARHAAAAD ARLILLDPGT RRGPSSSAVQ LLTQSLPWNN ALAKTPSAPK DPLLASTPAV TLADRLGRKR LSEPWQQLLT QYDTATRAVR SAQRRQLARG WRSTDLEASK DRDRTLGAGI DD
|
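The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be joined, scored for GC content, and translated. It rests on stated assumptions: the helper names are hypothetical, the codon table is built from the standard NCBI amino-acid ordering (which translation table 11 shares for elongation codons), and alternative start codons are not given special treatment. The sample input is the first three blocks of the gene sequence.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


head = join_blocks("ATGGTGATGA CGCTGCACAA GCTGACCGCC")
print(translate(head))              # MVMTLHKLTA
print(round(gc_fraction(head), 2))  # 0.6
```

The translated fragment matches the first block of the protein sequence in the record. Running `gc_fraction` on the full joined gene sequence would give the value to compare against the GC content field.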