Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_0037 |
Symbol | |
ID | 4644891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 45645 |
End bp | 48830 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639803548 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_950894 |
Protein GI | 120401065 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACT TCAGCGAAGC CGAATGGGAA TCTCTCGCGC TGGAGACGCT GGCCCAGCAG GAATGGTTGC CGCTGAATGG TTCCGCGATC GCCCCGGGCA CCGAGAACGG TCGCGCCAGC TGGGACGAGT TGGTGCTCCC GGACCGGATG CTGGCCAAGC TGCGCGAACT CAACGCCCAT GTGCCCGGCG AATACCTCGA ACAAGCTCGT GCGGCAATCC TGCAGCCGTC CTCCCAAGAC GCCATCGCCG AGAACTACCG ACTGCACCAG TACCTGGTCG GGGGTTACCG AGGCATCAGC TACGTGGATT CCGATGGCAT CGAGCAGAAT CCGACGATCC GGCTCATCAG CCACCGGCCC GAGGAGAACG AACTCCTCGC CGTGCAGCAG GTCACCATCC GCGACGCCGA ACACGACCGC CGATTCGACA TCGTTCTGTA CCTCAACGGC ATGCCGATCG CGTTCTTCGA ACTCAAGCAG GCCGGCTCGA AGTACGCGGA CCTGCCCGGC GCGCACGCCC AATTCGCCAC TTACCTGCGA GAATTCCCGA TGGCGTTCCG GTTCGCGGTG CTCAACGTCA TCAGCGACGG TCTCACCGCC CGCTACGGGA CGCCGTTCAC ACCGCTGGAG CACTTTGCGC CCTGGAACGT CGACGACGAC GGCAAGCCGG TCGCGTTCGG TGATCCCGTC GACGATGTGC ACCTGGGCAC CGAACTCGAG TATCTGATCG ACGGCCTCTT CAACCCCGAG CGCTTCCTAC AACTAGTCCG CAACTTCACC GCCTTCGACG CCGGGGCCGA CGGATTGATC AAACGGATCG CCAAGCCGCA CCAGTACTTT GCCGTTACCA AGGCCGTCGG CTCGACGGTC ACCGCCGCTG AGAGCAACGG CAAGGCCGGA GTCGTCTGGC ACACCCAGGG CTCTGGAAAG TCCATGGAGA TGGAGCTTTA CACCCATCTG GTGGCCCAAC AGCCCAAACT GAAGAACCCG ACAGTCGTCG TCGTCACCGA CCGCAAGGAC CTCGACGGCC AGCTCTACGC AACTTTCGAC CGCTCAAAGC TTCTCGGTGA ATCACCGGTC AAGGTCACCA CGCGTTCCCA GCTGCGTGAC GAGTTGTCCA ATCGCACCAC CGGCGGCATC TACTTCACCA CATTGCAGAA GTTCGGACTG TCGAAGGCAG AACGGGAGAG TGGCGCCGAC CATCCGCTGC TGACCGACCG CCGCAACATC ATCGTCGTCG TCGACGAAGC CCACCGCAGC CACTACGACG ACCTCGACGG GTATGCCCGG CACATCCGCG ATGCGCTGCC CAACGCGGTG TACATCGCCT TCACCGGTAC CCCGATCTCC GAAGCCGACC GCGACACCCG CGACGTCTTC GGACCCGACA TCGACGTCTA CGACCTGACC CGCGCAGTCA ACGACAAGGC CACGGTACCA GTGTTTTTCG AGCCGCGGCT CATCAAGGTC GCCCTCGCCC AGGGCGTCAC CGAGGATGAC CTCGACAAGG CCGCCGATGA GGTCACCGCC GGCCTCGACG ACGTCGAACG CGATCAGATC GAGAAGTCGG TCGCCGTCAT CAACGCGGTT TACGGCGCGC CGGACCGACT GGCGGCGTTG GCCCGCGACA TCGTCGACCA TTGGGAAGTT CGGTCGCAGG AGATGCGTAA GTTCATCTCC TGCCCCGGCA AGGCATTCAT CGTCGGAGCG ACGCGCGAGA TCTGTGCCGA GCTCTACGAA GAGATCGTCA AACTCAAGCC GGAATGGCAT GACGATGCCG TCGATAAGGG CGTCATCAAG GTCGTCTACT CGGGTAGCGC CAAGGACCAG GGTTTGGTCG CCAAACACGT TCGCCGGGAT GGGCAGAACA AGACCATTCA GCAGCGGCTG CGAGACCCCG ATGACGAACT GCAGATCGTG CTCGTCAAGG ACATGCTGCT GACGGGTTTC GACGCGCCGC CGCTGCACAC CCTCTACCTG GACCGCCCAC TCAAGGGTGC GCTGTTGATG CAGACGCTGG CCCGGGTGAA CCGGACCTTC CGGGAGAAGC CCAACGGGCT GCTGGTCGCC TACGCTCCGC TGGTCGAGAA CCTGAACAAG GCCCTGGCGG AGTACACCCA GACCGACCGC ACCGAGAAGC CCGTCGGCAA GAACATCGAC GAGGCCGTCG CGCTGACCGA AACCCTGATC GCCCAGCTGG ATGCGCTTTG CGCCGGGTAC GACTGGCGCG CCAAAGTCGC CCAGCCGCAC GGTTGGATGA AGGCGGCCGT GGGATTGACC AACTACCTGC GCTCGCCGGC GACTGCCGGC AACCAGGTCG CCGAAGGCGA GGCGACGGTC TCAGACAGGT TCCGGGCACT GGCCAACCAG CTGTCGCGCG CCTGGTCGCT GTGCGCTGGA AACCAGGCAC TCGATGCGCT GCGCCCGACC GCCAAGTTCT ACGAAGAAGT CCGGGTGTGG ATGGCGAAAT TCGACGCCAA CGACCGGCAA GCATCCGGAA AGCCAGTGCC AGAGGCGATA CGACGGATGC TCGAGTCCCT GGTCTATGAC TCGACCGCCT CCGACGGCAT CGTCGACATC TACGACGCCG CCGGATTGCC CAAGCCATCG CTGTCGGATC TGACGCCCGA GTTCGAGGCG AAGGCTGCGT CCGCGAGCAA CCCACACCTG GCGATCGAAG CGTTACGTGC GGTGATCACC GAGGAAGCAG TGCGCGCCAC CAAGAGCAAC GTGGTGCGGC AGCGGGCTTT CTCCGAACGG CTCACCGACC TGATGCGGCG CTACACCAAT CAGCAGCTCA CCTCGGCCGA GGTGATCGCC GAGTTGATCG AAATGGCCAA GGAGGTTGCG GCAGAGAGCA ATCGCGGCGC GCATTTCAGC CCGCCGCTGT CCCATGACGA ACTGGCGTTC TACGACGCCG TCGCGCAGAA CGAGTCCGCC GTCGAGTTGC AGGGTGAAGA CGTGCTAGCG CAGATCGCCC GGGAGCTCGT CGGGGTCATG CAACGTGATA CCAAAACCGA CTGGACCGTG CGCGACGACG TTCGGGCCAA GCTGCGTTCT TCGATCAAGC GGCTGCTGGT GAAGTACAAG TACCCGCCAG ACAAGCAACC CGAAGCGATC AAGCTGGTAA TCGAACAGAT GGAAGCGCTG GCACCCGGGT ATGCGGATGC CGCGAGGGCG GGGTAG
|
Protein sequence | MTDFSEAEWE SLALETLAQQ EWLPLNGSAI APGTENGRAS WDELVLPDRM LAKLRELNAH VPGEYLEQAR AAILQPSSQD AIAENYRLHQ YLVGGYRGIS YVDSDGIEQN PTIRLISHRP EENELLAVQQ VTIRDAEHDR RFDIVLYLNG MPIAFFELKQ AGSKYADLPG AHAQFATYLR EFPMAFRFAV LNVISDGLTA RYGTPFTPLE HFAPWNVDDD GKPVAFGDPV DDVHLGTELE YLIDGLFNPE RFLQLVRNFT AFDAGADGLI KRIAKPHQYF AVTKAVGSTV TAAESNGKAG VVWHTQGSGK SMEMELYTHL VAQQPKLKNP TVVVVTDRKD LDGQLYATFD RSKLLGESPV KVTTRSQLRD ELSNRTTGGI YFTTLQKFGL SKAERESGAD HPLLTDRRNI IVVVDEAHRS HYDDLDGYAR HIRDALPNAV YIAFTGTPIS EADRDTRDVF GPDIDVYDLT RAVNDKATVP VFFEPRLIKV ALAQGVTEDD LDKAADEVTA GLDDVERDQI EKSVAVINAV YGAPDRLAAL ARDIVDHWEV RSQEMRKFIS CPGKAFIVGA TREICAELYE EIVKLKPEWH DDAVDKGVIK VVYSGSAKDQ GLVAKHVRRD GQNKTIQQRL RDPDDELQIV LVKDMLLTGF DAPPLHTLYL DRPLKGALLM QTLARVNRTF REKPNGLLVA YAPLVENLNK ALAEYTQTDR TEKPVGKNID EAVALTETLI AQLDALCAGY DWRAKVAQPH GWMKAAVGLT NYLRSPATAG NQVAEGEATV SDRFRALANQ LSRAWSLCAG NQALDALRPT AKFYEEVRVW MAKFDANDRQ ASGKPVPEAI RRMLESLVYD STASDGIVDI YDAAGLPKPS LSDLTPEFEA KAASASNPHL AIEALRAVIT EEAVRATKSN VVRQRAFSER LTDLMRRYTN QQLTSAEVIA ELIEMAKEVA AESNRGAHFS PPLSHDELAF YDAVAQNESA VELQGEDVLA QIARELVGVM QRDTKTDWTV RDDVRAKLRS SIKRLLVKYK YPPDKQPEAI KLVIEQMEAL APGYADAARA G
|
| |