Gene Mvan_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0037 
Symbol 
ID4644891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp45645 
End bp48830 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content64% 
IMG OID639803548 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_950894 
Protein GI120401065 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACT TCAGCGAAGC CGAATGGGAA TCTCTCGCGC TGGAGACGCT GGCCCAGCAG 
GAATGGTTGC CGCTGAATGG TTCCGCGATC GCCCCGGGCA CCGAGAACGG TCGCGCCAGC
TGGGACGAGT TGGTGCTCCC GGACCGGATG CTGGCCAAGC TGCGCGAACT CAACGCCCAT
GTGCCCGGCG AATACCTCGA ACAAGCTCGT GCGGCAATCC TGCAGCCGTC CTCCCAAGAC
GCCATCGCCG AGAACTACCG ACTGCACCAG TACCTGGTCG GGGGTTACCG AGGCATCAGC
TACGTGGATT CCGATGGCAT CGAGCAGAAT CCGACGATCC GGCTCATCAG CCACCGGCCC
GAGGAGAACG AACTCCTCGC CGTGCAGCAG GTCACCATCC GCGACGCCGA ACACGACCGC
CGATTCGACA TCGTTCTGTA CCTCAACGGC ATGCCGATCG CGTTCTTCGA ACTCAAGCAG
GCCGGCTCGA AGTACGCGGA CCTGCCCGGC GCGCACGCCC AATTCGCCAC TTACCTGCGA
GAATTCCCGA TGGCGTTCCG GTTCGCGGTG CTCAACGTCA TCAGCGACGG TCTCACCGCC
CGCTACGGGA CGCCGTTCAC ACCGCTGGAG CACTTTGCGC CCTGGAACGT CGACGACGAC
GGCAAGCCGG TCGCGTTCGG TGATCCCGTC GACGATGTGC ACCTGGGCAC CGAACTCGAG
TATCTGATCG ACGGCCTCTT CAACCCCGAG CGCTTCCTAC AACTAGTCCG CAACTTCACC
GCCTTCGACG CCGGGGCCGA CGGATTGATC AAACGGATCG CCAAGCCGCA CCAGTACTTT
GCCGTTACCA AGGCCGTCGG CTCGACGGTC ACCGCCGCTG AGAGCAACGG CAAGGCCGGA
GTCGTCTGGC ACACCCAGGG CTCTGGAAAG TCCATGGAGA TGGAGCTTTA CACCCATCTG
GTGGCCCAAC AGCCCAAACT GAAGAACCCG ACAGTCGTCG TCGTCACCGA CCGCAAGGAC
CTCGACGGCC AGCTCTACGC AACTTTCGAC CGCTCAAAGC TTCTCGGTGA ATCACCGGTC
AAGGTCACCA CGCGTTCCCA GCTGCGTGAC GAGTTGTCCA ATCGCACCAC CGGCGGCATC
TACTTCACCA CATTGCAGAA GTTCGGACTG TCGAAGGCAG AACGGGAGAG TGGCGCCGAC
CATCCGCTGC TGACCGACCG CCGCAACATC ATCGTCGTCG TCGACGAAGC CCACCGCAGC
CACTACGACG ACCTCGACGG GTATGCCCGG CACATCCGCG ATGCGCTGCC CAACGCGGTG
TACATCGCCT TCACCGGTAC CCCGATCTCC GAAGCCGACC GCGACACCCG CGACGTCTTC
GGACCCGACA TCGACGTCTA CGACCTGACC CGCGCAGTCA ACGACAAGGC CACGGTACCA
GTGTTTTTCG AGCCGCGGCT CATCAAGGTC GCCCTCGCCC AGGGCGTCAC CGAGGATGAC
CTCGACAAGG CCGCCGATGA GGTCACCGCC GGCCTCGACG ACGTCGAACG CGATCAGATC
GAGAAGTCGG TCGCCGTCAT CAACGCGGTT TACGGCGCGC CGGACCGACT GGCGGCGTTG
GCCCGCGACA TCGTCGACCA TTGGGAAGTT CGGTCGCAGG AGATGCGTAA GTTCATCTCC
TGCCCCGGCA AGGCATTCAT CGTCGGAGCG ACGCGCGAGA TCTGTGCCGA GCTCTACGAA
GAGATCGTCA AACTCAAGCC GGAATGGCAT GACGATGCCG TCGATAAGGG CGTCATCAAG
GTCGTCTACT CGGGTAGCGC CAAGGACCAG GGTTTGGTCG CCAAACACGT TCGCCGGGAT
GGGCAGAACA AGACCATTCA GCAGCGGCTG CGAGACCCCG ATGACGAACT GCAGATCGTG
CTCGTCAAGG ACATGCTGCT GACGGGTTTC GACGCGCCGC CGCTGCACAC CCTCTACCTG
GACCGCCCAC TCAAGGGTGC GCTGTTGATG CAGACGCTGG CCCGGGTGAA CCGGACCTTC
CGGGAGAAGC CCAACGGGCT GCTGGTCGCC TACGCTCCGC TGGTCGAGAA CCTGAACAAG
GCCCTGGCGG AGTACACCCA GACCGACCGC ACCGAGAAGC CCGTCGGCAA GAACATCGAC
GAGGCCGTCG CGCTGACCGA AACCCTGATC GCCCAGCTGG ATGCGCTTTG CGCCGGGTAC
GACTGGCGCG CCAAAGTCGC CCAGCCGCAC GGTTGGATGA AGGCGGCCGT GGGATTGACC
AACTACCTGC GCTCGCCGGC GACTGCCGGC AACCAGGTCG CCGAAGGCGA GGCGACGGTC
TCAGACAGGT TCCGGGCACT GGCCAACCAG CTGTCGCGCG CCTGGTCGCT GTGCGCTGGA
AACCAGGCAC TCGATGCGCT GCGCCCGACC GCCAAGTTCT ACGAAGAAGT CCGGGTGTGG
ATGGCGAAAT TCGACGCCAA CGACCGGCAA GCATCCGGAA AGCCAGTGCC AGAGGCGATA
CGACGGATGC TCGAGTCCCT GGTCTATGAC TCGACCGCCT CCGACGGCAT CGTCGACATC
TACGACGCCG CCGGATTGCC CAAGCCATCG CTGTCGGATC TGACGCCCGA GTTCGAGGCG
AAGGCTGCGT CCGCGAGCAA CCCACACCTG GCGATCGAAG CGTTACGTGC GGTGATCACC
GAGGAAGCAG TGCGCGCCAC CAAGAGCAAC GTGGTGCGGC AGCGGGCTTT CTCCGAACGG
CTCACCGACC TGATGCGGCG CTACACCAAT CAGCAGCTCA CCTCGGCCGA GGTGATCGCC
GAGTTGATCG AAATGGCCAA GGAGGTTGCG GCAGAGAGCA ATCGCGGCGC GCATTTCAGC
CCGCCGCTGT CCCATGACGA ACTGGCGTTC TACGACGCCG TCGCGCAGAA CGAGTCCGCC
GTCGAGTTGC AGGGTGAAGA CGTGCTAGCG CAGATCGCCC GGGAGCTCGT CGGGGTCATG
CAACGTGATA CCAAAACCGA CTGGACCGTG CGCGACGACG TTCGGGCCAA GCTGCGTTCT
TCGATCAAGC GGCTGCTGGT GAAGTACAAG TACCCGCCAG ACAAGCAACC CGAAGCGATC
AAGCTGGTAA TCGAACAGAT GGAAGCGCTG GCACCCGGGT ATGCGGATGC CGCGAGGGCG
GGGTAG
 
Protein sequence
MTDFSEAEWE SLALETLAQQ EWLPLNGSAI APGTENGRAS WDELVLPDRM LAKLRELNAH 
VPGEYLEQAR AAILQPSSQD AIAENYRLHQ YLVGGYRGIS YVDSDGIEQN PTIRLISHRP
EENELLAVQQ VTIRDAEHDR RFDIVLYLNG MPIAFFELKQ AGSKYADLPG AHAQFATYLR
EFPMAFRFAV LNVISDGLTA RYGTPFTPLE HFAPWNVDDD GKPVAFGDPV DDVHLGTELE
YLIDGLFNPE RFLQLVRNFT AFDAGADGLI KRIAKPHQYF AVTKAVGSTV TAAESNGKAG
VVWHTQGSGK SMEMELYTHL VAQQPKLKNP TVVVVTDRKD LDGQLYATFD RSKLLGESPV
KVTTRSQLRD ELSNRTTGGI YFTTLQKFGL SKAERESGAD HPLLTDRRNI IVVVDEAHRS
HYDDLDGYAR HIRDALPNAV YIAFTGTPIS EADRDTRDVF GPDIDVYDLT RAVNDKATVP
VFFEPRLIKV ALAQGVTEDD LDKAADEVTA GLDDVERDQI EKSVAVINAV YGAPDRLAAL
ARDIVDHWEV RSQEMRKFIS CPGKAFIVGA TREICAELYE EIVKLKPEWH DDAVDKGVIK
VVYSGSAKDQ GLVAKHVRRD GQNKTIQQRL RDPDDELQIV LVKDMLLTGF DAPPLHTLYL
DRPLKGALLM QTLARVNRTF REKPNGLLVA YAPLVENLNK ALAEYTQTDR TEKPVGKNID
EAVALTETLI AQLDALCAGY DWRAKVAQPH GWMKAAVGLT NYLRSPATAG NQVAEGEATV
SDRFRALANQ LSRAWSLCAG NQALDALRPT AKFYEEVRVW MAKFDANDRQ ASGKPVPEAI
RRMLESLVYD STASDGIVDI YDAAGLPKPS LSDLTPEFEA KAASASNPHL AIEALRAVIT
EEAVRATKSN VVRQRAFSER LTDLMRRYTN QQLTSAEVIA ELIEMAKEVA AESNRGAHFS
PPLSHDELAF YDAVAQNESA VELQGEDVLA QIARELVGVM QRDTKTDWTV RDDVRAKLRS
SIKRLLVKYK YPPDKQPEAI KLVIEQMEAL APGYADAARA G