Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_0024 |
Symbol | |
ID | 4108912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 29896 |
End bp | 33222 |
Gene Length | 3327 bp |
Protein Length | 1108 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638029150 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_637202 |
Protein GI | 108797005 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.955051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGCAA GGGGTCCGGA GTACGACTAC GTCGAGAAGC CCAGCATGGA GTTGTTGGCT GAGCTCGGCT GGAACCCGGT CGATGCTTTC CACGAAATCC TCGGCGCCGA GGGCACCCTT GGACGTGACT CCCAGCACGA CGTGATTCTG ACCCATCGCC TGCGCCTAGC GATGCGGAAC CTAAACGCCG AAGACGTGCC GGACCTCTCC ATCAACGAGG CGATCGAGGC GTTGACCAAG GACCGTTCGG TGATGGATCG GGTCCGCGCC AACCGTGAGG TTTACGACCT GCTGCGCGAC GGATACCAGG CCGAGTGGGA AGATGACAAC GGCGACAAGC GAATCGAATT GATCCGCTAC ATCGATCTGC GAGACAAACT GAACAATGAC CTGCTAGCGG TCCAGCAGAT GTGGGTTAAG GGCCCGTTGC ACAGCCGTCG CCTCGACGTC GCCCTGTTCG TTAACGGCGT ACCGCTGGCA GTGCTGGAGT TCAAGGAGCC GAATGCGCCG GTCAAGTCGG CCTACGACGA CAACGTCACC GACTATCGCG ACACCATCCC GCAGCTCTTC ATACCGAACT GTCTTGTCCT GCTGTCCAAT GGAAGTGAGG CGAAAGTTGG GTCGACGTAT GCGCCGTGGG ACTTCTTCTC GGACTGGAAG GTCATTGACG CCTTCGGAGC GCGTAGTGAG ATCGCGTTGC AGACAGCTCT ACGGGGCACC TGCGATCCTG CGAACCTGCT CGACCTTTTC GAGAGCTTCG TAGCTTACAT GGAACGTCCC GGCGGATTGG TCAAGATCGT CGCGCGTTCG CATCAGTATC TCGGCGTCAA CGCTGCGATC GAGAATCTTA ATCGGGCGCG CGCTGTGCAC GACAAACGCC TCGGCGTCTT TTGGCATACC CAGGGTTCGG GCAAAAGCCT GTCGATGTTG TGGTTCACCC AGAAGGTGCT CCGGCACGTC CGGGGCAAGT GGACTTTCGT AATGGTTACT GACAGAACCG AACTCGACAC GCAACTCCAC GGAGAGTTCG CTGATGCCGG CGCGATCCCG CCCGAAGCGC GGGTCCACGC AGACTCCATT GCGCATCTGC GTGAACTGTT GGAGGCCGAC CACCGGTACG TCTTCACGCT TATCCAGAAA TTTCAGCCAG CTAAGGGTGA ACGCCAGATG CCGGTCTTGT CGGAGCGGTC AGACATCATC GTCATCACCG ATGAGGCACA CCGCAGCCAG TACGACACCC TCGCGCTCAA CATGCGAACC GCGCTGCCCA ACGCCTCGAT GATGGGCTTT ACCGGGACCC CACTGATTGC CGGCGAAGAG CAGGCGACCC GTCAGCAGTT TGGTGACTAC GTCAGCATCT ACAACTTCGG CGACGCCATC GAAGACGGCG CCACGGTCCC CCTCTACTAC GAAAATCGCA TTCCCGAGCT ACAACTCACG AACGCCGAGT TCGCCGATGA ACTTGACGCG CTCCTGGAAA AAGCCGAGCT CGATGAAGAC GCGGAGGGCG CGCTGGCACG CAAATTCGGG ACCCAGTACA CCCTGCTGAC CCGTCCCGAA CGTTTGCAGA CCCTCGCACA AGACCTCGTT TCACACTTCG TTGGCCGCGG ATTCTCAGGC AAGGCAATGT ACGTCGGCCT CGACAAAGCC GCAGCCGTGA CAATGCACGA CCTCGTCCAG GACGCGTGGG CCGAGCACCT CGCCGATCTG CGACGCCAGC ACGACGCGCT GCCCGAGCTG GAGCGTCCGT GGCTGGCATC GCGGATTGAG CTGATGGAAA CCACCGACAT GGCAGTCGTG GTCTCCCAAA GCCAGAACGA ACTGAAGATG CTCGATGACT TAGGCCTCGA CATCCGACCC CACCGAGAGC GGATGAACCG CGAAGACCTC GCAGAGAAGT TCAAAGACCC AAACGACCCG TTACGGCTCG TATTCGTGTG CGCGATGTGG ATGACCGGCT TCGACGCCCC GAGCGTGTCG ACTATCTACC TCGACAGGCC GATGAAAAAC CACACCCTCA TGCAGACCAT CGCCCGCGCC AACCGCGTAT TCCCGGACAA GGACAACGGT CTCATTGTCG ATTACGTCGG AGTCTTCCGG AACTTGGAGA AGGCTCTCGC CATCTACGGT GCAGCCAAGG AAGGGGAGTC GCCTATCGAG ATCATCGATG CTCTCGCGGA TGAACTCGAC ACTGCCGTCG CAGATCTGAT CGCCTTCTGC GCGGGCGTCG GCGTCGACTT GATCGCAGTG CGCAATGCTC AAGGCTTCGA CCACGTCGCT AAACGGGACG CTGCCATCGA GGCGTTGTTG GTTGACGAAC AGACCCGCAA CGACTTCACG ACTAAGGCTC GACAGGTCCG CCGCCTATAC AAGGCACTGC TCCCAAACCC GAAAGCGGCT GCACAGCAAC GTAACGTCGC TGCGATCCGA GTGTTGGCCG AGCGCATCCA CGAAGTCACC AAGCCGCCAA CACCTGACAT TGGAGTTGTC GCCGACGCCG TAGACGCCCT GCTGGATCGA TCGGTGGGGG CCGAGGAGTA CGTCATCCGC GCTGCTGCCG AGGGTAGCGA GCCGGACCCA CTGATCGACC TATCGCAGAT CGATTTCGAC GGACTCGCAG CGAGACTGGC CGGCCGCAAG CGCGCAGAGA CGGACAGGAT GGCTCAGCTT CTTCGGCAGC AAGCCATCGG CGCGGCGATG CGGAACCCGA CGCGCTACGA GCTGGTCGAG CGTATCGAGC AGCTGATTGC TGACTACAAC GCGGGCAGCG TCAATATCGA TGAGTACCTG CGTCGTCTCG TCGAGCTTTC AAGAACACTC ACTTCCGAAG AGGAACGAGC AGTCCGCGAA GGCATGACCG AGGAAGAACT TGCGATCTTC GACCTCCTAA CGCAACCCGA TCCCGTGCTC ACCGCGGAGG AACGAGAAAG AGTTCAGGCG AGCGCGAAGA CACTGCTCCA ACATCTGCAT GAAAAGCTCG TCCAGGATTG GCGACGCAAG GTCGACGTGA TGAACGACGT CAACAGCACC ATCCGCCGCG TACTCGACGA CGGTTTACCC GAGACGCCAT ACACAGTCGA TGTCTTCCGC GAGAAGGTTC AGCTCGTATA CGACCATGTT CTGAGTGCGT ACGGAGATGA CGGTGAGAGT GTGTACACCC GACGCGTGGA CGTCGGCTTT CCGCCTCAGA CAGCGGGGCA AGTGCCTTTG GGTCCGATTG ATGTAGACAG GATCGCCGAT GACGTCGTGG CGCGCATCCA CGCCGATCCC GCCTTCGCTG AGCACGTTGC GCGACAGCTG ATAGACCGGC CAAACGCAGC AGAATGA
|
Protein sequence | MMARGPEYDY VEKPSMELLA ELGWNPVDAF HEILGAEGTL GRDSQHDVIL THRLRLAMRN LNAEDVPDLS INEAIEALTK DRSVMDRVRA NREVYDLLRD GYQAEWEDDN GDKRIELIRY IDLRDKLNND LLAVQQMWVK GPLHSRRLDV ALFVNGVPLA VLEFKEPNAP VKSAYDDNVT DYRDTIPQLF IPNCLVLLSN GSEAKVGSTY APWDFFSDWK VIDAFGARSE IALQTALRGT CDPANLLDLF ESFVAYMERP GGLVKIVARS HQYLGVNAAI ENLNRARAVH DKRLGVFWHT QGSGKSLSML WFTQKVLRHV RGKWTFVMVT DRTELDTQLH GEFADAGAIP PEARVHADSI AHLRELLEAD HRYVFTLIQK FQPAKGERQM PVLSERSDII VITDEAHRSQ YDTLALNMRT ALPNASMMGF TGTPLIAGEE QATRQQFGDY VSIYNFGDAI EDGATVPLYY ENRIPELQLT NAEFADELDA LLEKAELDED AEGALARKFG TQYTLLTRPE RLQTLAQDLV SHFVGRGFSG KAMYVGLDKA AAVTMHDLVQ DAWAEHLADL RRQHDALPEL ERPWLASRIE LMETTDMAVV VSQSQNELKM LDDLGLDIRP HRERMNREDL AEKFKDPNDP LRLVFVCAMW MTGFDAPSVS TIYLDRPMKN HTLMQTIARA NRVFPDKDNG LIVDYVGVFR NLEKALAIYG AAKEGESPIE IIDALADELD TAVADLIAFC AGVGVDLIAV RNAQGFDHVA KRDAAIEALL VDEQTRNDFT TKARQVRRLY KALLPNPKAA AQQRNVAAIR VLAERIHEVT KPPTPDIGVV ADAVDALLDR SVGAEEYVIR AAAEGSEPDP LIDLSQIDFD GLAARLAGRK RAETDRMAQL LRQQAIGAAM RNPTRYELVE RIEQLIADYN AGSVNIDEYL RRLVELSRTL TSEEERAVRE GMTEEELAIF DLLTQPDPVL TAEERERVQA SAKTLLQHLH EKLVQDWRRK VDVMNDVNST IRRVLDDGLP ETPYTVDVFR EKVQLVYDHV LSAYGDDGES VYTRRVDVGF PPQTAGQVPL GPIDVDRIAD DVVARIHADP AFAEHVARQL IDRPNAAE
|
| |