Gene Mmcs_3779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3779 
Symbol 
ID4112610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4031427 
End bp4033838 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content64% 
IMG OID638032918 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_640941 
Protein GI108800744 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCCT CGGATGACGA CGCCCGGCTG CCGGCCGAAG CGCGCGCCCG CAAACTTATC 
GATGCACAGC TGACGCAGGC CGGTTGGGTT GTCCAGGACA AGCAGGACCT CAACCTCTTC
GCGGCGCAAG GCGTCGCCGT CCGCGAGGTG GTGATGAAGC CCGGCCACGG CCGTGTCGAT
TACCTGTTGT ACGTCGACAA AGCCGTGGTC GGCGTCATCG AGGCTAAACC GGTCGGCACG
CCGCTCTCCG GTGTGGAGTG GCAGTCGGCA ATGTACGCCG ACGGTCTGCC CGCCGACGTC
AGGATTGCGG CCAAGACACG CGATGGCAGG TTGCCGTTCG TGTTCGAGGC TTCCGGCGTC
GAAACGCACT TCACCAACGG CTTCGAGCCG GAATCGCGGG CGCGGCTGAT CTTCAATGTC
CCGCGACCGG AGACGCTCGC GCGTTATCTG CGCGAGGCCG AGACAAACCC CGATTACCCC
ACCTGGCGTG CGAAGGTCCG CAAGCTGCCG GCGCTGGACA CGGCTGCTCT GCGGCCGGCG
CAGATCGAGG CCGTCAACGG GGTGGAGCAG AGCCTGGCCG GTCAGCAGTG CAACCGGTCG
TTGGTGCAGA TGGCGACCGG CGCGGGCAAG ACGTTCACTG CGGTGACGCA GTCCTACCGG
CTGCTCAAGC ACGGCGGCTT CGAACGGATC CTGTTCCTGG TCGACCGCAA CAATCTGGCC
GACCAGACGT TGGGCGAATT TCAGAACTAC CGCACACCCG ATGACGGTCG ACGTTTCACC
GAGCTGTACA ACGTCAACAA GCTTTCCAGC GCGGGCTTGC TGGGATCGAC CAAGGTCACG
ATCTCAACGA TTCAGCGGGT GTTCCGTTTC ATCAAGGCCG GCGAGGTCAG TGATGCCGAC
GACCCCGACA TCGACGATTA CGTGCCGGAC GCGCCTGTCA CCGTTTCGTA CAGCGAGGCG
CTGCCACCGG AGACCTTCGA CCTGGTCATT GTCGACGAGG CGCACCGCAG CATCTACGGC
GTGTGGCGCG GAGTCCTGGA GTACTTCGAT GCGCACGTCG TCGGGCTGAC GGCGACACCG
GGCAAGCAGA CGTTCGCGTT CTTCCGGCAG AACCTGGTGT CGGACTACAC CTACCCGGAA
TCGGTGGCCG ACGGCGTCAA CGTCGACTTC GACGTCTACC GCATCCGCAC CAAGATCAGC
GATCAGGGAT CGCATATCGA TGCCGGAACC ATCGTCCCCA AGGTGGACCG CCGCACCCGC
GAACAGCGTC TGGAAGCACT CGATGACGAT CTGGATTACG TTCCCGGGCA ACTTGATCGA
GCAGTCACCG CCACAGATCA GATCCGCACG GTGCTGGAGA CTTTTCGCGA CCGGCTGTTC
ACCGAGATCT TTCCCGGCCG CAGTACGGTC CCGAAGACGC TGATCTTCGC CAAGGACGAC
AACCACGCCG AGGAGATCGT CAGACAGGTG CGTGAGGTGT TCGGCAAGGG CAACGACTTT
GCCGCCAAGA TCACCTACAA CGCCCGCAAC GCGAAGGAAC AGCTGAAGGC GTTCCGCACC
AGCCCGGCTC TGCGCATCGC GGTGACGGTC GACATGATCG CCACCGGCAC CGATGTCAAA
CCGCTGGAGT GCGTGTTCTT CATGCGTGAC GTGCGTTCGG CGCAGTACTT CGAGCAGATG
AAGGGCCGGG GTGCACGCAC CATTCCCGAC GCGGACTTCC AGGCGGTCAC GCCGGACGCC
ACGACCAAGA CCCGGTTCGT CATCGTCGAC GCGATCGGTG TCACCGAGCA CGATTTCGTC
GAACCCCCGC TGAACCGGCA GCGCACGGTG CCGCTGAAGA AGCTGCTGGA GAAAGCCGCC
AACCTGACGA TCGCTGAGGA CGAGGCGGCC ACCCTGGCCT CACGCCTGGC CAAGCTCGAA
CTCGATCTGA CCGATGCCGA GCGTGTCGAG CTCGATGAGG TGGCGGGGCA GCCGGTGCGC
GAGATCATCC GGTCGTTGGT GGATGCGGTC GAGCCGTCGG CCACAACTCC CGAAGCCATT
GAAGCGGCCC TGGTGCCGAT CGCGTCGAAC CCGGAGCTGC GCAACCGTGT CCTCGAACTT
CGGGCCGCCC ACGACCGCAT CATCGATGAG GTCAGCGCCG ATGAGCTGAT CGAGGCCGGG
GGAGTGGTGG ACCCCGGCAA GGCCCAGTCG ATCGTAGAGT CGTGGACCGC GTACTCGAAG
AGCACCGTGA TGAGATCACG GCGCTGCAGT TGGGATATGA GGCCAGCGAG CGACGCATCG
ACTTCGGCTT CATCGAGGGG TTGGCCGCGA GAATTGCTCG GCCGCCGCAC AATTGGACGC
CTGATCTACT CTGGAACGCC TATGCCGCGG TCGACGGCCC GAAGGTGCAC AAGAGCGCCA
CCCATGCGGT GA
 
Protein sequence
MSASDDDARL PAEARARKLI DAQLTQAGWV VQDKQDLNLF AAQGVAVREV VMKPGHGRVD 
YLLYVDKAVV GVIEAKPVGT PLSGVEWQSA MYADGLPADV RIAAKTRDGR LPFVFEASGV
ETHFTNGFEP ESRARLIFNV PRPETLARYL REAETNPDYP TWRAKVRKLP ALDTAALRPA
QIEAVNGVEQ SLAGQQCNRS LVQMATGAGK TFTAVTQSYR LLKHGGFERI LFLVDRNNLA
DQTLGEFQNY RTPDDGRRFT ELYNVNKLSS AGLLGSTKVT ISTIQRVFRF IKAGEVSDAD
DPDIDDYVPD APVTVSYSEA LPPETFDLVI VDEAHRSIYG VWRGVLEYFD AHVVGLTATP
GKQTFAFFRQ NLVSDYTYPE SVADGVNVDF DVYRIRTKIS DQGSHIDAGT IVPKVDRRTR
EQRLEALDDD LDYVPGQLDR AVTATDQIRT VLETFRDRLF TEIFPGRSTV PKTLIFAKDD
NHAEEIVRQV REVFGKGNDF AAKITYNARN AKEQLKAFRT SPALRIAVTV DMIATGTDVK
PLECVFFMRD VRSAQYFEQM KGRGARTIPD ADFQAVTPDA TTKTRFVIVD AIGVTEHDFV
EPPLNRQRTV PLKKLLEKAA NLTIAEDEAA TLASRLAKLE LDLTDAERVE LDEVAGQPVR
EIIRSLVDAV EPSATTPEAI EAALVPIASN PELRNRVLEL RAAHDRIIDE VSADELIEAG
GVVDPGKAQS IVESWTAYSK STVMRSRRCS WDMRPASDAS TSASSRGWPR ELLGRRTIGR
LIYSGTPMPR STARRCTRAP PMR