Gene Mmcs_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3964 
Symbol 
ID4112794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4222589 
End bp4224970 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content67% 
IMG OID638033107 
Productsulfatase 
Protein accessionYP_641125 
Protein GI108800928 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGATT CGTCACCCCA GAACACCGGA CAGCAGGCAG CGGCCGCGGT GCGGCGCGAC 
GTGCTTCCGA TCCCGGATCC CCGACACGTC GGGTTGACGA CGTACGACGC GAAGGATCCG
AATACGACGT ATCCGCCGAT AACGCGGCTG CGCCCACCCG ACGGTGCGCC GAATGTGCTG
ATCGTGCTGA TCGACGACGT GGGTTTCGGC GCGAGTTCGG CCTTCGGCGG CCCGTGCCGC
ACACCGGTCG CGGAGCGGCT GGCGGCCAAC GGTGTGAAAC TCAACCGCTT CCACACCACC
GCGCTGTGCT CGCCGACGCG CCAGGCGCTG CTCACCGGAC GCAACCACCA CTCGGTGGGG
ATGGGCGGTG TCACCGAGAT CGCGACGTCC GCGCCGGGAT ACAGCAGCAT CCGGCCGAAG
GACAAGGCGC CGATCTCGGA AACGCTTCGG CTCAACGGAT ACTCGACGAG TCAGTTCGGC
AAGTGCCACG AGGTCCCGGT GTGGGAGGTG TCGCCGGTCG GGCCGTTCGA CCAGTGGCCC
ACCGGGTCGG GGTTCGAGCA CTTCTACGGG TTCGTCGGCG GCGAGGCCAA CCAGTACTAC
CCCGGACTGT ACGAGGGCAC CACACCCGTC GAACCGGCCA AGACGCCCGA GCAGGGTTAC
ACCCTGACCG AGGACCTCGC CGACCGGGCG ATCACCTGGG TGCGCCAACA GCAGGCGCTC
ACCCCGGACA AGCCGTTCTT CATGTATTTC GCGCCCGGTG CCACCCACGC GCCACACCAC
GTGCCCCGGG AGTGGTCCGA CCGGTACCGC GGCGAGTTCG ACGCCGGCTG GGACGTGTTG
CGCGAGCAGA TCTTCGCCCG CCAGAAGGAA CTCGGCGTGA TCCCGCAGGA CGCCGAGCTG
ACCCGGCGCC ACGACGAGAT CCCCGGCTGG GACGACATGC CCGACCGGCT CAAGCCGGTG
CTCGCCCGGC AGATGGAGAT CTATGCCGGC TTCATGGAGC AGACCGACCA CGAGGTCGGG
CGGCTGATCG ACGCGGTCGA CGGCCTCGGC GCACTCGACA ACACGCTGAT CTACTACATC
ATCGGCGACA ACGGCGCGTC GGCGGAGGGC ACGCCGAACG GGTGCTTCAA CGAGATGTGC
ACGCTCAACG GGCTGGCCGG GATCGAGACC GAGGATTTCC TGCTGTCCAA GATCGACGAC
TTCGGCACGC CGGACGCGTA CAACCACTAT GCGGTGGGCT GGGCGCACGC CCTGTGCGCG
CCGTATCAGT GGACCAAGCA GGTCGCCTCC CACTGGGGCG GCACCCGCAA CGGCACGATC
GTGCACTGGC CCGCGGGGAT CGACGCCGTC GGCCAGACCC GCGACCAGTT CCACCACGTG
ATCGACGTCG TGCCAACGAT TCTCGAGGCG GCCGACATCC CCGCACCACT GATGGTCAAC
GGCATCGCGC AGGCGCCCAT CGAGGGCTTC AGCATGATGT CGACGCTGCG GGCCGCCGAT
GCCGCCGAGA CCCACCGGGT GCAGTACTTC GAGATGTTCG GCAACCGCGG CATCTACCAC
GACGGTTGGA CGGCGGTCAC CAAACACCGG ACGCCGTGGC TGTCCGACCA GCCGGCGCTC
GAGGACGACG TCTGGGAGCT GTACGGGCCG GGCGACTGGA CCCAGGCGCA CGACCTGGCG
GCCGATGATC CGGCGAAACT CGCCGAACTA CAGCGTCTCT GGCTCATCGA GGCGGTCAAG
TACGACGTGT TGCCGATCGA CGACAGGTCG TACGAGCGCT TCAACCCGGA CATCTCCGGG
CGGCCGGTGC TCATCACCGG CAGCACCCAG ACGATCTTCC CCGGTATGCG GCTGATGGAG
AGCTGCGTGC TCAACATCAA GAACAAGTCG CATTCGGTGA CGGCCGAGCT GTCGGTGCCC
GAGTCCGGTG CGCAGGGGGT GGTGGTCAGC CAGGGCGGCG GTGTCGGTGG TTGGTGTCTG
TACGCGCACG AGGGCCGGTT GAAATACTGT TACAACTTCC TCGGCATCGA GTACTACTAC
GTCACCGCCG AGGCTCCGCT CTCGGCCGGC GATCACCTCC TGCGTATGGA ATTCGCCTAC
GACGGTGGGG GTTTGGGCAA GGGCGGCACC GTCACGCTCT TCTGTGACGG TGAAGCCGTC
GGGACGGGGC GGGTCGACCA GAGCGAGCCG ATGGCGTTCT CCGCCGACGA GATGTGCGAC
GTCGGCTCGG ACAGCGGCTC GCCGACGTCA CCCGACTACG GGCCGCACGA CAACGCTTTC
ACCGGCCGAA TCGACTGGGT GAAGATCGAT ATCGGCGCCG CCGACCACGA CCACCTGATC
ACCGCGGAGG ACAAGCTGAA CATCGCGATG TCGCGGCAGT GA
 
Protein sequence
MSDSSPQNTG QQAAAAVRRD VLPIPDPRHV GLTTYDAKDP NTTYPPITRL RPPDGAPNVL 
IVLIDDVGFG ASSAFGGPCR TPVAERLAAN GVKLNRFHTT ALCSPTRQAL LTGRNHHSVG
MGGVTEIATS APGYSSIRPK DKAPISETLR LNGYSTSQFG KCHEVPVWEV SPVGPFDQWP
TGSGFEHFYG FVGGEANQYY PGLYEGTTPV EPAKTPEQGY TLTEDLADRA ITWVRQQQAL
TPDKPFFMYF APGATHAPHH VPREWSDRYR GEFDAGWDVL REQIFARQKE LGVIPQDAEL
TRRHDEIPGW DDMPDRLKPV LARQMEIYAG FMEQTDHEVG RLIDAVDGLG ALDNTLIYYI
IGDNGASAEG TPNGCFNEMC TLNGLAGIET EDFLLSKIDD FGTPDAYNHY AVGWAHALCA
PYQWTKQVAS HWGGTRNGTI VHWPAGIDAV GQTRDQFHHV IDVVPTILEA ADIPAPLMVN
GIAQAPIEGF SMMSTLRAAD AAETHRVQYF EMFGNRGIYH DGWTAVTKHR TPWLSDQPAL
EDDVWELYGP GDWTQAHDLA ADDPAKLAEL QRLWLIEAVK YDVLPIDDRS YERFNPDISG
RPVLITGSTQ TIFPGMRLME SCVLNIKNKS HSVTAELSVP ESGAQGVVVS QGGGVGGWCL
YAHEGRLKYC YNFLGIEYYY VTAEAPLSAG DHLLRMEFAY DGGGLGKGGT VTLFCDGEAV
GTGRVDQSEP MAFSADEMCD VGSDSGSPTS PDYGPHDNAF TGRIDWVKID IGAADHDHLI
TAEDKLNIAM SRQ