Gene Mkms_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4038 
Symbol 
ID4611978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4257071 
End bp4259452 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content67% 
IMG OID639793722 
Productsulfatase 
Protein accessionYP_940020 
Protein GI119870068 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATT CGTCACCCCA GAACACCGGA CAGCAGGCAG CGGCCGCGGT GCGGCGCGAC 
GTGCTTCCGA TCCCGGATCC CCGACACGTC GGGTTGACGA CGTACGACGC GAAGGATCCG
AATACGACGT ATCCGCCGAT AACGCGGCTG CGCCCACCCG ACGGTGCGCC GAATGTGCTG
ATCGTGCTGA TCGACGACGT GGGTTTCGGC GCGAGTTCGG CCTTCGGCGG CCCGTGCCGC
ACACCGGTCG CGGAGCGGCT GGCGGCCAAC GGTGTGAAAC TCAACCGCTT CCACACCACC
GCGCTGTGCT CGCCGACGCG CCAGGCGCTG CTCACCGGAC GCAACCACCA CTCGGTGGGG
ATGGGCGGTG TCACCGAGAT CGCGACGTCC GCGCCGGGAT ACAGCAGCAT CCGGCCGAAG
GACAAGGCGC CGATCTCGGA AACGCTTCGG CTCAACGGAT ACTCGACGAG TCAGTTCGGC
AAGTGCCACG AGGTCCCGGT GTGGGAGGTG TCGCCGGTCG GGCCGTTCGA CCAGTGGCCC
ACCGGGTCGG GGTTCGAGCA CTTCTACGGG TTCGTCGGCG GCGAGGCCAA CCAGTACTAC
CCCGGACTGT ACGAGGGCAC CACACCCGTC GAACCGGCCA AGACGCCCGA GCAGGGTTAC
ACCCTGACCG AGGACCTCGC CGACCGGGCG ATCACCTGGG TGCGCCAACA GCAGGCGCTC
ACCCCGGACA AGCCGTTCTT CATGTATTTC GCGCCCGGTG CCACCCACGC GCCACACCAC
GTGCCCCGGG AGTGGTCCGA CCGGTACCGC GGCGAGTTCG ACGCCGGCTG GGACGTGTTG
CGCGAGCAGA TCTTCGCCCG CCAGAAGGAA CTCGGCGTGA TCCCGCAGGA CGCCGAGCTG
ACCCGGCGCC ACGACGAGAT CCCCGGCTGG GACGACATGC CCGACCGGCT CAAGCCGGTG
CTCGCCCGGC AGATGGAGAT CTATGCCGGC TTCATGGAGC AGACCGACCA CGAGGTCGGG
CGGCTGATCG ACGCGGTCGA CGGCCTCGGC GCACTCGACA ACACGCTGAT CTACTACATC
ATCGGCGACA ACGGCGCGTC GGCGGAGGGC ACGCCGAACG GGTGCTTCAA CGAGATGTGC
ACGCTCAACG GGCTGGCCGG GATCGAGACC GAGGATTTCC TGCTGTCCAA GATCGACGAC
TTCGGCACGC CGGACGCGTA CAACCACTAT GCGGTGGGCT GGGCGCACGC CCTGTGCGCG
CCGTATCAGT GGACCAAGCA GGTCGCCTCC CACTGGGGCG GCACCCGCAA CGGCACGATC
GTGCACTGGC CCGCGGGGAT CGACGCCGTC GGCCAGACCC GCGACCAGTT CCACCACGTG
ATCGACGTCG TGCCAACGAT TCTCGAGGCG GCCGACATCC CCGCACCACT GATGGTCAAC
GGCATCGCGC AGGCGCCCAT CGAGGGCTTC AGCATGATGT CGACGCTGCG GGCCGCCGAT
GCCGCCGAGA CCCACCGGGT GCAGTACTTC GAGATGTTCG GCAACCGCGG CATCTACCAC
GACGGTTGGA CGGCGGTCAC CAAACACCGG ACGCCGTGGC TGTCCGACCA GCCGGCGCTC
GAGGACGACG TCTGGGAGCT GTACGGGCCG GGCGACTGGA CCCAGGCGCA CGACCTGGCG
GCCGATGATC CGGCGAAACT CGCCGAACTA CAGCGTCTCT GGCTCATCGA GGCGGTCAAG
TACGACGTGT TGCCGATCGA CGACAGGTCG TACGAGCGCT TCAACCCGGA CATCTCCGGG
CGGCCGGTGC TCATCACCGG CAGCACCCAG ACGATCTTCC CCGGTATGCG GCTGATGGAG
AGCTGCGTGC TCAACATCAA GAACAAGTCG CATTCGGTGA CGGCCGAGCT GTCGGTGCCC
GAGTCCGGTG CGCAGGGGGT GGTGGTCAGC CAGGGCGGCG GTGTCGGTGG TTGGTGTCTG
TACGCGCACG AGGGCCGGTT GAAATACTGT TACAACTTCC TCGGCATCGA GTACTACTAC
GTCACCGCCG AGGCTCCGCT CTCGGCCGGC GATCACCTCC TGCGTATGGA ATTCGCCTAC
GACGGTGGGG GTTTGGGCAA GGGCGGCACC GTCACGCTCT TCTGTGACGG TGAAGCCGTC
GGGACGGGGC GGGTCGACCA GAGCGAGCCG ATGGCGTTCT CCGCCGACGA GATGTGCGAC
GTCGGCTCGG ACAGCGGCTC GCCGACGTCA CCCGACTACG GGCCGCACGA CAACGCTTTC
ACCGGCCGAA TCGACTGGGT GAAGATCGAT ATCGGCGCCG CCGACCACGA CCACCTGATC
ACCGCGGAGG ACAAGCTGAA CATCGCGATG TCGCGGCAGT GA
 
Protein sequence
MSDSSPQNTG QQAAAAVRRD VLPIPDPRHV GLTTYDAKDP NTTYPPITRL RPPDGAPNVL 
IVLIDDVGFG ASSAFGGPCR TPVAERLAAN GVKLNRFHTT ALCSPTRQAL LTGRNHHSVG
MGGVTEIATS APGYSSIRPK DKAPISETLR LNGYSTSQFG KCHEVPVWEV SPVGPFDQWP
TGSGFEHFYG FVGGEANQYY PGLYEGTTPV EPAKTPEQGY TLTEDLADRA ITWVRQQQAL
TPDKPFFMYF APGATHAPHH VPREWSDRYR GEFDAGWDVL REQIFARQKE LGVIPQDAEL
TRRHDEIPGW DDMPDRLKPV LARQMEIYAG FMEQTDHEVG RLIDAVDGLG ALDNTLIYYI
IGDNGASAEG TPNGCFNEMC TLNGLAGIET EDFLLSKIDD FGTPDAYNHY AVGWAHALCA
PYQWTKQVAS HWGGTRNGTI VHWPAGIDAV GQTRDQFHHV IDVVPTILEA ADIPAPLMVN
GIAQAPIEGF SMMSTLRAAD AAETHRVQYF EMFGNRGIYH DGWTAVTKHR TPWLSDQPAL
EDDVWELYGP GDWTQAHDLA ADDPAKLAEL QRLWLIEAVK YDVLPIDDRS YERFNPDISG
RPVLITGSTQ TIFPGMRLME SCVLNIKNKS HSVTAELSVP ESGAQGVVVS QGGGVGGWCL
YAHEGRLKYC YNFLGIEYYY VTAEAPLSAG DHLLRMEFAY DGGGLGKGGT VTLFCDGEAV
GTGRVDQSEP MAFSADEMCD VGSDSGSPTS PDYGPHDNAF TGRIDWVKID IGAADHDHLI
TAEDKLNIAM SRQ