Gene Mkms_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1040 
Symbol 
ID4614561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1129966 
End bp1132317 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content67% 
IMG OID639790717 
Productsulfatase 
Protein accessionYP_937044 
Protein GI119867092 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.088448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.214837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCACGG AATTCAACGG CAAGATCGAA CTGGACATCC GCGATTCCGA GCCCGACTGG 
GGTCCGTACG CCGCGCCGAC CGCACCCGAG GGCGCGCCCA ATGTGCTGTA CCTCGTGTGG
GATGACACCG GTATCGCGAC GTGGGACTGC TTCGGCGGCC TGGTCGAGAT GCCGGCGATG
AGCCGTATCG CCGAACGCGG TGTGCGCCTG TCGCAGTTCC ACACCACGGC GCTGTGCTCG
CCGACCCGTG CCTCCCTGCT CACCGGCCGC AACGCGACGA CCGTCGGGAT GGCCACCATC
GAGGAGTTCA CCGACGGCTT CCCGAACTGC AGCGGCCGCA TCCCGTTCGA CACCGCGCTG
ATCTCGGAGG TCCTCGCGGA GAACGGCTAC AACACCTACT GCGTCGGCAA GTGGCACCTC
ACTCCGCTCG AGGAGTCGAA TCTGGCTGCC ACCAAACGAC ACTGGCCGCT GTCTCGTGGG
TTCGAACGGT TCTACGGCTT CATGGGCGGC GAAACCGACC AGTGGTATCC CGAACTCGTC
TACGACAACC ACCCGGTCGC CCCGCCCGGC ACCCCCGAGG ACGGCTATCA CCTGTCGAAG
GACCTCGCGG ACAAGACGAT CGAGTTCATC CGCGACGCCA AGGTGATCGC GCCCGACAAA
CCGTGGTTCT CCTACGTCTG CCCGGGTGCC GGCCACGCCC CACACCACGT GTTCAAGGAA
TGGGCCGACC GCTACGCGGG CCGCTTCGAC ATGGGCTACG AGGCCTACCG CGAGATCGTG
CTGGAGAACC AGAAACGCCT CGGCATCGTC CCGTCGGACA CCGAACTCTC ACCGATGAAC
CCGTACGCCG ACGTCACCGG GCCCAACGGG GAGCCGTGGC CCGTCCAGGA CACGGTGCGG
CCGTGGGATT CCCTGTCCGA CAACGAGAAA CGGCTCTTCT GCCGGATGGC GGAGGTCTTC
GCCGGATTCC TGTCCTACAC CGATGCGCAG ATCGGCCGGA TCCTCGACTA CCTGGAGGAG
TCGGGTCAGC TCGACAACAC GATCATCGTG GTGATCTCCG ACAACGGGGC CAGCGGCGAA
GGCGGACCCG ACGGTTCGGT CAACGAGACG AAGTTCTTCA ACGGCTACAT CGACACCGCC
GAGGAGGGGC TCAAGGTCAT CGACGATCTC GGTGGCCCGC ACACCTACAA CCACTATCCG
ACCGGCTGGG CGATGGCGTT CAACACGCCC TACAAACTGT TCAAGCGCTA CGCCTCCCAT
GAGGGGGGCA TCGCCGACAC CGCGATCATC TCGTGGCCCG ACGGCATCGC CGCGCACGGT
GAGGTGCGGG ACAACTACGT CAACGTCTGC GACATCACCC CGACGGTGTA CGACCTGCTG
GGGCTCACCG CCCCGGCCTC CGTGCGCGGG GTGCCGCAGA AACCGCTCGA CGGTGTGAGT
TTCAAAGTGA CACTGGACAA TCCGACCGCG CCCACCGGCA AGGAGACGCA GTTCTACTCG
ATGCTCGGCA CCCGGGGGAT CTGGCACCAG GGCTGGTTCG CCAACACCGT GCACGCGGCG
TCGCCGGCCG GCTGGTCGCA TTTCGACGAC GACCGCTGGG AGTTGTTCCA CATCGAGGCC
GACCGCGCCC AGGTGCACGA TCTGGCCGCC GAACACCCGG AGAAACTCGA GGAACTCAAG
GCGCTGTGGT TCAGCGAGGC CGCCAAGTAC AACGGGCTGC CGCTCGGCGA TCTCAACATC
TTCGACACCA TCGGGCGGTG GCGGCCCAGC CTGTCGGGTG CGCGGGACGC CTACGTGTAC
TACCCGGGCA CCGCGGATGT CGGAACCGGC GCCGTCGTCG AGGTCCAGGG CCGCTCGTTC
GTGGTGCTGG CCGAGGTCAC CGTCGACGAC GACACCGCAC AGGGTGTGGT GTTCAAACAC
GGTGGCGCAC ACGGTGGTCA CGTGATGTAC GTGCAGGACG GCCGGCTGCA CTACGCGTAC
AACTTCCTCG GCGAGACCGA ACAGAAGATG GCGTCGTCGG TGCCGATCAC CCCCGGCCGG
CACACCTTCG GGATCGCCTA CACCCGGACC GGCACCGTCG AGGGCAGCCA CACCCCCCTC
GGTGACGCCG TGCTCTACGT CGACGACGAC GCGGTCGCCT CCTACCCGGG CATGATGAGC
CACCCCGGGA CGTTCGGACT GGCCGGCGCC ACGCTCTCGG TGGGCCGCAA CAGCGGATCC
CCGGTCTCGC GGGCCTACCG GCCGCCGTTC GAGTTCACCG GCGGCACCAT CGCCCAGGTC
TCGTTCGACG TCTCCGGCAA GCCCTACCTC GACCTGGAAC GCGAGTTCGC CCGGGCCTTC
GCGAAGGACT GA
 
Protein sequence
MRTEFNGKIE LDIRDSEPDW GPYAAPTAPE GAPNVLYLVW DDTGIATWDC FGGLVEMPAM 
SRIAERGVRL SQFHTTALCS PTRASLLTGR NATTVGMATI EEFTDGFPNC SGRIPFDTAL
ISEVLAENGY NTYCVGKWHL TPLEESNLAA TKRHWPLSRG FERFYGFMGG ETDQWYPELV
YDNHPVAPPG TPEDGYHLSK DLADKTIEFI RDAKVIAPDK PWFSYVCPGA GHAPHHVFKE
WADRYAGRFD MGYEAYREIV LENQKRLGIV PSDTELSPMN PYADVTGPNG EPWPVQDTVR
PWDSLSDNEK RLFCRMAEVF AGFLSYTDAQ IGRILDYLEE SGQLDNTIIV VISDNGASGE
GGPDGSVNET KFFNGYIDTA EEGLKVIDDL GGPHTYNHYP TGWAMAFNTP YKLFKRYASH
EGGIADTAII SWPDGIAAHG EVRDNYVNVC DITPTVYDLL GLTAPASVRG VPQKPLDGVS
FKVTLDNPTA PTGKETQFYS MLGTRGIWHQ GWFANTVHAA SPAGWSHFDD DRWELFHIEA
DRAQVHDLAA EHPEKLEELK ALWFSEAAKY NGLPLGDLNI FDTIGRWRPS LSGARDAYVY
YPGTADVGTG AVVEVQGRSF VVLAEVTVDD DTAQGVVFKH GGAHGGHVMY VQDGRLHYAY
NFLGETEQKM ASSVPITPGR HTFGIAYTRT GTVEGSHTPL GDAVLYVDDD AVASYPGMMS
HPGTFGLAGA TLSVGRNSGS PVSRAYRPPF EFTGGTIAQV SFDVSGKPYL DLEREFARAF
AKD