Gene Mmcs_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1023 
Symbol 
ID4109862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1124232 
End bp1126583 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content67% 
IMG OID638030146 
Productsulfatase 
Protein accessionYP_638193 
Protein GI108797996 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0349992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCACGG AATTCAACGG CAAGATCGAA CTGGACATCC GCGATTCCGA GCCCGACTGG 
GGTCCGTACG CCGCGCCGAC CGCACCCGAG GGCGCGCCCA ATGTGCTGTA CCTCGTGTGG
GATGACACCG GTATCGCGAC GTGGGACTGC TTCGGCGGCC TGGTCGAGAT GCCGGCGATG
AGCCGTATCG CCGAACGCGG TGTGCGCCTG TCGCAGTTCC ACACCACGGC GCTGTGCTCG
CCGACCCGTG CCTCCCTGCT CACCGGCCGC AACGCGACGA CCGTCGGGAT GGCCACCATC
GAGGAGTTCA CCGACGGCTT CCCGAACTGC AGCGGCCGCA TCCCGTTCGA CACCGCGCTG
ATCTCGGAGG TCCTCGCGGA GAACGGCTAC AACACCTACT GCGTCGGCAA GTGGCACCTC
ACTCCGCTCG AGGAGTCGAA TCTGGCTGCC ACCAAACGAC ACTGGCCGCT GTCTCGTGGG
TTCGAACGGT TCTACGGCTT CATGGGCGGC GAAACCGACC AGTGGTATCC CGAACTCGTC
TACGACAACC ACCCGGTCGC CCCGCCCGGC ACCCCCGAGG ACGGCTATCA CCTGTCGAAG
GACCTCGCGG ACAAGACGAT CGAGTTCATC CGCGACGCCA AGGTGATCGC GCCCGACAAA
CCGTGGTTCT CCTACGTCTG CCCGGGTGCC GGCCACGCCC CACACCACGT GTTCAAGGAA
TGGGCCGACC GCTACGCGGG CCGCTTCGAC ATGGGCTACG AGGCCTACCG CGAGATCGTG
CTGGAGAACC AGAAACGCCT CGGCATCGTC CCGTCGGACA CCGAACTCTC ACCGATGAAC
CCGTACGCCG ACGTCACCGG GCCCAACGGG GAGCCGTGGC CCGTCCAGGA CACGGTGCGG
CCGTGGGATT CCCTGTCCGA CAACGAGAAA CGGCTCTTCT GCCGGATGGC GGAGGTCTTC
GCCGGATTCC TGTCCTACAC CGATGCGCAG ATCGGCCGGA TCCTCGACTA CCTGGAGGAG
TCGGGTCAGC TCGACAACAC GATCATCGTG GTGATCTCCG ACAACGGGGC CAGCGGCGAA
GGCGGACCCG ACGGTTCGGT CAACGAGACG AAGTTCTTCA ACGGCTACAT CGACACCGCC
GAGGAGGGGC TCAAGGTCAT CGACGATCTC GGTGGCCCGC ACACCTACAA CCACTATCCG
ACCGGCTGGG CGATGGCGTT CAACACGCCC TACAAACTGT TCAAGCGCTA CGCCTCCCAT
GAGGGGGGCA TCGCCGACAC CGCGATCATC TCGTGGCCCG ACGGCATCGC CGCGCACGGT
GAGGTGCGGG ACAACTACGT CAACGTCTGC GACATCACCC CGACGGTGTA CGACCTGCTG
GGGCTCACCG CCCCGGCCTC CGTGCGCGGG GTGCCGCAGA AACCGCTCGA CGGTGTGAGT
TTCAAAGTGA CACTGGACAA TCCGACCGCG CCCACCGGCA AGGAGACGCA GTTCTACTCG
ATGCTCGGCA CCCGGGGGAT CTGGCACCAG GGCTGGTTCG CCAACACCGT GCACGCGGCG
TCGCCGGCCG GCTGGTCGCA TTTCGACGAC GACCGCTGGG AGTTGTTCCA CATCGAGGCC
GACCGCGCCC AGGTGCACGA TCTGGCCGCC GAACACCCGG AGAAACTCGA GGAACTCAAG
GCGCTGTGGT TCAGCGAGGC CGCCAAGTAC AACGGGCTGC CGCTCGGCGA TCTCAACATC
TTCGACACCA TCGGGCGGTG GCGGCCCAGC CTGTCGGGTG CGCGGGACGC CTACGTGTAC
TACCCGGGCA CCGCGGATGT CGGAACCGGC GCCGTCGTCG AGGTCCAGGG CCGCTCGTTC
GTGGTGCTGG CCGAGGTCAC CGTCGACGAC GACACCGCAC AGGGTGTGGT GTTCAAACAC
GGTGGCGCAC ACGGTGGTCA CGTGATGTAC GTGCAGGACG GCCGGCTGCA CTACGCGTAC
AACTTCCTCG GCGAGACCGA ACAGAAGATG GCGTCGTCGG TGCCGATCAC CCCCGGCCGG
CACACCTTCG GGATCGCCTA CACCCGGACC GGCACCGTCG AGGGCAGCCA CACCCCCCTC
GGTGACGCCG TGCTCTACGT CGACGACGAC GCGGTCGCCT CCTACCCGGG CATGATGAGC
CACCCCGGGA CGTTCGGACT GGCCGGCGCC ACGCTCTCGG TGGGCCGCAA CAGCGGATCC
CCGGTCTCGC GGGCCTACCG GCCGCCGTTC GAGTTCACCG GCGGCACCAT CGCCCAGGTC
TCGTTCGACG TCTCCGGCAA GCCCTACCTC GACCTGGAAC GCGAGTTCGC CCGGGCCTTC
GCGAAGGACT GA
 
Protein sequence
MRTEFNGKIE LDIRDSEPDW GPYAAPTAPE GAPNVLYLVW DDTGIATWDC FGGLVEMPAM 
SRIAERGVRL SQFHTTALCS PTRASLLTGR NATTVGMATI EEFTDGFPNC SGRIPFDTAL
ISEVLAENGY NTYCVGKWHL TPLEESNLAA TKRHWPLSRG FERFYGFMGG ETDQWYPELV
YDNHPVAPPG TPEDGYHLSK DLADKTIEFI RDAKVIAPDK PWFSYVCPGA GHAPHHVFKE
WADRYAGRFD MGYEAYREIV LENQKRLGIV PSDTELSPMN PYADVTGPNG EPWPVQDTVR
PWDSLSDNEK RLFCRMAEVF AGFLSYTDAQ IGRILDYLEE SGQLDNTIIV VISDNGASGE
GGPDGSVNET KFFNGYIDTA EEGLKVIDDL GGPHTYNHYP TGWAMAFNTP YKLFKRYASH
EGGIADTAII SWPDGIAAHG EVRDNYVNVC DITPTVYDLL GLTAPASVRG VPQKPLDGVS
FKVTLDNPTA PTGKETQFYS MLGTRGIWHQ GWFANTVHAA SPAGWSHFDD DRWELFHIEA
DRAQVHDLAA EHPEKLEELK ALWFSEAAKY NGLPLGDLNI FDTIGRWRPS LSGARDAYVY
YPGTADVGTG AVVEVQGRSF VVLAEVTVDD DTAQGVVFKH GGAHGGHVMY VQDGRLHYAY
NFLGETEQKM ASSVPITPGR HTFGIAYTRT GTVEGSHTPL GDAVLYVDDD AVASYPGMMS
HPGTFGLAGA TLSVGRNSGS PVSRAYRPPF EFTGGTIAQV SFDVSGKPYL DLEREFARAF
AKD