Gene Mkms_0910 details

Gene Information

Locus tag: Mkms_0910
Symbol:
ID: 4614751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 993636
End bp: 995432
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 72%
IMG OID: 639790584
Product: sulfatase
Protein accession: YP_936914
Protein GI: 119866962
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
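
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 993636
end_bp = 995432

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # matches the "Gene Length" field
print(protein_length, "aa")   # matches the "Protein Length" field
```

Under these assumptions the computed values agree with the reported 1797 bp and 598 aa.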


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAACC CCGACATCGT CATCCTGATG ACCGACGAGG AACGCGCGGT CCCGCCGTAC 
GAGACACCCG AGGTGCTGGC GTGGCGCGAC CGCACCCTGC CGTGCCGCAA GTGGTTCGAC
GACCACGGCG TCAGCTTCGG CAGGCACTAC ACCGGATCGC TGGCGTGCGT GCCGAGCCGG
CCGACGATCT TCACCGGCCA GTACCCGGAT CTGCACGGCG TCACCCAGAC CGACGGCATC
GGCAAGACCT ACGGCGACTC GCGCATGCGG TGGCTGCGCC CGGGTGAGGT GCCGACGCTG
GGCAACTGGT TCCGCGCCGC CGGCTACGAC ACCCATTACG ACGGTAAGTG GCACATCTCC
CACGCCGACG TCACCGACCC GGCCACCGGG CTGCCGCTGG ACACCAACGA CGACGACGGT
GTGGTCGACG CGGATGCGGT GCGGCGCTAC CTCGACGCCG ACCCGTTGGC GCCGTACGGC
TTCTCCGGCT GGGTCGGTCC CGAACCGCAC GGAGCGGCCT TGTCCAACAG CGGGTTTCGC
CGCGACCCGC TGATCGCCGC CCGGGTGGTG GCGTGGCTGG AGGACCGCTA CGCGCGCCGC
CGCGCCGGCG ACCCGCAGGC GTTGCGGCCG TTCCTGCTGG TGGCCAGCTT CGTCAACCCG
CACGACATCG TGCTGTTCCC GCAGTGGGTG CGGCGCAGCC CGGTCAAGCC GTCCCCGCTC
GACCCGCCGC ACGTCCCGGC CGCACCGACC GCCGACGAGG ACCTGTCGAC GAAACCGGCC
GCGCAGATCG CGTTCCGCGA GGCCTACTAC TCCGGATACG GCCCCGCGGC GGTGATGGAG
CGGACCTACC GGCGCAACGC CCAGCAGTAC CGGGATCTGT ACTACCGCCT GCACGCCCAG
GTCGACGGTC CGCTCGAGCG GGTGCGCCGC GCGGTCGTCG AGGGTTCGCA GGATGCGGTG
CTGGTCCGCA CGGCCGACCA CGGCGACCTG CTCGGCGCGC ACGGCGGTCT GCACCAGAAG
TGGTTCAACC TCTACGACGA GGCCACCCGC GTCCCGTTCG TCATCGCCCG CACCGGCGCC
AACGCGACCG CAGCCCGCAC GGTCACCGCC CCCACCTCAC ACGTCGACCT CGTTCCGACC
CTGCTGAGCG CCGCGGGTGT CGACGTCGCC GCCGCCGCGG CCACGCTCGC CGAGTCCTTC
ACCGAGGTGC ACCCCCTGCC GGGGCGTGAC CTCATGCCGG TGGTCGACGG GGCGGCCCCC
GACGAGGATC GCGCGGTGTA CCTGATGACC CGCGACAACA TGCTCGAAGG TGACAGCGGC
GCATCGGGTC TGGCGCGCAA GCTCAAGCGC ACCGTCAATC CGCCGGGGCC GCTGCGGATC
CGGGTGCCCG CACACGTCGC GTCCAACTTC GAGGGACTCG TGACGCAGGT CGACGGCCAC
CTCTGGAAGC TGGTGCGCAG CTTCGACGAT CCGGCCACCT GGACCGAACC GGGCGTGCGG
CACCTGGCCG CCAACGGTGT CGGCGGGGAG GCCTACCGTT CCAGCCCGCT CGACGACCAG
TGGGAGCTCT ACGACCTCAC CGCCGACCCG ACCGAGGCCG TCAACAGGTG GCCCGACCCC
TCACTCGACG AGCTGCGCGC ACACCTGCGC CGACAACTCA AACACGTCAG GACCGAATCG
ATTCCGGAGC GCAACCAACC GTGGCCGTAC GCCGTCCGCC GCCCACCGAC CGGAGGGGCC
CGGGTGGGCC TCGTCCGACG GGCGCTCGGA CGCCTGGGGG TCGGCGCCGC GGTTTGA
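
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases among all A, C, G, and T bases; the helper name `gc_fraction` is hypothetical, and the demonstration uses only the first row of the sequence, so its result is not expected to equal the 72% reported for the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G/C bases, ignoring spaces and line breaks."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, with its 10-base grouping kept as-is.
first_row = "ATGTCTAACC CCGACATCGT CATCCTGATG ACCGACGAGG AACGCGCGGT CCCGCCGTAC"
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

Passing the full gene sequence to the same function would give the value to compare against the GC content field.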
 
Protein sequence
MSNPDIVILM TDEERAVPPY ETPEVLAWRD RTLPCRKWFD DHGVSFGRHY TGSLACVPSR 
PTIFTGQYPD LHGVTQTDGI GKTYGDSRMR WLRPGEVPTL GNWFRAAGYD THYDGKWHIS
HADVTDPATG LPLDTNDDDG VVDADAVRRY LDADPLAPYG FSGWVGPEPH GAALSNSGFR
RDPLIAARVV AWLEDRYARR RAGDPQALRP FLLVASFVNP HDIVLFPQWV RRSPVKPSPL
DPPHVPAAPT ADEDLSTKPA AQIAFREAYY SGYGPAAVME RTYRRNAQQY RDLYYRLHAQ
VDGPLERVRR AVVEGSQDAV LVRTADHGDL LGAHGGLHQK WFNLYDEATR VPFVIARTGA
NATAARTVTA PTSHVDLVPT LLSAAGVDVA AAAATLAESF TEVHPLPGRD LMPVVDGAAP
DEDRAVYLMT RDNMLEGDSG ASGLARKLKR TVNPPGPLRI RVPAHVASNF EGLVTQVDGH
LWKLVRSFDD PATWTEPGVR HLAANGVGGE AYRSSPLDDQ WELYDLTADP TEAVNRWPDP
SLDELRAHLR RQLKHVRTES IPERNQPWPY AVRRPPTGGA RVGLVRRALG RLGVGAAV
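
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons), and it builds the codon table from the usual TCAG ordering. The names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Codon table in TCAG order. Assumption: the amino-acid assignments of
# translation table 11 match those of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCTAACC CCGACATCGT CATCCTGATG"))  # MSNPDIVILM
```

The output matches the first ten residues of the protein sequence listed above.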