Gene Mkms_0399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0399 
Symbol 
ID4615282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp443183 
End bp444580 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content66% 
IMG OID639790074 
Productsulfatase 
Protein accessionYP_936406 
Protein GI119866454 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.216517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.603146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGGAC AACCACGGGT GACTCCGCAG GACCGCGCCA ACGTGCTGAT CGTCCACTGG 
CACGATCTCG GTCGCTACCT CGGCGCCTAC GGACACCCGG ACGTACAGAG CCCCCGCCTC
GACCGGTTCG CCGCCGAAAG CATCCTGTTC ACCCGCGCCC ACGCCACCGC ACCGCTGTGC
TCACCGTCGC GCGGGTCGCT GTTCACGGGC CGCTACCCGC AGAGCAACGG CCTGGTCGGA
CTGGCGCACC ACGGCTGGGA GTACCGCGCC GGCGTCCGCA CCCTACCGCA CATCTTGTCT
GAAAACGGTT GGCACACCGC ACTTTTCGGG ATGCAGCACG AGACGTCGTA TCCGCCGAAA
CTGGGGTTCG ACGAGTTCGA CGTGTCCAAC TCCTACTGCG AATACGTGGT CGAACGCGCC
ACCGGGTGGC TGCTCGACGC ACCGCAGCGC CCCTTCCTGC TCACCGCGGG ATTCTTCGAG
ACCCACCGGC CCTACCCGCG TGACCGCTAC GAACCCGCCG ACGCCACCAC CGTCGCGCTA
CCCGACTACC TTCCCGAGGA CCGGGAGGTG CGCCAGGATC TGGCCGAGTT CTACGGGTCG
ATCACCGTCG CCGACGCGGC AGTCGGCCAA CTGCTCGACA CGCTCGCGGC CACCGGACTG
GACCGCAGCA CCTGGGTGGT GTTCATGACC GACCACGGTC CGGCCCTGCC CCGGGCGAAG
TCCACGCTGT ACGACGCGGG CACCGGTATC GCGATGATCA TCCGGCCGCC GCTTGACGCC
GGCATCGCCC CCGGCGTCTA CGACGATCTG TTCAGCGGCG TCGACCTGCT ACCCACGCTG
CTCGACGTGC TCGGCGTCGA CATTCCCGGG GAGGTCGAGG GACTCTCGCA TGCCGACAAT
TTGCTGGGCG GCGCGGAGAA AACGCGGGAA GTGCGCACCG CGGTGTACAC CACGAAGACC
TATCACGATT CCTTCGACCC AATTCGGGCG ATCCGGACAA AAGAATTCAG CTATATCGAG
AATTACGCGC AACGGCCGCT GTTGGATCTG CCGTGGGACA TCGCCGAAAG CGCCCCCGGG
CGCATCGTCG GACCGCGGGC ACGCACGCCA CGGCCCGCCC GCGAACTCTA CGACCTCCGC
ACCGACCCCA CCGAGCAACA CAACCTGCTG ACGTCGGAGA ACAAGATCAA CGCCGAGGCC
GTCGCGACCG ATCTGGCGCT CCTGCTCGAC GACTGGCGGG TGAAGACCAA CGACGTCATA
CCGTCGGATT TCGCGGGTAC GCGGATATCC GACCGATACA CCGAGACATA TCTGCGAATT
CACCGGCGGG AAGTCACCAG TCGCTCGGCC ATCGCTGCGG AACGAGGCGT CAAGGGTGAG
CGCCGAACGG CGCAATGA
 
Protein sequence
MTGQPRVTPQ DRANVLIVHW HDLGRYLGAY GHPDVQSPRL DRFAAESILF TRAHATAPLC 
SPSRGSLFTG RYPQSNGLVG LAHHGWEYRA GVRTLPHILS ENGWHTALFG MQHETSYPPK
LGFDEFDVSN SYCEYVVERA TGWLLDAPQR PFLLTAGFFE THRPYPRDRY EPADATTVAL
PDYLPEDREV RQDLAEFYGS ITVADAAVGQ LLDTLAATGL DRSTWVVFMT DHGPALPRAK
STLYDAGTGI AMIIRPPLDA GIAPGVYDDL FSGVDLLPTL LDVLGVDIPG EVEGLSHADN
LLGGAEKTRE VRTAVYTTKT YHDSFDPIRA IRTKEFSYIE NYAQRPLLDL PWDIAESAPG
RIVGPRARTP RPARELYDLR TDPTEQHNLL TSENKINAEA VATDLALLLD DWRVKTNDVI
PSDFAGTRIS DRYTETYLRI HRREVTSRSA IAAERGVKGE RRTAQ