Gene Mkms_4188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4188 
Symbol 
ID4612128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4414555 
End bp4416099 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content63% 
IMG OID639793872 
Productsulfatase 
Protein accessionYP_940170 
Protein GI119870218 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACT CCACACCCAA CATCCTCGTG ATCTGGGGTG ACGACATCGG GATCAGCAAC 
CTCAGTTGCT ATAGCCGCGG CATGATGGGG TACCGCACAC CCAACATCGA CCGGATCGCC
GACGAGGGCA TGCTCTTCAC CGACTCTTAC GGCGAGCAGA GCTGCACCGC GGGCCGGTCG
TCGTTCATCA CCGGCCAGAG CGTCTACCGC ACCGGCATGA GCAAGGTCGG GATGCCCGGC
GTCGACATCG GACTGCAGAA GGAGGACCCG ACCATCGCCG AGCTGCTCAA ACCGTTGGGT
TACGCCACCG GGCAGTTCGG CAAGAACCAC CTCGGTGACC TCAACAAGTA CCTTCCGACC
GCCCATGGGT TCGACGAGTT CTTCGGCAAT CTGTACCACC TCAACGCCGA GGAGGAACCC
GAGAACGCCG ACTACCCGAC CGAGGAGGAG GCACCGGTGA TGCGTCGGGC ATTGTTGCCG
CGCGGCGTCA TCCACTCCTG GGCCACCGAG GAGGATTCGG GCGAGGTCGA TGACCGGTAC
GGCCCGGCGG GAAAGCAGCG CATCGAGGAC ACCGGACCGC TGACCAAGAA GCGGATGGAG
ACCATCGACG ACGAAACCAC GGACGCCTGT GTTGATTTCA TCACCCGTGC GCACGGGACC
GGCACCCCGT TCTTCGTGTG GATGAACATG ACGCACATGC ACTTCCGGAC GCACACCAAG
CCGGAGAGCC TGGGACAAGC CGGGCGCTGG CAGTCGCCGT ACCACGACAC GATGATCGAC
CACGACCGCA ACGTCGGTCA GCTACTCGAC CTGCTCGACG AGCTGGGTAT CGCCGACGAC
ACCATCGTCA TCTACTCCAC CGACAACGGC CCGCATGCCA ACAGCTGGCC CGACGGTGCC
ACCACACCGT TCCGCAGCGA GAAGGCCACC AACTGGGAGG GCGCTTTCCG GATCCCGGAA
CTCATTCGCT GGCCCGGCAA GATCGAACCG CGCAGTGTGT CCAATGAGAT TGTGCAGCAT
CACGATTGGC TTCCGACCTT CCTGGCCGCC GCCGGTGACC CCGACATCGT CGACAAGCTC
AAAGCCGGGC ACACGATCGG GGACATCACG TACAAGGTGC ACATCGACGG GTACAACCTG
GTGCCCTATC TGACCGGCGA GGTGGCCAAG AGCCCGCGCC GCGGAATGAT CTACTTCTCC
GACGACTGCG ACGTACTCGG TATCCGCGCG GAGAACTGGA AGGTGGTCTT CCAGGAGCAG
CGTTGCCAGG GAACCCTGCA GATCTGGTTC GAGCCGTTCA CCCCGCTGCG GGCGCCGAAA
CTGTTCAACC TGCGCACCGA TCCGTACGAG CGCGCCGACA TCACGTCGAA CACCTACTGG
GACTGGGTCA TCGACCGCAT CTACCTGGTG CTCTACGGAT CTGCAATCGC GACTCAGTTC
CTCGAGACGT TCAAGGAGTT CCCGCCGCGC CAGGAACCGG CGTCCTTCAC CATCGACCAC
GCGGTCGATG AGCTCAACAA GTTCCTGTCC ACCCGAGGCG GCTGA
 
Protein sequence
MPNSTPNILV IWGDDIGISN LSCYSRGMMG YRTPNIDRIA DEGMLFTDSY GEQSCTAGRS 
SFITGQSVYR TGMSKVGMPG VDIGLQKEDP TIAELLKPLG YATGQFGKNH LGDLNKYLPT
AHGFDEFFGN LYHLNAEEEP ENADYPTEEE APVMRRALLP RGVIHSWATE EDSGEVDDRY
GPAGKQRIED TGPLTKKRME TIDDETTDAC VDFITRAHGT GTPFFVWMNM THMHFRTHTK
PESLGQAGRW QSPYHDTMID HDRNVGQLLD LLDELGIADD TIVIYSTDNG PHANSWPDGA
TTPFRSEKAT NWEGAFRIPE LIRWPGKIEP RSVSNEIVQH HDWLPTFLAA AGDPDIVDKL
KAGHTIGDIT YKVHIDGYNL VPYLTGEVAK SPRRGMIYFS DDCDVLGIRA ENWKVVFQEQ
RCQGTLQIWF EPFTPLRAPK LFNLRTDPYE RADITSNTYW DWVIDRIYLV LYGSAIATQF
LETFKEFPPR QEPASFTIDH AVDELNKFLS TRGG