Gene Mkms_3416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3416 
Symbol 
ID4611343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3582609 
End bp3584978 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content65% 
IMG OID639793089 
Productpolysaccharide deacetylase 
Protein accessionYP_939400 
Protein GI119869448 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCAGCC CCTTCAGAAG ACGCCCGGAC GCCATCCGCG GCACCGACCC TCTGACCGCC 
GCACCCGGTC CGGTGTTCTT CGACCGGACG GGAAAACGTC TGTACTACTT CGCCGCCGGT
GTCATCGTGC TCGCGGTGAT CGTGGCAGTG CTGCCGGTCC GGGTGACCCC CATGGCCTTC
GATCCGCTGT GGACCGTGCC GACCAACAGT GATTCGGGCT TCCCGCGTCG CGTGCTGTCC
ACCGCCGACG ACCGGCGCTT TCCCGTTCTC GGCAGCGAGG ATGACGAGGT CCTCAGCAGG
GTGGTCCGGG TCGACCGGCG GGGTGATCAG TGGAATCTCG TCGACCCGTT CACCAATCAG
CTGTACCGGG TCGCCGACGA TGACGACAAG GATCGGATTC GCGACAGTGC CTACGCGATA
GACCATTACG GCCGTCCGCC GGACCGCCAC CTGATGCTGA CATTCGACGA CGGACCCGAC
GTCGAGTTCA CCACCCAGGT GCTCGACATC CTGTCCCGCG AGAAGATTCC CGCGACCTTC
TTCGTCCTCG GCTCTCAGGT CGTCCTGCAT CCGGATGTCT TCCGGCGGAT GATCCGCGAG
GGCCATATGG CCGGCAACCA CACGATGTTC CATGTCGACT TCGACGAGCA CACCGATTTC
CGCAACCGGC AGGAGATCAT CGCCACCGAC CGCGTCATGC GCGCCACCGC GGCCTATGCC
AGCAGACTCT TCCGGATCCC GCGAGGCGAC CCCGACAACA ACACGCTCGC GCTGCTGCAG
TCACAGCAGC TCGGATACCT CCAGGTCGAC CAGGACATCG ACACCCTCGA CTGGAAGGTG
TCGCCGGAGA ACGAGGTGGC GGTCCCACAA CTCGACGGGC GTGGACACGT CGTCCTCCTG
CACGACGGTG GCGGCGACCG GTCAGCGACC GTCCGCATGC TCGAGCAACT GATCGCCGAG
GCCAAGAGCA AGGGCTACAC CTTCTCGACG CTCGCACCGC TGCTGTCTGA GCAGGAGCTG
CCCGTGAGCA ACGCCGAACA GTATCCGGCC GACGCCGCGA CGTACCACAC GCTGCGGCTG
ATGGAGAAGG CGCCGGGAGC AGTGCTCGGG TTCCTGTTCT GGCTGGGCAT GGGCTCGCTG
ACGGTGATGT CGTTGCTGTA CCTGATACTC GCGCTCGTCT GTCAATACCG GCAGAACCGG
TTGCGCTGGA ACGACATCGG CGATGACCAG CTGCCGATGG TGAGCGTGGT GTTGGCCGCC
TTCAACGAGG AGAAGGTGAT CGCGAGAACG ATTGCCGAAC TCCGCCGCAG CGACTACCCG
CGGTCGAGGT TCGAGGTGGT CGCGGTCAAC GACGGGTCCA CGGACGGCAC CCTGCGGATC
CTCACCGAAC TGGCGCGCGA CTGGCCCAAA TTGCGCGTGG TCGACCAGGC GAACAGCGGA
AAATCGTCGG CGATCAACAA CGGGATCAAT CACGCGTCGG CGGTGTCGAC CGTGATGGTC
ACGATGGACG CGGACACCCT CTTCAGACCG GACACGATCC GCAACCTCGC CCGGCACTTC
GCCCGCCACA CCCACGGCAG ACAGGTCGGC GCGGTGGCCG GGCACATCAA GGTCGGCAAT
CGGCGCAATC TCCTGACCGC CTGGCAGAGC CTCGAGTACA TCTCCGGAAT ATGCGTCACC
CGTATGGCGG AGCGCCTGCT CAACGCGATC TCGATCGTCC CCGGCGCGTG CTCGGCGTGG
AGCCGCACGG CCCTCGAGGA GATCGGCGGC TTCTGCGATG ACACCATGGC CGAGGACTGT
GACGCGACCC TGGCGCTGCA GCGACGCGGT TACCGGATCC TGCAGGAGAA CAACGCCATC
GCCGACACCG AGGCGCCGGA AACCATCCGT GCCCTTGCCA AACAACGCAA ACGGTGGACC
TACGGCAACA TCCAGGCGCT GTGGAAGCAC CGCGCCATGC TGTTCCGGCC GCGCTACGGG
GCACTGGGTC TGGTCGCGTT GCCCTACGCC GCGCTCTCGC TCATCGTGCC GCTGCTGTTC
ATGCCGCTGA CGATCGTCGC GGCCGGGATG AGCCTCGCGG CCGGCAACTG GCAGAGCATC
GCGCTGTTCG CGGGATTCGT TGCGGCACTG CACATGATCA TCTCGATCAC CGCTGTCGCG
ATGGCCCGGG AACGCGCCTG GCATCTGCTC GTCGTCCCCG TCTACCGGAT CATCTACGAG
CCGCTGCGGG CCTATCTGCT CTACGCCTCG GCCTATCGGG CCATCAAGGG CACCATCGTG
GCCTGGGACA AGTTGGAACG CAGGAACACG GTGAGCGCCT TCGTCGAACG TCACCGCCCG
ATGCCACCGA TCCTCGGGGC TCAGCAGTAG
 
Protein sequence
MFSPFRRRPD AIRGTDPLTA APGPVFFDRT GKRLYYFAAG VIVLAVIVAV LPVRVTPMAF 
DPLWTVPTNS DSGFPRRVLS TADDRRFPVL GSEDDEVLSR VVRVDRRGDQ WNLVDPFTNQ
LYRVADDDDK DRIRDSAYAI DHYGRPPDRH LMLTFDDGPD VEFTTQVLDI LSREKIPATF
FVLGSQVVLH PDVFRRMIRE GHMAGNHTMF HVDFDEHTDF RNRQEIIATD RVMRATAAYA
SRLFRIPRGD PDNNTLALLQ SQQLGYLQVD QDIDTLDWKV SPENEVAVPQ LDGRGHVVLL
HDGGGDRSAT VRMLEQLIAE AKSKGYTFST LAPLLSEQEL PVSNAEQYPA DAATYHTLRL
MEKAPGAVLG FLFWLGMGSL TVMSLLYLIL ALVCQYRQNR LRWNDIGDDQ LPMVSVVLAA
FNEEKVIART IAELRRSDYP RSRFEVVAVN DGSTDGTLRI LTELARDWPK LRVVDQANSG
KSSAINNGIN HASAVSTVMV TMDADTLFRP DTIRNLARHF ARHTHGRQVG AVAGHIKVGN
RRNLLTAWQS LEYISGICVT RMAERLLNAI SIVPGACSAW SRTALEEIGG FCDDTMAEDC
DATLALQRRG YRILQENNAI ADTEAPETIR ALAKQRKRWT YGNIQALWKH RAMLFRPRYG
ALGLVALPYA ALSLIVPLLF MPLTIVAAGM SLAAGNWQSI ALFAGFVAAL HMIISITAVA
MARERAWHLL VVPVYRIIYE PLRAYLLYAS AYRAIKGTIV AWDKLERRNT VSAFVERHRP
MPPILGAQQ