Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_4188 |
Symbol | |
ID | 4612128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 4414555 |
End bp | 4416099 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639793872 |
Product | sulfatase |
Protein accession | YP_940170 |
Protein GI | 119870218 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAACT CCACACCCAA CATCCTCGTG ATCTGGGGTG ACGACATCGG GATCAGCAAC CTCAGTTGCT ATAGCCGCGG CATGATGGGG TACCGCACAC CCAACATCGA CCGGATCGCC GACGAGGGCA TGCTCTTCAC CGACTCTTAC GGCGAGCAGA GCTGCACCGC GGGCCGGTCG TCGTTCATCA CCGGCCAGAG CGTCTACCGC ACCGGCATGA GCAAGGTCGG GATGCCCGGC GTCGACATCG GACTGCAGAA GGAGGACCCG ACCATCGCCG AGCTGCTCAA ACCGTTGGGT TACGCCACCG GGCAGTTCGG CAAGAACCAC CTCGGTGACC TCAACAAGTA CCTTCCGACC GCCCATGGGT TCGACGAGTT CTTCGGCAAT CTGTACCACC TCAACGCCGA GGAGGAACCC GAGAACGCCG ACTACCCGAC CGAGGAGGAG GCACCGGTGA TGCGTCGGGC ATTGTTGCCG CGCGGCGTCA TCCACTCCTG GGCCACCGAG GAGGATTCGG GCGAGGTCGA TGACCGGTAC GGCCCGGCGG GAAAGCAGCG CATCGAGGAC ACCGGACCGC TGACCAAGAA GCGGATGGAG ACCATCGACG ACGAAACCAC GGACGCCTGT GTTGATTTCA TCACCCGTGC GCACGGGACC GGCACCCCGT TCTTCGTGTG GATGAACATG ACGCACATGC ACTTCCGGAC GCACACCAAG CCGGAGAGCC TGGGACAAGC CGGGCGCTGG CAGTCGCCGT ACCACGACAC GATGATCGAC CACGACCGCA ACGTCGGTCA GCTACTCGAC CTGCTCGACG AGCTGGGTAT CGCCGACGAC ACCATCGTCA TCTACTCCAC CGACAACGGC CCGCATGCCA ACAGCTGGCC CGACGGTGCC ACCACACCGT TCCGCAGCGA GAAGGCCACC AACTGGGAGG GCGCTTTCCG GATCCCGGAA CTCATTCGCT GGCCCGGCAA GATCGAACCG CGCAGTGTGT CCAATGAGAT TGTGCAGCAT CACGATTGGC TTCCGACCTT CCTGGCCGCC GCCGGTGACC CCGACATCGT CGACAAGCTC AAAGCCGGGC ACACGATCGG GGACATCACG TACAAGGTGC ACATCGACGG GTACAACCTG GTGCCCTATC TGACCGGCGA GGTGGCCAAG AGCCCGCGCC GCGGAATGAT CTACTTCTCC GACGACTGCG ACGTACTCGG TATCCGCGCG GAGAACTGGA AGGTGGTCTT CCAGGAGCAG CGTTGCCAGG GAACCCTGCA GATCTGGTTC GAGCCGTTCA CCCCGCTGCG GGCGCCGAAA CTGTTCAACC TGCGCACCGA TCCGTACGAG CGCGCCGACA TCACGTCGAA CACCTACTGG GACTGGGTCA TCGACCGCAT CTACCTGGTG CTCTACGGAT CTGCAATCGC GACTCAGTTC CTCGAGACGT TCAAGGAGTT CCCGCCGCGC CAGGAACCGG CGTCCTTCAC CATCGACCAC GCGGTCGATG AGCTCAACAA GTTCCTGTCC ACCCGAGGCG GCTGA
|
Protein sequence | MPNSTPNILV IWGDDIGISN LSCYSRGMMG YRTPNIDRIA DEGMLFTDSY GEQSCTAGRS SFITGQSVYR TGMSKVGMPG VDIGLQKEDP TIAELLKPLG YATGQFGKNH LGDLNKYLPT AHGFDEFFGN LYHLNAEEEP ENADYPTEEE APVMRRALLP RGVIHSWATE EDSGEVDDRY GPAGKQRIED TGPLTKKRME TIDDETTDAC VDFITRAHGT GTPFFVWMNM THMHFRTHTK PESLGQAGRW QSPYHDTMID HDRNVGQLLD LLDELGIADD TIVIYSTDNG PHANSWPDGA TTPFRSEKAT NWEGAFRIPE LIRWPGKIEP RSVSNEIVQH HDWLPTFLAA AGDPDIVDKL KAGHTIGDIT YKVHIDGYNL VPYLTGEVAK SPRRGMIYFS DDCDVLGIRA ENWKVVFQEQ RCQGTLQIWF EPFTPLRAPK LFNLRTDPYE RADITSNTYW DWVIDRIYLV LYGSAIATQF LETFKEFPPR QEPASFTIDH AVDELNKFLS TRGG
|
| |