Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_5001 |
Symbol | |
ID | 4612680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 5241264 |
End bp | 5242289 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639794694 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_940980 |
Protein GI | 119871028 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.951966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATGC TCCGTACTCG TTGGCTGTCC GTGCTCTCGG TCGTGTTGGC CGCGCTACTC GCACTGGTGG GCCCGTTCTC CCCGGCCACC TCGAGCGCCG CACCCGCTGA TCCGATCGCG GCGGCGGCCA CCGTCGAGCC CGCCGTGGCG CGCATCGACA CCGAGATCGA CTACCAGAGC GCGATCGGTA CCGGCACCGG CATCGTGCTC GACCCCGGCG GTCAGGTGCT GACGAACTTC CACGTCGTCC AGGGCGCCGA CCGGGTCAAC GTCGCCGTCG CGGGCCGGTC GTATCCGGCC GAACTCGTCG GCTACGACCG TGGCCGCGAC ATCGCGGTGA TCCAGTTGCT CGGGGCGGGC GGTCTGCCGG TGGCCCCGAT CGGCGACTCC GCGGCGCTGG CGGCGGGTGA GCCCGTCGTC GCCCTCGGCA ACGCGCAGGG CTCCGCTGCC CCGCTGACCC GGGAGGTCGG CAGCGTCACC GCCTTCGGGC GGACGGTCCA GGCCGAAGAC TCGCTGACCG GCAGCTCCGA CGAACTGACC GGGTTGATCG AGTTCGCCGC ACCCGTGCGG GCCGGCGATT CGGGTGGACC GGTCGTCAAC AGCGCGGGCC AGGTCGTCGG CATCACCACC GCGGCCTCGG TGAACTACCG GATGGGCCCC GGGGGTAAGG GCTTCGCGAT CCCGATCAAC GAGGCGGTCG GCGTCGCGAA CCAGATCCGG TCGCGGATTC CGTCGGACAC CGTCCACATC GGCCCGCCCG CGCTGCTCGG CGTCGGCGTG CGGACCGCAC CTAGCGACGT GCCCGGGGTC CTCATCCAGG AGGTGCTGCG CGGCGGACCC GCCGAAGCGG CCGGTTTGAT GGACCGCGAC GTGCTGATCG CGATCAACGG CAACCGCCTC ACCTCGGCCA CCCAACTGAC CTACACGCTG GACCGGTTCT ACCCCGGCGA CGTCGTGGAC GTCACGTGGA TCGACGGCTT CGGCCAGGAG CGCACCGCGA AGGCGACGCT GGCACCCGGC CCCTAG
|
Protein sequence | MGMLRTRWLS VLSVVLAALL ALVGPFSPAT SSAAPADPIA AAATVEPAVA RIDTEIDYQS AIGTGTGIVL DPGGQVLTNF HVVQGADRVN VAVAGRSYPA ELVGYDRGRD IAVIQLLGAG GLPVAPIGDS AALAAGEPVV ALGNAQGSAA PLTREVGSVT AFGRTVQAED SLTGSSDELT GLIEFAAPVR AGDSGGPVVN SAGQVVGITT AASVNYRMGP GGKGFAIPIN EAVGVANQIR SRIPSDTVHI GPPALLGVGV RTAPSDVPGV LIQEVLRGGP AEAAGLMDRD VLIAINGNRL TSATQLTYTL DRFYPGDVVD VTWIDGFGQE RTAKATLAPG P
|
| |