Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_4066 |
Symbol | |
ID | 4612006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 4292228 |
End bp | 4293733 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639793750 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_940048 |
Protein GI | 119870096 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATACCAC CGGTGACCAA TCAGGACCAG GCCAGCCGTC GCATGGCACC TCGCCCCGTC GAGCGGCCTC CGGTCGACCC GACGGCCCAA CGCGTCTTCG GCCGGCCCAG GGGGGTCAGC GGTTCGTTCC TCGGCGTGGA CCAGCACCGC GGCCAGGGGG AGTACGCCCC GAAGGACCAG GCGCCCGACC CGGTGCTCGC CGAGGCGTTC GGCCGTCCGC CGTACGCCGG TGCCGACTCC CTGCAGCGCC ATCCCGCCGA CTCGGGTGCG CTCGACGCCG AACGGGCCGG TGACACCGGC GACGTCGAAC CCGATCCGTG GCGCGACCCG AACGCGCCCG TGGCGCTGGG CACCCCCGCC GTCGAGGCAC CCGCACCCGT CCACGGTCCG GCACAGACCG GCAAACTCGG TGTGCGCGAC GTGCTGTTCG GTCGCAAGGT GTCCTATGTC GGGCTGGCGA TCCTGCTGCT CACCGCGTTG ATGGTCGGCG CGCTCGGCGG CTGGGTCGGC AACAAGACCG CCGAGACCGT GCAGGCGTTC ACCACGTCGA AGGTCACGCT GGAGACCAGT GACAGCGGGG ACCCGCCCGA GGGACGCATC ACCAAGGTGG CCGACGCGGT CGCCGACTCC GTGGTGACCA TCGAGGCCAA GAGCGACCAG GAGGGCTCCC AGGGTTCCGG TGTGGTGATC GACGGTCGCG GCTACATCGT CACCAACAAC CACGTGATCT CCGAGGCCGC CAACAACCCC GCCAAGTACA AGATGACCGT CGTGTTCAAC GACGGTAAAG AGGTCCCCGC CAACCTGGTC GGCCGCGACC CGAAGACCGA CCTCGCCGTG CTGAAGGTCG ACAACGTCGA CAACCTCACC GTGGCCAAGA TGGGTGACTC GGACAAACTG CAGGTCGGTG AGGAGGTGAT CGCCGCGGGC GCCCCGCTGG GTCTGCGCAG CACCGTCACC TCCGGCATCA TCAGCGCCCT GCACCGGCCG GTTCCGCTGT CGGGCGACGG ATCCGACACC GACACCGTGA TCGACGGGGT GCAGACCGAC GCGTCGATCA ACCACGGCAA CTCCGGCGGC CCGCTGATCG ACATGGACGC CAACGTGATC GGCATCAACA CCGCGGGTAA GTCGCTGTCC GACAGCGCCA GCGGTCTGGG CTTCGCGATC CCGGTCAACG AGGTCAAGAC CGTCGTCGAG GCGTTGATCA GGGACGGCAG GATCGAGCAT CCGACACTCG GCCTGACCGC GAAGTCCGTC AGCAACGACG TGGCCTCCGG CGCCCAGGTC GCCAACGTCA AGGCGGGCAG CGCCGCCGAG CGGGCCGGCA TCCTGGAGAA CGACGTCGTG GTCAAGGTCG GCAACCGCGA CGTCGCGGAC GCCGACGAGT TCGTGGTCGC GGTGCGTCAG CTCAAGATCA ATGAACCCGC CCCGATCGAG GTCGTCCGCG ACGGCCGTCC GGTGACGCTC ACCGTGACAC CGACGCCAGA CGCCGCCACC GACTGA
|
Protein sequence | MIPPVTNQDQ ASRRMAPRPV ERPPVDPTAQ RVFGRPRGVS GSFLGVDQHR GQGEYAPKDQ APDPVLAEAF GRPPYAGADS LQRHPADSGA LDAERAGDTG DVEPDPWRDP NAPVALGTPA VEAPAPVHGP AQTGKLGVRD VLFGRKVSYV GLAILLLTAL MVGALGGWVG NKTAETVQAF TTSKVTLETS DSGDPPEGRI TKVADAVADS VVTIEAKSDQ EGSQGSGVVI DGRGYIVTNN HVISEAANNP AKYKMTVVFN DGKEVPANLV GRDPKTDLAV LKVDNVDNLT VAKMGDSDKL QVGEEVIAAG APLGLRSTVT SGIISALHRP VPLSGDGSDT DTVIDGVQTD ASINHGNSGG PLIDMDANVI GINTAGKSLS DSASGLGFAI PVNEVKTVVE ALIRDGRIEH PTLGLTAKSV SNDVASGAQV ANVKAGSAAE RAGILENDVV VKVGNRDVAD ADEFVVAVRQ LKINEPAPIE VVRDGRPVTL TVTPTPDAAT D
|
| |