Gene Mkms_4066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4066 
Symbol 
ID4612006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4292228 
End bp4293733 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content69% 
IMG OID639793750 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_940048 
Protein GI119870096 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATACCAC CGGTGACCAA TCAGGACCAG GCCAGCCGTC GCATGGCACC TCGCCCCGTC 
GAGCGGCCTC CGGTCGACCC GACGGCCCAA CGCGTCTTCG GCCGGCCCAG GGGGGTCAGC
GGTTCGTTCC TCGGCGTGGA CCAGCACCGC GGCCAGGGGG AGTACGCCCC GAAGGACCAG
GCGCCCGACC CGGTGCTCGC CGAGGCGTTC GGCCGTCCGC CGTACGCCGG TGCCGACTCC
CTGCAGCGCC ATCCCGCCGA CTCGGGTGCG CTCGACGCCG AACGGGCCGG TGACACCGGC
GACGTCGAAC CCGATCCGTG GCGCGACCCG AACGCGCCCG TGGCGCTGGG CACCCCCGCC
GTCGAGGCAC CCGCACCCGT CCACGGTCCG GCACAGACCG GCAAACTCGG TGTGCGCGAC
GTGCTGTTCG GTCGCAAGGT GTCCTATGTC GGGCTGGCGA TCCTGCTGCT CACCGCGTTG
ATGGTCGGCG CGCTCGGCGG CTGGGTCGGC AACAAGACCG CCGAGACCGT GCAGGCGTTC
ACCACGTCGA AGGTCACGCT GGAGACCAGT GACAGCGGGG ACCCGCCCGA GGGACGCATC
ACCAAGGTGG CCGACGCGGT CGCCGACTCC GTGGTGACCA TCGAGGCCAA GAGCGACCAG
GAGGGCTCCC AGGGTTCCGG TGTGGTGATC GACGGTCGCG GCTACATCGT CACCAACAAC
CACGTGATCT CCGAGGCCGC CAACAACCCC GCCAAGTACA AGATGACCGT CGTGTTCAAC
GACGGTAAAG AGGTCCCCGC CAACCTGGTC GGCCGCGACC CGAAGACCGA CCTCGCCGTG
CTGAAGGTCG ACAACGTCGA CAACCTCACC GTGGCCAAGA TGGGTGACTC GGACAAACTG
CAGGTCGGTG AGGAGGTGAT CGCCGCGGGC GCCCCGCTGG GTCTGCGCAG CACCGTCACC
TCCGGCATCA TCAGCGCCCT GCACCGGCCG GTTCCGCTGT CGGGCGACGG ATCCGACACC
GACACCGTGA TCGACGGGGT GCAGACCGAC GCGTCGATCA ACCACGGCAA CTCCGGCGGC
CCGCTGATCG ACATGGACGC CAACGTGATC GGCATCAACA CCGCGGGTAA GTCGCTGTCC
GACAGCGCCA GCGGTCTGGG CTTCGCGATC CCGGTCAACG AGGTCAAGAC CGTCGTCGAG
GCGTTGATCA GGGACGGCAG GATCGAGCAT CCGACACTCG GCCTGACCGC GAAGTCCGTC
AGCAACGACG TGGCCTCCGG CGCCCAGGTC GCCAACGTCA AGGCGGGCAG CGCCGCCGAG
CGGGCCGGCA TCCTGGAGAA CGACGTCGTG GTCAAGGTCG GCAACCGCGA CGTCGCGGAC
GCCGACGAGT TCGTGGTCGC GGTGCGTCAG CTCAAGATCA ATGAACCCGC CCCGATCGAG
GTCGTCCGCG ACGGCCGTCC GGTGACGCTC ACCGTGACAC CGACGCCAGA CGCCGCCACC
GACTGA
 
Protein sequence
MIPPVTNQDQ ASRRMAPRPV ERPPVDPTAQ RVFGRPRGVS GSFLGVDQHR GQGEYAPKDQ 
APDPVLAEAF GRPPYAGADS LQRHPADSGA LDAERAGDTG DVEPDPWRDP NAPVALGTPA
VEAPAPVHGP AQTGKLGVRD VLFGRKVSYV GLAILLLTAL MVGALGGWVG NKTAETVQAF
TTSKVTLETS DSGDPPEGRI TKVADAVADS VVTIEAKSDQ EGSQGSGVVI DGRGYIVTNN
HVISEAANNP AKYKMTVVFN DGKEVPANLV GRDPKTDLAV LKVDNVDNLT VAKMGDSDKL
QVGEEVIAAG APLGLRSTVT SGIISALHRP VPLSGDGSDT DTVIDGVQTD ASINHGNSGG
PLIDMDANVI GINTAGKSLS DSASGLGFAI PVNEVKTVVE ALIRDGRIEH PTLGLTAKSV
SNDVASGAQV ANVKAGSAAE RAGILENDVV VKVGNRDVAD ADEFVVAVRQ LKINEPAPIE
VVRDGRPVTL TVTPTPDAAT D