Gene Mkms_4385 details


Gene Information

Locus tag: Mkms_4385
Symbol:
ID: 4612328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4605057
End bp: 4606379
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 70%
IMG OID: 639794071
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_940366
Protein GI: 119870414
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
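
The length fields above follow from the coordinates. The sketch below is a minimal illustration, not part of the database record: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(4605057, 4606379)
print(length, protein_length_aa(length))
```

With the coordinates listed in this record, the two functions give 1323 bp and 440 aa, which agrees with the Gene Length and Protein Length fields.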


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACC ACCCGAGGTA TTCGACGCCG CCGCAACAAC AGCCGGGTCA CCGCCCGGTC 
GGCCCGGACA CGGGGTATCA GGGCGCGGAC CCCTATTCGC AGCAGCAGCC CTACGACTGG
CGGTACGCGG CGCAACCGCA GCAGCAGTTC CGCGCGCCGT ATGACCCCTA CCGGGGCGCC
GCCCAGCCGA CCGCTGTGAT GCCGCAGCCG CGCCCGACAC AAAAGCGTTC GCGCGCAGGC
GCATTGACGG TCGGCGCCTT GGCGGTGGCC GTGGTGTCGG CGGGTATCGG TGGCGGTGTG
GCGACGATGG TCCAGCAGGA CCGCCCGTCC TTCGGCAGCT CTATCACGGG TGCGGCGCCG
AGCGTGCCCG CCGCCGCGCT GCCCGCGGGC TCGGTGGAGC AGGTGGCCGC CAAGGTGGTG
CCGAGTGTGG TGAAGCTGGA GACGAACCTG GGCCGGGCGT CGGAGGAGGG TTCGGGCATC
ATCCTCACCT CCGACGGTCT GATCCTGACG AACAATCACG TCGTGGCCGC GGCCGCCGAC
GGTCCCGGGG CCCCCGGCGG CGCTCAGACC AAGGTGATCC TCTCCGACGG CCGCACCACG
TCGTTCACCG TCGTCGGCAC CGATCCCAGC AGCGACATCG CGGTGGTCCG AGCCGAGAAG
GTCTCGGGCC TGACGCCGAT CACGCTGGGT TCGTCGAGCG ATCTGCGCGT CGGTCAGGAC
GTGGTCGCGA TCGGTTCGCC GCTCGGGCTC GAGGGGACGG TCACCACCGG CATCATCAGC
GCGCTGAACC GGCCGGTCGC CGCCGGCGGC GATACGCGCA ACCAGAACAC GGTTCTCGAC
GCCATCCAGA CCGACGCCGC GATCAACCCC GGTAACTCGG GTGGTGCGCT GGTGAACATG
AACGGTGAGC TGGTCGGCGT GAACTCGGCC ATCGCCACCA TGGGCGGTGA CTCGGCGCAG
GCGCAGAGCG GTTCGATCGG TCTCGGCTTC GCGATCCCCG TGGATCAGGC CAAGCGCATC
GCCGACGAGT TGATCCAGAA CGGCAGCGCC TCACACGCGT CGCTCGGGGT GCAGGTCAGC
AACGACGCCG CGACCGACGG CGCGAAGATC GTCGAGGTCA ACCAGGGTGG CGCCGCGGCG
GCGGCGGGTC TGCCCAGCGG CGTGGTGGTG ACCAAGGTCG ACGACCGGGT GATCAACAGC
GCCGATGCGC TCGTGGCGGC GGTGCGGTCC AAGGCACCCG GCGACAAGGT CACGCTGACC
TATCTCGATC CGTCGGGCAA GCCGCAGAGC GTGCAGGTGA CTCTCGGGAA GATGCAGCAG
TGA
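
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The following sketch shows one way to do that and to compute a GC percentage comparable to the GC content field. It is an illustration under assumptions: the helper names are hypothetical, and how the database rounds its reported value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record as an example input.
first_row = "ATGACGAACC ACCCGAGGTA TTCGACGCCG CCGCAACAAC AGCCGGGTCA CCGCCCGGTC"
print(len(clean_sequence(first_row)), round(gc_percent(first_row), 1))
```

Passing the complete gene sequence instead of `first_row` gives a value that can be compared with the GC content reported in the Gene Information section.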
 
Protein sequence
MTNHPRYSTP PQQQPGHRPV GPDTGYQGAD PYSQQQPYDW RYAAQPQQQF RAPYDPYRGA 
AQPTAVMPQP RPTQKRSRAG ALTVGALAVA VVSAGIGGGV ATMVQQDRPS FGSSITGAAP
SVPAAALPAG SVEQVAAKVV PSVVKLETNL GRASEEGSGI ILTSDGLILT NNHVVAAAAD
GPGAPGGAQT KVILSDGRTT SFTVVGTDPS SDIAVVRAEK VSGLTPITLG SSSDLRVGQD
VVAIGSPLGL EGTVTTGIIS ALNRPVAAGG DTRNQNTVLD AIQTDAAINP GNSGGALVNM
NGELVGVNSA IATMGGDSAQ AQSGSIGLGF AIPVDQAKRI ADELIQNGSA SHASLGVQVS
NDAATDGAKI VEVNQGGAAA AAGLPSGVVV TKVDDRVINS ADALVAAVRS KAPGDKVTLT
YLDPSGKPQS VQVTLGKMQQ
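
The protein sequence can be checked against the gene sequence by translating it with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal, dependency-free illustration: it builds the codon table from the compact NCBI-style amino-acid string, in which table 11 assigns the same residues as the standard code and differs only in its permitted start codons. The `translate` function is hypothetical, ignores alternative start codons, and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"  # NCBI ordering: first base varies slowest, third fastest
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each of the 64 codons to its one-letter residue ('*' marks a stop).
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGACGAACC ACCCGAGGTA TTCGACGCCG"))
```

For the first 30 bases this prints MTNHPRYSTP, matching the first block of the protein sequence. Running it on the full gene sequence allows the same comparison over all 440 residues.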