Gene Mjls_4679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4679 
Symbol 
ID4880378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4914227 
End bp4915549 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content70% 
IMG OID640141984 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_001072935 
Protein GI126437244 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0383364 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC ACCCGAGGTA TTCGCCGCCG CCGCAACAAC AGCCGGGTCA CCGCCCGGTC 
GGCCCGGACA CGGGGTATCA GGGCGCGGAC CCCTACTCGC AGCAGCAGCC CTACGACTGG
CGGTACGCGG CGCAACCGCA GCAGCAGTTC CGCGCGCCGT ATGACCCCTA CCGCGGCGCC
GCCCAGCCGA CCGCTGTGAT GCCGCAGCCG CGCCCGACGC AAAAGCGTTC GCGCGCAGGC
GCATTGACGG TCGGCGCCTT GGCGGTGGCC GTGGTGTCGG CGGGTATCGG TGGCGGTGTG
GCGACGATGG TCCAGCAGGA CCGCCCGTCC TTCGGCAGCT CTATCACGGG TGCGGCGCCG
AGCGTGCCCG CCGCCGCGCT GCCCGCGGGC TCGGTGGAGC AGGTGGCCGC CAAGGTGGTG
CCGAGTGTGG TGAAGCTGGA GACGAACCTG GGCCGGGCGT CGGAGGAGGG TTCGGGCATC
ATCCTCACCT CCGACGGTCT GATCCTGACG AACAACCACG TCGTGGCCGC GGCCGCCGAC
GGTCCCGGGG CCCCCGGCGG CGCTCAGACC AAGGTGATCC TCTCCGACGG CCGCACCACG
TCGTTCACCG TCGTCGGCAC CGATCCCAGC AGCGACATCG CGGTGGTCCG AGCCGAGAAG
GTCTCGGGCC TGACGCCGAT CACGCTGGGT TCGTCGAGCG ATCTGCGCGT CGGTCAGGAC
GTGGTCGCGA TCGGTTCGCC GCTCGGGCTC GAGGGGACGG TCACCACCGG CATCATCAGC
GCGCTGAACC GGCCGGTCGC CGCCGGCGGC GATACGCGCA ACCAGAACAC GGTCCTCGAC
GCCATCCAGA CCGACGCCGC GATCAACCCC GGTAACTCGG GTGGTGCGCT GGTGAACATG
AACGGTGAGC TGGTCGGCGT GAACTCGGCC ATCGCCACCA TGGGCGGTGA CTCGGCGCAG
GCGCAGAGCG GTTCGATCGG TCTCGGCTTC GCGATCCCCG TGGATCAGGC CAAGCGCATC
GCCGACGAGT TGATCCAGAA CGGCAGCGCC TCACACGCGT CGCTCGGGGT GCAGGTCAGC
AACGACGCCG CGACCGACGG TGCGAAGATC GTCGAGGTCA ACCAGGGTGG CGCCGCGGCG
GCGGCGGGTC TGCCCAGCGG CGTGGTGGTG ACCAAGGTCG ACGACCGGGT GATCAACAGC
GCCGATGCGC TCGTGGCGGC GGTGCGGTCC AAGGCACCCG GCGACAAGGT CACGCTGACC
TATCTCGATC CGTCGGGCAA GCCGCAGAGC GTGCAGGTGA CTCTCGGGAA GATGCAGCAG
TGA
 
Protein sequence
MTNHPRYSPP PQQQPGHRPV GPDTGYQGAD PYSQQQPYDW RYAAQPQQQF RAPYDPYRGA 
AQPTAVMPQP RPTQKRSRAG ALTVGALAVA VVSAGIGGGV ATMVQQDRPS FGSSITGAAP
SVPAAALPAG SVEQVAAKVV PSVVKLETNL GRASEEGSGI ILTSDGLILT NNHVVAAAAD
GPGAPGGAQT KVILSDGRTT SFTVVGTDPS SDIAVVRAEK VSGLTPITLG SSSDLRVGQD
VVAIGSPLGL EGTVTTGIIS ALNRPVAAGG DTRNQNTVLD AIQTDAAINP GNSGGALVNM
NGELVGVNSA IATMGGDSAQ AQSGSIGLGF AIPVDQAKRI ADELIQNGSA SHASLGVQVS
NDAATDGAKI VEVNQGGAAA AAGLPSGVVV TKVDDRVINS ADALVAAVRS KAPGDKVTLT
YLDPSGKPQS VQVTLGKMQQ