Gene Mjls_4006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4006 
Symbol 
ID4879715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4238049 
End bp4239542 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content70% 
IMG OID640141318 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_001072272 
Protein GI126436581 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.503851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCAATC AGGACCAGGC CAGCCGTCGC ATGGCACCGC GCCCCGTCGA ACGGCCTCCG 
GTCGACCCGG CTGCCCAACG CGCCTTCGGC AGGCCCAGCG GAGTCCGCGG GTCGTTCCTC
GGCGTCGACC AGCAGCGCGA CCAGGGGCAG TACACCCCGA AGGACAAGGC GCCCGACCCG
GTGCTGGCCG AGGCCTTCGG CCGTCCGCCC TACAGCGGCG GTGACTCCCT GCAGCGCCAC
CCCGCCGACG CGGGCGCGCT CGACGCCGAA CGGGCCGGTG ACCCCGGCGA CACCGAACCC
GATCCGTGGC GCGACCCGCA CGCCCCCGTC GCACTGGGCA CCCCCGCCGT CCACGCCCCC
GCACCGAGCC ACGCGCCCGC CCAGGTCGGC AAGCTCGGTG TGCGCGACGT CCTGTTCGGC
CGCAAGGTGT CCTACGCCGG TCTGGCGATC CTGCTGCTCA CCGCGCTGAT GGTCGGCGCG
CTCGGCGGCT GGGTCGGCAA CAAGACGGCC GAGACGGTGC AGGCGTTCAC CACCTCGAAG
GTCACGCTGG AGACCGGCGA CAGCGGTGAC CCGCCCGAGG GCCGCATCAC CAAGGTGGCC
GACGCGGTCG CCGACTCCGT GGTGACCATC GAGGCCAAGA GCGACCAGGA GGGCTCCCAG
GGTTCCGGTG TGGTGATCGA CGGTCGCGGC TACATCGTCA CCAACAACCA CGTGATCTCC
GAGGCCGCCA ACAACCCCGC CAAGTACAAG ATGACCGTCG TGTTCAACGA CGGTAAAGAG
GTCCCCGCCA ACCTGGTCGG CCGCGACCCG AAGACCGACC TCGCCGTGCT GAAGGTCGAC
AACGTCGACA ACCTCACCGT GGCCAAGATG GGTGACTCGG ACAAACTGCA GGTCGGTGAG
GAGGTGATCG CCGCGGGCGC CCCGCTGGGT CTGCGCAGCA CCGTCACCTC CGGCATCATC
AGCGCCCTGC ACCGGCCGGT TCCGCTGTCG GGCGACGGAT CCGACACCGA CACCGTGATC
GACGGGGTGC AGACCGACGC GTCGATCAAC CACGGCAACT CCGGCGGCCC GCTGATCGAC
ATGGACGCCA ACGTGATCGG CATCAACACC GCGGGTAAGT CGCTGTCCGA CAGCGCCAGC
GGTCTGGGCT TCGCGATCCC GGTCAACGAG GTCAAGACCG TCGTCGAGGC GTTGATCAGG
GACGGCAGGA TCGAGCATCC GACACTCGGC CTGACCGCGA AGTCCGTCAG CAACGACGTG
GCCTCCGGCG CCCAGGTCGC CAACGTCAAG GCGGGCAGCG CCGCCGAGCG GGCCGGCATC
CTGGAGAACG ACGTCGTGGT CAAGGTCGGC AACCGCGACG TCGCGGACGC CGACGAGTTC
GTGGTCGCGG TGCGTCAGCT CAAGATCAAT GAACCCGCCC CGATCGAGGT CGTCCGCGAC
GGCCGTCCGG TGACGCTCAC CGTGACGCCG ACGCCAGACG CCGCCACCGA CTGA
 
Protein sequence
MTNQDQASRR MAPRPVERPP VDPAAQRAFG RPSGVRGSFL GVDQQRDQGQ YTPKDKAPDP 
VLAEAFGRPP YSGGDSLQRH PADAGALDAE RAGDPGDTEP DPWRDPHAPV ALGTPAVHAP
APSHAPAQVG KLGVRDVLFG RKVSYAGLAI LLLTALMVGA LGGWVGNKTA ETVQAFTTSK
VTLETGDSGD PPEGRITKVA DAVADSVVTI EAKSDQEGSQ GSGVVIDGRG YIVTNNHVIS
EAANNPAKYK MTVVFNDGKE VPANLVGRDP KTDLAVLKVD NVDNLTVAKM GDSDKLQVGE
EVIAAGAPLG LRSTVTSGII SALHRPVPLS GDGSDTDTVI DGVQTDASIN HGNSGGPLID
MDANVIGINT AGKSLSDSAS GLGFAIPVNE VKTVVEALIR DGRIEHPTLG LTAKSVSNDV
ASGAQVANVK AGSAAERAGI LENDVVVKVG NRDVADADEF VVAVRQLKIN EPAPIEVVRD
GRPVTLTVTP TPDAATD