Gene Mvan_4494 details


Gene Information

Locus tag: Mvan_4494
Symbol:
ID: 4645533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4829384
End bp: 4830874
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 69%
IMG OID: 639807964
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_955275
Protein GI: 120405446
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
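
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for this illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
gene_length = span_length(4829384, 4830874)
print(gene_length)                           # 1491
print(expected_protein_length(gene_length))  # 496
```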


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.18457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.102663
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGAATC TGGACCAGAC CGGTCGGGAA CGCCTCGAGC CGCGGCCGGT GTCGCGTCCG 
CCGGTTGATC CGGCGGCCCA GCGCGCCTTC GGCCGGCCCA CCGGCGTGAA CGGCTCGTTC
CTGGGTGCCG ACCAGCATCG CGACCAAGGG GAGTACACCC CGAAGGATCA GGCGCCCGAT
CCGGTGCTCG CGGAGGCGTT CGGCCGTCCG TATCAGGGCG GCGACTCGCT GCAGCGCCAT
CCGACCGACG CCGGTGCCCT CGACGCCGAA CGCGACGGCG ATGCCGACGA GCTCGACGAC
CCGTGGCGCA ACCCGGGCGC CGCGGCCGCG CTGGGGACGC CCGCGCTCGC CCCGGCCGGC
CCCACCCAGG CCGCCGCGCC TTTGGGGAAG CTCGGGGTTC GGGACGTGTT ATTCGGCGGC
AAGGTGTCCT ACGTCGCCCT CGTGGTGCTC GGACTCGTCG CGCTCGTCAT CGGGGTGGCC
GGTGGCTGGG TGGGCCGCAC CACCGCCGAG GTGGTCTCGG CGTTCACCAC GTCGAAGGTC
ACATTGGAGA CCAGTGACAC CGGCGGCGCC GAATCTGAGG GGCAGTTCGC GAAAGTTGCT
GCCGCGGTTG CCGATTCGGT CGTGACGATC GAGGCGACCA GCAAGACGGA GGGTTCGCAG
GGTTCCGGTG TGGTGGTCGA TGGCCGCGGC TACATCGTCA CCAACAACCA CGTGATCTCC
GAGGCCGCCA CCAATCCCAG TGAGTTCAAG ATGTCGGTGG TGTTCAACGA CGGTACTGAG
GTGCCCGCGA ATCTCGTCGG CCGTGATCCC AAGACCGATC TCGCGGTGCT CAAGGTCGAC
AACGTCGACA ACCTTTCCGT CGCGCGGATG GGGGATTCCG AGAAGATCCG CGTGGGTGAG
GAGGTCATCG CCGCAGGCGC CCCGCTCGGG CTCAGAAGCA CCGTCACGCA CGGCATCGTC
AGCGCCCTGC ACCGGCCCGT GCCGTTGTCG GGTGACGGGT CGGACACCGA CACCGTCATC
GACGGCGTGC AAACCGACGC GTCGATCAAC CACGGCAACT CCGGCGGTCC GCTGATCAAC
ATGAACTCCG AGGTGATCGG CATCAACACG GCCGGCAAGT CGCTGTCGGA CAGTGCCAGC
GGCCTCGGCT TCGCGATCCC GGTCAACGAG GTCAAGCAGG TTGTCGAGAC GTTGATCAAG
AACGGCAAGA TCGCGCACCC GACGCTGGGC CTGACGGCGC GGTCGGTGAG CAACGACGTG
GCCAAGGGCG CACAGATCGC CGACATCTCG CCGAACAGCC CGGCCGAGCG GGCGGGGCTG
CTGGAGAACG ACGTCGTCAT CAAGGTCGGT GACCGTGAGG TTGCAGACGC CGACGAGTTC
ATCGTCGCGG TGCGGCAGCT CGCGATCGGT CAGCCGGCTC CCATCGAGGT CCTGCGTGAC
GGGCGCCCCG TGACCCTCAC GGTGACCCCC AACGGCGACG ACAGCACGTA G
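
A GC content figure such as the one listed in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base blocks shown above; `gc_content` is a hypothetical helper name, and whitespace is stripped before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Illustration on the first two 10-base blocks of the gene sequence only;
# paste the full sequence to recompute the whole-gene value.
print(gc_content("GTGACGAATC TGGACCAGAC"))
```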
 
Protein sequence
MTNLDQTGRE RLEPRPVSRP PVDPAAQRAF GRPTGVNGSF LGADQHRDQG EYTPKDQAPD 
PVLAEAFGRP YQGGDSLQRH PTDAGALDAE RDGDADELDD PWRNPGAAAA LGTPALAPAG
PTQAAAPLGK LGVRDVLFGG KVSYVALVVL GLVALVIGVA GGWVGRTTAE VVSAFTTSKV
TLETSDTGGA ESEGQFAKVA AAVADSVVTI EATSKTEGSQ GSGVVVDGRG YIVTNNHVIS
EAATNPSEFK MSVVFNDGTE VPANLVGRDP KTDLAVLKVD NVDNLSVARM GDSEKIRVGE
EVIAAGAPLG LRSTVTHGIV SALHRPVPLS GDGSDTDTVI DGVQTDASIN HGNSGGPLIN
MNSEVIGINT AGKSLSDSAS GLGFAIPVNE VKQVVETLIK NGKIAHPTLG LTARSVSNDV
AKGAQIADIS PNSPAERAGL LENDVVIKVG DREVADADEF IVAVRQLAIG QPAPIEVLRD
GRPVTLTVTP NGDDST
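
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an alternative start codon such as GTG is read as methionine when it initiates translation. The sketch below is one way to reproduce that mapping, not the method used to produce this record. The names `translate_cds` and `START_CODONS` are hypothetical, and the start-codon set is a simplification that covers only ATG, GTG and TTG.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (same as the standard code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplified set of initiation codons (assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACGAATC TGGACCAGAC CGGTCGGGAA"))  # MTNLDQTGRE
```

The printed peptide matches the first ten residues of the protein sequence; passing the full gene sequence would translate the remainder in the same way.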