Gene Mvan_4842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4842 
Symbol 
ID4646408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5178231 
End bp5179601 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content72% 
IMG OID639808313 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_955621 
Protein GI120405792 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.369386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC AGCCCAGTTA CTCGCCGACT CCGCAACCGG GTCGCCAGTC CGGATATCCC 
GGGCAGGTGC CCCCCGGTCA TGCTGGACCG CGGACGGGGT CCTACGAGCA GCAGGGTAGC
GGCGGTTGGG ATTGGCGGTA CGCCACCGAG CAGCAGCGCC AGGCGTTCCG CGCGCCCTAC
GATCCGTATG CCGGCGCTCC CATGCCGTCC GGTCCCGGTC ACCACCCGCG CCCCGGTATG
CCTGTGACCC CTCCGCGGCA GCCTCGGAAG GGCTCCCGTG CAACGGCTTT GGCTGCCGGC
GCGGTCGCGG TCGCACTTGT CTCCGGGGGT ATCGGTGGCG GCGTCGCGAT GCTGGTGCAC
CCCGACCACG GCGTCCCGGG CATCAGCGCG TCAGGCGCGG CGCCCGGCAT GCCCGCCGCG
AGCGTGCCGG CGGGTTCGGT GGAGGCGGTG GCCGCCGCGG TGGTGCCCAG CGTGGTCAAG
CTCGAGGTCA GCCAGGGCCG GGCATCGGAG GAGGGCTCCG GGGTGATCCT GTCGACCGAC
GGGCTGATCC TGACCAACAA CCACGTCGTC GCCACCGCCG CAGGTGCCGC GGGCGAACCC
GGAGGCCCGG CGAAGACCAA GGTCACCTTC GCCGACGGCA AGACCGCACC GTTCACGGTG
ATCGGCGCCG ACCCCAGCAG TGACATCGCG GTGGTCCGCG CCCAGAACGT GTCCGGCCTT
ACCCCCATCA CCGTCGGGTC CTCGGCAGAT CTGCGGGTCG GCCAGGACGT GGTCGCCATC
GGCTCACCGC TCGGGCTGGA GGGCACCGTC ACCACCGGCA TCATCAGCGC GCTGAACCGG
CCGGTGGCCG CCGGCGGCGA CGCGCGCAAC CAGAACACCG TGCTCGACGC CATCCAGACC
GATGCCGCGA TCAACCCCGG TAACTCCGGC GGGGCTCTGG TCAACATGAA CGGCGAACTG
GTCGGCATCA ACTCGGCGAT CGCCACGATG GGTGCCGACG CGGGCGCACA GCAGGGCGGT
TCGATCGGTC TGGGCTTCGC GATCCCGGTC GACCAGGCCA AGCGGATCGC CGACGAGATC
ATCCAGACCG GCTCCGCGTC GCGCGCCTCG CTGGGCGTCC AGGTCGGCAA CGAGGCCGGC
GTCGACGGCG CCAAGATCGT CGAGGTCACC GCAGGCGGGG CGGCGTCGGC CGCGGGACTG
CCCAGCGGCG TGATCGTCAC CAAGGTCGAC GACCGGCTGA TCAACAGTGC GGACGCGTTG
GTGGCCGCGG TGCGGTCCAA GGCGCCCGGT GACAAGGTGA CGCTGACCTA CCTCGATCCG
GCGGGCAAGT CGCAGTCGCT GGACGTCACG CTCGGCAAGG CCGGCCAGTG A
 
Protein sequence
MTNQPSYSPT PQPGRQSGYP GQVPPGHAGP RTGSYEQQGS GGWDWRYATE QQRQAFRAPY 
DPYAGAPMPS GPGHHPRPGM PVTPPRQPRK GSRATALAAG AVAVALVSGG IGGGVAMLVH
PDHGVPGISA SGAAPGMPAA SVPAGSVEAV AAAVVPSVVK LEVSQGRASE EGSGVILSTD
GLILTNNHVV ATAAGAAGEP GGPAKTKVTF ADGKTAPFTV IGADPSSDIA VVRAQNVSGL
TPITVGSSAD LRVGQDVVAI GSPLGLEGTV TTGIISALNR PVAAGGDARN QNTVLDAIQT
DAAINPGNSG GALVNMNGEL VGINSAIATM GADAGAQQGG SIGLGFAIPV DQAKRIADEI
IQTGSASRAS LGVQVGNEAG VDGAKIVEVT AGGAASAAGL PSGVIVTKVD DRLINSADAL
VAAVRSKAPG DKVTLTYLDP AGKSQSLDVT LGKAGQ