Gene Mvan_1412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1412 
Symbol 
ID4646432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1501881 
End bp1503032 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content71% 
IMG OID639804912 
Productpeptidase M50 
Protein accessionYP_952252 
Protein GI120402423 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.742384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGCACG GAGTAGCCCT GGGCCGGATG CGCGGTCTGA CCGTGACTGT CCACTGGAGT 
GTGGTGGTCA TCGTCTGGCT CTTCGCCTGG AGTCTTGCCA GCACACTCCC GGGGTCCGCC
CCCGGCTACC CGGATGCGGT GTACTGGGTC GCCGGCTTCT GCGGCGCGGC GCTGTTCGCG
GCCGGCCTCC TCGCTCACGA ACTGGCGCAC GCCGTGGTGG CAAGGCGCTC AGGGATCCCC
GTACCGGAAA TCACCCTGTG GTTGTTCGGC GGGGTGGCCC GGCTGGCAGG GGAGGCCAAG
ACACCGCGGG ACGAATTCCG GATGGCGGCA GCCGGTCCCG CGTTGAGCCT GGCCCTTTCC
GCCGTGTTCG CCGCGGTCGC GGCGGGCCTG GCCGGCACCG GGATCTCCCC GCTGGCCACC
GAGGTGGCGG CGTGGCTGGC CGCGGCCAAC GCGGTGCTCG CCGTCTTCAA CCTGCTGCCC
GGTGCACCGT TGGACGGGGG ACGGATCCTG CGTGCCTACC TCTGGCACCG GCACGGAGAT
CCCGTGCGCG CGGCCATCGG CGCGGCCCGC GCCGGGCGCG TCGTCGCCTA CCTGCTGATC
GGGCTCGGAT TGGTGGAGTT TCTGCTCGGC TCGCTCATCG GAGGCGCATG GCTGGCCTTC
ATCGGCTGGT TCCTGCTCAC CGCAGCACGT GACGAGAACG CCGCCGTCCG GGCCCGCGCG
TCGCTGGCCG GGGTCCGGGT CGCCGCCGTC ATGACGCCCA ACCCCCGCAC CGTCCCGGAG
TCACTCTCGG TGCAGCGCTT CATCGACGAT CACCTGCTCG GCGACCGCCA CTCGGCCTAT
CCGGTGACCC ACCCCGACGG CACGTGCACC GGACTCGTCA CCCTCGCGCA GGTACGGGTC
GTACCACCTG TCGAGCGCGA CACCACGCGG CTCGCCGACA TCGCCATCCC ACGGAACCGA
ATGGCCACCG CCGATCCGGG CGAACCGCTC GTCGAGGTGG TGCAGCGGCT CGACCGCAGC
ACCGGAAATC GCATCGTCGT GATGGCCGCC GACCGGGCGA TTGGCGTGGT GACCGCCGCA
GATGTCGCCA GAATGATCGA TGTGCGCAAC CTCGCCGCAG TCGGTCAGCC CGCGGGACCG
GGCAACGGGT AG
 
Protein sequence
MEHGVALGRM RGLTVTVHWS VVVIVWLFAW SLASTLPGSA PGYPDAVYWV AGFCGAALFA 
AGLLAHELAH AVVARRSGIP VPEITLWLFG GVARLAGEAK TPRDEFRMAA AGPALSLALS
AVFAAVAAGL AGTGISPLAT EVAAWLAAAN AVLAVFNLLP GAPLDGGRIL RAYLWHRHGD
PVRAAIGAAR AGRVVAYLLI GLGLVEFLLG SLIGGAWLAF IGWFLLTAAR DENAAVRARA
SLAGVRVAAV MTPNPRTVPE SLSVQRFIDD HLLGDRHSAY PVTHPDGTCT GLVTLAQVRV
VPPVERDTTR LADIAIPRNR MATADPGEPL VEVVQRLDRS TGNRIVVMAA DRAIGVVTAA
DVARMIDVRN LAAVGQPAGP GNG