Gene Mvan_5372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5372 
Symbol 
ID4647088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5746392 
End bp5748737 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content69% 
IMG OID639808847 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_956149 
Protein GI120406320 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00418248 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCGAA AGAATGTGAT CCGCACCCTC ACGGTGATCG CCGTGGTGCT GCTGCTGGGT 
TGGTCTTTCT TCTACTTCAG CGACGACACC CGCGGGTACA AGCCTGTCGA CACGTCGGTC
GCGATGGCCC AGATCAGCGG CGACAACGTC AAGAGCGCGC AGATCGACGA CCGTGAGCAG
CAGCTGCGGC TTGAGCTCAA GAACGGCAAC GGCGACACCG AGAACAGCGA CAAGATCATC
ACCAAGTACC CCACCGGGTA CGGGGTGCCG CTGTTCGAGG CGTTGCAGGC CAAGAACACC
AAGATCAACA CCGTCGTCAA CCAGAGCAGC ATCCTCGGCA CCCTGCTGAT CTACATGCTG
CCGCTGCTGC TGCTCGTCGG GCTGTTCGTG ATGTTCTCCC GCATGCAGAC CGGCGGCCGG
ATGGGCTTCG GGTTCGGCAA ATCGAAGGCC AAGATGCTGG GCAAGGACAT GCCCAAGACC
ACGTTCGCCG ACGTCGCGGG CGTGGACGAG GCGGTCGAGG AACTCTACGA GATCAAGGAC
TTCCTGCAGA ACCCCAGCCG CTACCAGGCG CTGGGCGCCA AGATCCCCAA GGGCGTGCTG
CTTTACGGTC CGCCCGGCAC CGGCAAGACC CTGCTGGCCC GCGCGGTCGC CGGTGAGGCG
GGGGTGCCGT TCTTCACCAT CTCCGGTTCG GACTTCGTCG AGATGTTCGT CGGCGTCGGC
GCGTCCCGCG TGCGCGACCT GTTCGAGCAG GCCAAGCAGA ACAGCCCGTG CATCATCTTC
GTCGACGAGA TCGACGCCGT CGGCCGCCAG CGCGGCGCCG GCCTGGGCGG CGGCCACGAC
GAACGCGAGC AGACGCTGAA CCAGCTGCTC GTCGAGATGG ACGGCTTCGG CGACCGCCAG
GGCGTCATCC TGATCGCCGC CACCAACCGG CCCGACATCC TCGACCCCGC GCTGCTGCGG
CCGGGACGCT TCGACCGCCA GATCCCGGTG TCCAATCCCG ACCTGGCGGG CAGGCGCGCG
GTGCTGCGCG TGCATTCGCA GGGCAAGCCG ATCGCCGACG ACGCCGACCT CGACGGTCTG
GCCAAGCGCA CCGTCGGCAT GTCCGGCGCG GACCTGGCCA ACGTCATCAA CGAGGCGGCA
CTGCTGACCG CCCGCGAGAA CGGCACGATC ATCACCGGGC CCGCCCTGGA GGAGGCCGTC
GACCGCGTCG TCGGCGGACC CCGCCGCAAG AGTCGCATCA TCAGCGAGCA CGAGAAGAAG
ATCACGGCCT ACCACGAGGG CGGGCACACG CTCGCGGCGT GGGCGATGCC CGACATCGAC
CCGATCTACA AGGTGACGAT CCTGGCCCGC GGTCGCACCG GCGGTCACGC GATGTCCGTG
CCCGAGGACG ACAAGGGCCT GATGACCCGC TCGGAGATGA TCGCGCGGCT GGTGTTCGCG
ATGGGCGGCC GCGCCGCCGA GGAGCTGGTG TTCCGCGAGC CGACCACCGG CGCGGTGTCC
GACATCGAGC AGGCCACCAA GATCGCCAGG GCGATGGTCA CCGAGTACGG CATGAGCTCC
AAGCTGGGCG CGGTGCGCTA CGGCACCGAG CACGGTGATC CGTTCCTGGG CCGCACCATG
GGCACCCAGG CCGACTACAG CCATGAGGTC GCCCAGATCA TCGACGACGA GGTCCGCAAG
CTCATCGAGG CCGCCCACAC CGAGGCGTGG GAGATCCTCA CCGAGTACCG CGATGTCCTC
GACACTCTGG CCGGCGAATT GCTGGAGAAG GAGACCCTGC ACCGGGTCGA GCTCGAGGCG
ATCTTCGGCG ACGTGAAGAA GCGTCCCCGG CTGACCATGT TCGACGACTT CGGTGGCCGG
GTGCCGTCGG ACAAGCCGCC GATCAAGACG CCGGGGGAGT TGGCGATCGA GCGCGGGGAA
CCGTGGCCCA AGCCGCTGCC CGAGCCCGCG TTCAAGACCG CCATCGCCCA GGCCTCCAGG
GCGGCCGCCG AGCAGGCTCA GAAGAACGGC GGCAACGGCT CTCAGGCATC CAACGGTGTC
CCGGGCGGTC CCACCCAGCC CGACTACGGT GCCCCCGCCG GCTGGCACGC ACCCGGCTGG
CCGCCCCAGG CCGGCGGGCC GCACCCCCCG CAGCCGCAGG GCTACTGGTA TCCGCCTCCG
CCGCCGTCGG GGTGGCAGGG GGCGCCGCAG GCGCCGGCAT ACCCGGGATA TCCGCCCTAT
CAGCCTCATT CGCAGCCCGG TCAGCCCGCT CCCCACGGCG GTGCCGCGGC GGCGAGCCCC
AAAGAGGATC CCGGCCACGA AGGCGGCGGG GAGAACGGGC GGAGCGCTCC GCCCTCCAAC
GGCTGA
 
Protein sequence
MNRKNVIRTL TVIAVVLLLG WSFFYFSDDT RGYKPVDTSV AMAQISGDNV KSAQIDDREQ 
QLRLELKNGN GDTENSDKII TKYPTGYGVP LFEALQAKNT KINTVVNQSS ILGTLLIYML
PLLLLVGLFV MFSRMQTGGR MGFGFGKSKA KMLGKDMPKT TFADVAGVDE AVEELYEIKD
FLQNPSRYQA LGAKIPKGVL LYGPPGTGKT LLARAVAGEA GVPFFTISGS DFVEMFVGVG
ASRVRDLFEQ AKQNSPCIIF VDEIDAVGRQ RGAGLGGGHD EREQTLNQLL VEMDGFGDRQ
GVILIAATNR PDILDPALLR PGRFDRQIPV SNPDLAGRRA VLRVHSQGKP IADDADLDGL
AKRTVGMSGA DLANVINEAA LLTARENGTI ITGPALEEAV DRVVGGPRRK SRIISEHEKK
ITAYHEGGHT LAAWAMPDID PIYKVTILAR GRTGGHAMSV PEDDKGLMTR SEMIARLVFA
MGGRAAEELV FREPTTGAVS DIEQATKIAR AMVTEYGMSS KLGAVRYGTE HGDPFLGRTM
GTQADYSHEV AQIIDDEVRK LIEAAHTEAW EILTEYRDVL DTLAGELLEK ETLHRVELEA
IFGDVKKRPR LTMFDDFGGR VPSDKPPIKT PGELAIERGE PWPKPLPEPA FKTAIAQASR
AAAEQAQKNG GNGSQASNGV PGGPTQPDYG APAGWHAPGW PPQAGGPHPP QPQGYWYPPP
PPSGWQGAPQ APAYPGYPPY QPHSQPGQPA PHGGAAAASP KEDPGHEGGG ENGRSAPPSN
G