Gene Mvan_5925 details

Gene Information

Locus tag: Mvan_5925
Symbol:
ID: 4647536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 6316311
End bp: 6319304
Gene Length: 2994 bp
Protein Length: 997 aa
Translation table: 11
GC content: 65%
IMG OID: 639809400
Product: WD40 domain-containing protein
Protein accession: YP_956694
Protein GI: 120406865
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0823] Periplasmic component of the Tol biopolymer transport system
TIGRFAM ID:
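
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6316311
end_bp = 6319304

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2994, matching "Gene Length"
print(protein_length)  # 997, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.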


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACC TCCCCGGAAC CGGCGCCCCG AATTCGACCC CACCAAGGCG GCCGGCAAGT 
AGCGCTTGTC CTACAACACA ATCCGCGCCT GGAATGAGCG ACCAGCGAAG CGAGCCGTCG
GGGCTGAACA TCGGGCACAT CGGTAAGCGT GCCGCCCAGG TGGGTGGGCT GGCAGTTGCC
GCCTATGTCA TTACTGCCGT AGCCGCAGGG CACGGGGTGG CGGCCGCCGA CTCGGGGGAC
GCGGGCCCAG CTGGTTCGGT CACTTCTTCG GCGCAGCGTG ACAGCACGGA TGCGAACGCC
ACCGACAACG ACGAAAGCAC CACTCGCTCG TCGGCGCACA ACCGATCCGA CGCGGATGGA
TCTGAGGCAG CCGAGGCCGA GAACTCCTCG GATGACCGCC TGGACGTGGA CTCGTCTGCG
CCCGGGGACT CGGACAGGAC GCCGACGACG GACGAGCACC TGGACGCCGT GCCGGAGCCG
GACGCGGCCG TCGACCCCAC GAGCGACGCC GAAACCTCCG CCGACGAGTC GACAGACACG
CCGGCCTCCC CCTCCGATCA GCACCCGGTG CACTCAGACA CCGGCGAGGA ACTGCCTTCC
GCAGACGTCG ACGAAGCGAC GACCGACGTC GCGGCCCCGG CGGGTGCCGC CGAAACGCCG
ACAGCCGCCG AGACTGCGCA GACGTCCGAG CAGCAGCCGG CCCCTACGTC GGCCTCGTCG
CCCCAAGGCA CAGAAACGTC GATCGAGACA CCCACCGCTG CGGCAACGAG CGTCGTCACC
CTCGCGCCGA ACCCGACGAC TCCGCTGCGG GCGCCGATGG AGTGGAACCT GCTCGGTTGG
ATGCGGCGGA CGTTCTTCAA CCGACCGCCG ACCATCGACT ATGACGCCGG CTTGAACACT
CAAGACATGA GCAACGGCGT CATCACCGGC AACATCGGCG CCGTCGATCC TGATGGCGAC
CGCTTGACCT ACAAGCTGAT TGCGGGTCCG CAGTACGGCA CCGTGGTCAT CGACCCACTC
ACAGGGAACT TCACCTACAC GCCAACCTCA TACTTCGCCC AGCGCGGCGG CGCCGTCGGG
TTCACTGTCA GGGTCAGTGA CCACCGGCCC CATCTGCTGT CGTTCCTGTT CTCTCCGGAT
CGCGGCAGCC CGACAGCCCG GATCGCTGTC AACGTCGAGC CGTTGTCTGA CCCCGCGTCG
ACGCCGCCCT TGGAGGCATT CGAGGTCGAG AAGGTGATCG AGTTCGACCT GCCGGACGGT
GTCAAGGTCA CCGGCGCGGA TCTGAGCCCG GACGGCGAGC ACCTCATCCT CGAGGTCGAA
GTCGCCGACG GCAGCACCCA GATCGCCGTG ACCAACCTCG ATGGCGCTGA CTACCAATGC
ATTTCCTGTG GTTTGGTCCC TAACGCCACC AAAGCCAAGG CGCTTTACGA CAACCAGAGA
ATCTGGTTCG CAAGCACCAA CGGGCAGCAG TCCGCCGATG ATCCGCTCGG GGGAGCCGGA
GCCATCACCT ACATGGTGCT GGAGTGCGAG GGATCGATCC ACGCCTGTCA GAACCCCACC
GTCAAGAAGG TGAAGTTCCC ATCAGATCGA GGCCTGTTGG GGCTGCTGCC CGTGCAGAAT
CGAGAGGCCA AGCCCGACCC GTACGGCGAG TACGTGACGT GGACCGAGAA CACGATTTTC
AACGGCCCGC GGATGAGCAT CGCCAAGCTG GTCGCCACCG CCGACGGATA CAAACTCGTC
GAGCAACGCA TCTTCAGTCC GCAGTGGTAC GAGGGCACCG ACTACGCCAC GGACTTCGGC
AACGCCACGC GATTCTACGA GGGGGCAAGT TGGCACGCGG GCGGGCGATA TCTGAAGTAC
CAGACGACCA CCACCGGCCT GAATTACGAC ATCTACCTGC TGGACACCGC CACCGGTGAG
CGGCGTCAAC TCACCACCGA CCTCGACTAC AACGAATCCG GTGACATCTC TCCCGACGGC
CGATCGGTCT ACTTCAGCTC GGCGCGCGGC CTCGACCGGA TGGACGTGTT CACCGCCCTG
GAGCGGCCGT CGTTGATCGA CAGCGCGGCG TTCCCTCAGA TCGGGCGCGT GAGCCTGTGG
AACAACCGGC GCTCCATGAA CGAGCCCTGG TTGATGAACC TCGACGCCGG CCAGCAGCAG
GGCGGCTACT CGGGTCAGCC CATCGTCATC GACCCAGACT GGACGATTCG CGGGTGGAGC
TGGTTCCCCG ATTCGACGCG TGCCCTGATC AACGAGCAGC AGCGTCCGGA CACCGTGACC
GGGGGAGGAG CACCCGACAC CCCCTGGCGC GTCAGCATCA TCAGCTTCCC CTCTCGGGAA
GCAACGACCC CGCTCCCGCC CGTGCACCAG GATCCCGACG CCATCGCCGA GTGGTCGGTA
CCGGTCAAGG AGTACAAGCC GATGATGGGC CGCCAGGACA CCCGGGTCCT CAAGGGCACG
CATTCCGGAA CCGCGACTCT GCGATACAGC GGGATCTTCG CCTTCGGCAC GTACTCGGTG
ACCTACAAGA ATTACTCGGA CGACGGCAAG ACCTTCATCA ACGGCACCGA GAAGGTCAAG
ATCAAAAATC CGATGGGTGA CTCCGTATGG AGCGCCGATC TGACCAGCAC CGGTGAACGC
ACCGGCTATC TGAAGGGCAA GATCAACATC GGCAAGGGCA ACGCGTTTTC GGGCGAGGTG
GCCTCCGAGA TCAATGGCAT CACCTACAGC GGCGTACCGA CCCAGGCGGA CTTCCCGACC
ATCGAGCAGC CTCAACTCGC GGTCTCGGCA TCCGGTGACA GACTGCGGGT GACCGCGACG
GTGGCTGAGG ACAGCCAGGC CCGACCGGTG CGCGGTGCGA CCGTGACCAT TGGGTCGGTC
ACGGCGACCA CCGACGAGCA GGGCTACGTC CAGCTACCGT TCGCCCCCGG CGACACCGTC
ACGGCGAACG CCGGCGGCTT CCGTTCGGCC AGTCACCAGG TGCCGTTCAT CTGA
 
Protein sequence
MADLPGTGAP NSTPPRRPAS SACPTTQSAP GMSDQRSEPS GLNIGHIGKR AAQVGGLAVA 
AYVITAVAAG HGVAAADSGD AGPAGSVTSS AQRDSTDANA TDNDESTTRS SAHNRSDADG
SEAAEAENSS DDRLDVDSSA PGDSDRTPTT DEHLDAVPEP DAAVDPTSDA ETSADESTDT
PASPSDQHPV HSDTGEELPS ADVDEATTDV AAPAGAAETP TAAETAQTSE QQPAPTSASS
PQGTETSIET PTAAATSVVT LAPNPTTPLR APMEWNLLGW MRRTFFNRPP TIDYDAGLNT
QDMSNGVITG NIGAVDPDGD RLTYKLIAGP QYGTVVIDPL TGNFTYTPTS YFAQRGGAVG
FTVRVSDHRP HLLSFLFSPD RGSPTARIAV NVEPLSDPAS TPPLEAFEVE KVIEFDLPDG
VKVTGADLSP DGEHLILEVE VADGSTQIAV TNLDGADYQC ISCGLVPNAT KAKALYDNQR
IWFASTNGQQ SADDPLGGAG AITYMVLECE GSIHACQNPT VKKVKFPSDR GLLGLLPVQN
REAKPDPYGE YVTWTENTIF NGPRMSIAKL VATADGYKLV EQRIFSPQWY EGTDYATDFG
NATRFYEGAS WHAGGRYLKY QTTTTGLNYD IYLLDTATGE RRQLTTDLDY NESGDISPDG
RSVYFSSARG LDRMDVFTAL ERPSLIDSAA FPQIGRVSLW NNRRSMNEPW LMNLDAGQQQ
GGYSGQPIVI DPDWTIRGWS WFPDSTRALI NEQQRPDTVT GGGAPDTPWR VSIISFPSRE
ATTPLPPVHQ DPDAIAEWSV PVKEYKPMMG RQDTRVLKGT HSGTATLRYS GIFAFGTYSV
TYKNYSDDGK TFINGTEKVK IKNPMGDSVW SADLTSTGER TGYLKGKINI GKGNAFSGEV
ASEINGITYS GVPTQADFPT IEQPQLAVSA SGDRLRVTAT VAEDSQARPV RGATVTIGSV
TATTDEQGYV QLPFAPGDTV TANAGGFRSA SHQVPFI
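
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the compact 64-letter amino-acid string used for NCBI translation tables; translation table 11 shares its codon-to-amino-acid assignments with the standard code and differs only in the permitted start codons, which this sketch does not model. The helper name `translate` and the constants are illustrative, and only the first and last lines of the gene sequence are translated here.

```python
from itertools import product

# Codon order T, C, A, G at each position, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGCTGACC TCCCCGGAAC CGGCGCCCCG AATTCGACCC CACCAAGGCG GCCGGCAAGT"
last_line = "ACGGCGAACG CCGGCGGCTT CCGTTCGGCC AGTCACCAGG TGCCGTTCAT CTGA"

print(translate(first_line))  # MADLPGTGAPNSTPPRRPAS
print(translate(last_line))   # TANAGGFRSASHQVPFI*
```

The two outputs correspond to the start and end of the protein sequence listed above. Each full line of the gene sequence holds 60 bases, so every line begins in frame and can be translated on its own.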