Gene Mvan_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4149 
Symbol 
ID4648908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4459684 
End bp4460709 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID639807616 
Productvirulence factor Mce family protein 
Protein accessionYP_954932 
Protein GI120405103 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.418161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCT CGGCTGCGAC GCTCGTCAAG TTCACCGCCT TCGGGCTCGT GATGGCCTTG 
CTGACCGCGT TCCTCTTCCT TGTGTTCAGC GACACCAGAA CAGGTGCGGC CAACGAATAC
ACCGCGGTCT TCAAGGACGC GTCCCGCCTG AAGACCGGAG ATACCGTGCG GATCGCCGGT
ATCCGGGTCG GCACCGTGAA GGACGTGGAA CTGCAGGCCG ACCGAAGCGT CCTGGTGACG
TTTGACGCCG ACCGCAACAC CGTGCTGACG ACCGGGACGA ACGCCGCGAT CCGCTACCTG
AACCTCGTCG GGGATCGATA TCTGGAGTTG GTCGACACCC CCGATTCCAC CCAGATCGTG
CCTGCAGGCG GACAGATCCC GGAAGACCGG ACCACTCCCG CACTCGATCT CGACGTGTTG
CTCGGCGGCC TCAAGCCTGT CATCCAGGGC CTGAATCCCG AAGACGTGAA CGGGCTGACG
TCAGCCTTGA TCCAGATCCT CCAGGGTCAG GGGGGAACGC TCGACTCGCT GTTCTCGAAG
ACATCCTCGT TCAGCAATTC GCTGGCCGAC AACAACCAGG TCATCGAGGA GTTGATCGTC
GATCTGCGCA CGGTGCTCGA CACGTTGTCC AAGGACGGCG AGGAATTCTC CGGAGCGATC
GACAAACTCG AGCAACTGGT CAGCGGACTG TCCTCCGACC GGGATCCCAT CGGCACCGCC
ATCACCGCAC TGGACAACGG CACCGCGTCG ATCGCGGACC TGCTCGGCCG GGGCCGGGCG
CCGTTGGCCA ACACCGTCGA CGAGATGAAC AGGCTTGCGC CGCTTGTCGA CAACGACCTC
GACCGCTTGG ACGCCACGCT TCAGCGTCTG CCGGAGATCT ACCGAAAGCT GGCGCGCGTG
GGTTCGTACG GTGCATGGTT CCCCTACTAC ATCTGCGGTA TCTCGTTCCG CGCCAGCGAT
CTTGAGGGAC GCACGGTGGT CTTCCCCTGG ATCAAACAAG AAGAGGGAAG GTGCGTGGAC
GAATAA
 
Protein sequence
MTRSAATLVK FTAFGLVMAL LTAFLFLVFS DTRTGAANEY TAVFKDASRL KTGDTVRIAG 
IRVGTVKDVE LQADRSVLVT FDADRNTVLT TGTNAAIRYL NLVGDRYLEL VDTPDSTQIV
PAGGQIPEDR TTPALDLDVL LGGLKPVIQG LNPEDVNGLT SALIQILQGQ GGTLDSLFSK
TSSFSNSLAD NNQVIEELIV DLRTVLDTLS KDGEEFSGAI DKLEQLVSGL SSDRDPIGTA
ITALDNGTAS IADLLGRGRA PLANTVDEMN RLAPLVDNDL DRLDATLQRL PEIYRKLARV
GSYGAWFPYY ICGISFRASD LEGRTVVFPW IKQEEGRCVD E