Gene Mvan_5202 details

Gene Information

Locus tag: Mvan_5202
Symbol:
ID: 4645719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5568176
End bp: 5569378
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 67%
IMG OID: 639808677
Product: virulence factor Mce family protein
Protein accession: YP_955979
Protein GI: 120406150
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
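
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts one stop codon that is not part of the protein. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(5568176, 5569378)
print(length, protein_length_aa(length))  # prints: 1203 400
```

Both values agree with the Gene Length and Protein Length fields of this record.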


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.977841
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.17737
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACG GTAATGCGAA GCGCAGTCAT GTGAGGATCG CTGCGGCGAT CCTCGCTGCG 
CTGGTGCTCG CCGCAGCCGT GTTCACGTAT CTGTCCTACA CCGCGGCGTT CACCCCGACG
GACAAGGTGA CGGTGCTCTC GCCGCGGGCC GGCCTCGTCA TGGACGTCGA CGCGAAGGTC
AAGTACCGGG GGATCCAGGT CGGCAAGGTC GAATCGATCG AGTACGCCGG TGACGCCGCC
AAGCTCACGC TGGCGATCAA CCGCGGCGAC CTGCGGTACA TCCCGGCCGA CGCCCCGGTG
CGCATCGGTG GCACCACGAT CTTCGGCGCC AAGTCGGTCG AGTTCCTGCC ACCGGAGTAC
CCGAACGGCC AGGCGTTGAG CCCCGGCGCC GAGGTGAAGG CCGACTCCGT TCAGCTCGAG
GTCAACACGC TGTTCCAGAC CCTGACCGAC CTGCTGGACA AGATCGACCC GATCGAACTC
AACGCCACCC TGTCCGCGCT GGGCGAGGGT TTGCGCGGCA ACGGCGACGA CGTCGGCGCG
CTGCTGTCCG GACTGAACTA CTACGTCGGT CAGCTCAATC CGAAACTGCC CGCGCTGCAG
GAGGACCTGC GCCGCACCGC GGTCGTCGCC GACATCTACG GCGACGCCGG ACCGGACCTG
GTCCGCGTCC TCGACAACGC CCCCGCCATC AGCAAGACCA TCGTCGACGA ACAGGACAAC
CTGAACGCGG CTCTGTTGGC GGCGACCGGG CTGGCCAACA ACGGGACGGC GACTTTCGAG
CCCGCCGCCG ACGACTACAT CGCCGCGGTG CAACGCCTGC GCGCTCCGCT CAAGGTCGCC
GGGGAATACT CGCCGGTGAT CGGTTGCACG CTGAAAGGCA CGGCGAACGC GATCGACCGG
TTCGCGCCGA TCATCGGTGG GATCAGGCCC GGCCTGTTCG TCTCGTCGAA CTTCCTGCCC
GGTTCGCCCG CGTACACCTA CCCGGAGAGC CTGCCGATCG TCAACGCCTC GGGCGGCCCG
AATTGCCGTG GCCTGCCTGA TGTTCCGAGC AAGCAGTACG GCGGGTCCTG GTATCACACA
CCGTTCCTGG TCACCGACAA CGCCTATGTT CCGTACCAGC CGAACACCGA GTTGCAGTTC
GACGCGCCGT CGACGCTGCA GTTCCTGTTC AACGGCGCTT ACGCGGAAAG GGACGACTTC
TGA
 
Protein sequence
MADGNAKRSH VRIAAAILAA LVLAAAVFTY LSYTAAFTPT DKVTVLSPRA GLVMDVDAKV 
KYRGIQVGKV ESIEYAGDAA KLTLAINRGD LRYIPADAPV RIGGTTIFGA KSVEFLPPEY
PNGQALSPGA EVKADSVQLE VNTLFQTLTD LLDKIDPIEL NATLSALGEG LRGNGDDVGA
LLSGLNYYVG QLNPKLPALQ EDLRRTAVVA DIYGDAGPDL VRVLDNAPAI SKTIVDEQDN
LNAALLAATG LANNGTATFE PAADDYIAAV QRLRAPLKVA GEYSPVIGCT LKGTANAIDR
FAPIIGGIRP GLFVSSNFLP GSPAYTYPES LPIVNASGGP NCRGLPDVPS KQYGGSWYHT
PFLVTDNAYV PYQPNTELQF DAPSTLQFLF NGAYAERDDF
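
The sequence blocks above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up, together with a GC-fraction calculation. The helper names are hypothetical, and the demonstration uses only the first printed line of the gene sequence rather than the full record.

```python
def join_sequence_block(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for an empty string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGCCGACG GTAATGCGAA GCGCAGTCAT GTGAGGATCG CTGCGGCGAT CCTCGCTGCG"
dna = join_sequence_block(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Running the same two helpers on the whole gene sequence block would give values comparable to the Gene Length and GC content fields of the record.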