Gene Mvan_0138 details

Gene Information

Locus tag: Mvan_0138
Symbol:
ID: 4647032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 151455
End bp: 152600
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 66%
IMG OID: 639803649
Product: virulence factor Mce family protein
Protein accession: YP_950995
Protein GI: 120401166
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
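
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no amino acid.
    return gene_length_bp // 3 - 1


gene_len = span_length(151455, 152600)
print(gene_len)                  # 1146
print(protein_length(gene_len))  # 381
```

Both results agree with the Gene Length and Protein Length fields listed for Mvan_0138.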


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAAGT GGACCAGATT GGCGCGCCGC ACCGTGGCGC TTGCCGCGGC GGCAGTGGTG 
CTGACGTCGT GCGGATCGTG GAAGGGCATC TCGAACGTGC CGCTGCCGGG TGGGCCCGGC
AGCGGTTCGG AGCGCACCAC GATCTACGTC CAGATGCCGG ACACGTTGGC GCTGAACGTG
AACAGCCGGG TCAGGGTCGC CGACGTGTAC GTCGGGCGGG TCCGCGCCAT CGAGCTGAAG
AACTGGATCG CGACGCTCAC GCTGGACCTG GAGCCGTCGG TGCAGTTGCC GGTCAACACG
CTGGCCAGGA TCGGCCAGAC CAGCCTGCTC GGTTCGCAGC ATGTCCAGCT GGACCTGCCG
CCGGATCCGT CACCGCAGAA GTTGAAGAGC GGCGATGTGA TCCCGTTGGC GAACGCGTCG
GCCTTCCCGA CCACCGAGCG CGTCCTCGCC AGCATCTCGT CCATCCTGAC CGGTGGCGGC
GTGGCCAACC TCGAGACGAT CCAGACCGAG ATCTACAACG TCCTCAACGG CCGCGCGGAT
CAGATCCGGG AGTTCCTCGG CAGGCTGGAC ACCTTCACCG AAGAATTGAA CCGGCAGCGC
GACGACATCA CCCGCGCCAT CGACTCGACG AACCGGCTGC TGGCGATCGT CGGTCAGCGC
AACCAGACCC TCGATGCGGT GCTGACCGAG TTCCCGCCGC TGATCCAGCA TTTCGCGGAT
ACCAGGGACC TGTTCGCCGA TGCCGTGGAG TCGCTGGGCC GCATCAGCAA CGCCGCCGTG
GATGCGCTCG CACCCGCCAG CGACGACATC AACACCAACC TGGCCAACCT CCAGCGGCCA
CTGCGCGAGC TGGGCAAGTC CGGGCCGTAT CTGCTCGGTG CGCTGAAGAT CTTCCTGACC
GCGCCGTACA ACATCGAGAA CGTGCCGAAG GCGATCCGCG GCGACTACAT CAACGTGTCG
CTGACGGTCG ACCTGACGCT GTCGGCGATC GACAACGGGT TCTTCTCCGG CACAGGCATC
TCGGGCATGC TGCGGGCGCT CGAGCAGGCG TGGGGTCGCG ACCCGGCCAC GATGATCCCG
GATGTGCGCT TCACGCCGAA CCCGAACTCG GTGCCGGGCG GACCTCTCAT CGAGAGGAGT
GAGTGA
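
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before the sequence can be measured. The following is a minimal sketch of that cleanup together with a GC percentage calculation. The helper names are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record above.
first_line = "ATGGGAAAGT GGACCAGATT GGCGCGCCGC ACCGTGGCGC TTGCCGCGGC GGCAGTGGTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running the same two functions on the complete sequence gives values that can be compared with the Gene Length and GC content fields of the record.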
 
Protein sequence
MGKWTRLARR TVALAAAAVV LTSCGSWKGI SNVPLPGGPG SGSERTTIYV QMPDTLALNV 
NSRVRVADVY VGRVRAIELK NWIATLTLDL EPSVQLPVNT LARIGQTSLL GSQHVQLDLP
PDPSPQKLKS GDVIPLANAS AFPTTERVLA SISSILTGGG VANLETIQTE IYNVLNGRAD
QIREFLGRLD TFTEELNRQR DDITRAIDST NRLLAIVGQR NQTLDAVLTE FPPLIQHFAD
TRDLFADAVE SLGRISNAAV DALAPASDDI NTNLANLQRP LRELGKSGPY LLGALKIFLT
APYNIENVPK AIRGDYINVS LTVDLTLSAI DNGFFSGTGI SGMLRALEQA WGRDPATMIP
DVRFTPNPNS VPGGPLIERS E
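
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual compact 64-letter layout; translation table 11 assigns the same amino acids to codons as the standard table and differs in the set of permitted start codons, which does not matter here because this gene starts with ATG. The names used are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGGGAAAGT GGACCAGATT GGCGCGCCGC"))  # MGKWTRLARR
```

The output matches the first 10 residues of the listed protein sequence.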