Gene Mmcs_3666 details

Gene Information

Locus tag: Mmcs_3666
Symbol:
ID: 4112498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3922740
End bp: 3924434
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 67%
IMG OID: 638032806
Product: virulence factor MCE-like protein
Protein accession: YP_640829
Protein GI: 108800632
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
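
The start, end, gene length and protein length values in this record can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (counted in the gene length but not in the protein length); the function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last
    codon is a stop codon that is not part of the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed in the record
gene_len, protein_len = cds_lengths(3922740, 3924434)
print(gene_len, protein_len)  # 1695 564
```

Under these assumptions the result agrees with the listed gene length of 1695 bp and protein length of 564 aa.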


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGACCC GCTTTGTACG AACCCAGCTG ATCATCTTCA CCATCGCCTC GATCGTCGGC 
GTGTCGGTGA TGATCTTCGC GTACATGAAG GTTCCCACCC TGCTCGGCAT CGGCCGCTTG
ACGGTGACGC TGGAATTGCC CGCGGCCGGT GGGCTCTACC GCTTCAGCAA TGTGACCTAC
CGGGGTGTCC AGATCGGCAA GGTCATGGGA GTGAACCTGA CCGAGAACGG CGCAGAGGCC
ACGATGTCAC TCGACACCTC GCCGAAGGTC CCCGCAGATC TGCTTGCCGA GGTGCGCAGC
GTGTCCGCGG TCGGTGAGCA GTACGTGGAG CTGTTGCCCC AAACCGACTC GGGGCCTTAC
CTGGAGGACG GCTCGCGAAT CTCCAGGGAC AGAGCCACCA TTCCGCAAGA GGTGGGTCCC
ATGTTGGACC AGCTGAGCGC GCTGGTGGAC AGTATCCCGA GCGACAGGAT CCCCGATCTG
CTCGACGAGA CGTTCAAGGC CCTCAACGGC GCCGGCCCTG ACTTCCAGTC GCTCCTGGAC
TCGGCCTCGA AGGTGGCGGG TGACGCCAAC GCTGTCTCCG ACCAGACGCG ACAACTCATC
GATGACGTTG GGCCGCTGTT GGATTCGCAG GCGGAATCCA CCGAGGCGAT CCGGACGTGG
GCGCGCAGCC TGGCCGGGGT CACCGAACAA CTCGTCCAGA ACGACCCCGA ACTGCGCACG
GTCCTGCAGC AGGGACCCGG CTTCGCGCAG GAGGTTTCGC AGCTCCTCAC CCAGATCAAG
CCGACGCTGC CGATATTGCT GGCGAATCTG ACCAGCGTGA CTCAGGTTCT GCTCACCTAC
AATCCGGCGC TCGAGCAACT CTTGGTGATC TTTCCGGGAA TCATCGCCGC ACAGCAGTCA
TTCGGGCTCC CGCAGAACAA TCCCACCGGC CTCCCTTCGG GTGACTTCGC GCTCACCATC
AGCGACCCGG TGGCGTGCAC AGTCGGCTTC CTGCCGCCCT CGCAGTGGCG GAGCCCGGAG
GACATGACGA CGGTCGACAC GCCTGACGGG CTGTACTGCA AGTTGCCGCA GGACTCGCCG
ATCAGCGTCC GTGGTGCCCG TAACTACCCG TGTATCGAGC ACCCCGGCAA GCGGGCGCCG
ACGGTCGAAC TCTGCAACGA CCCCAAGGGA TATCAGCCAC TCGCGATCCG TGAGCACAGT
CTTGGTCCGT ATCCCTTCGA CCCGAATCTG GTCTCGCAGG GCGTCCCGGT CGACGAGCGG
GTGGATTTCC AGGACCGGAT CTACGCCCCG CTGCAGGGCA CACCGCTACC CCCGGGCGCG
GTTCCGGCGG GCACACCGCC GGTCGCCCCT CCGCCCGGCG CGCCGGCCCC GGCCACGACG
CCGGTCCCCG CAGGGCCGCC CGCCCCCGCG GCGGCGCCGG CCGCGGTGGC GCCTCCGCCA
CCGCCCGGCA ACTCGATCAA CGGAACGCCC CTTCCGCCTG TACCGCCCGC GGCGGCCGAA
GTGCCTCCCA GCGGCGGTGC GGTCCCCGCT GCGCCGAGCG CCTTCGGCGG CAACGGAACG
GGGGGACCGT CACTCGCGGT GGCGCACTAC GACCCGGCCA CCGGCGAGTA CCTGACCCCT
GATGGTCGAC TCGAGCGCCA GACGAATTCC GCGCTGCGAG CGCCCAAGTC CTGGCAGGAC
CTGTTGCCGA CCTGA
 
Protein sequence
MLTRFVRTQL IIFTIASIVG VSVMIFAYMK VPTLLGIGRL TVTLELPAAG GLYRFSNVTY 
RGVQIGKVMG VNLTENGAEA TMSLDTSPKV PADLLAEVRS VSAVGEQYVE LLPQTDSGPY
LEDGSRISRD RATIPQEVGP MLDQLSALVD SIPSDRIPDL LDETFKALNG AGPDFQSLLD
SASKVAGDAN AVSDQTRQLI DDVGPLLDSQ AESTEAIRTW ARSLAGVTEQ LVQNDPELRT
VLQQGPGFAQ EVSQLLTQIK PTLPILLANL TSVTQVLLTY NPALEQLLVI FPGIIAAQQS
FGLPQNNPTG LPSGDFALTI SDPVACTVGF LPPSQWRSPE DMTTVDTPDG LYCKLPQDSP
ISVRGARNYP CIEHPGKRAP TVELCNDPKG YQPLAIREHS LGPYPFDPNL VSQGVPVDER
VDFQDRIYAP LQGTPLPPGA VPAGTPPVAP PPGAPAPATT PVPAGPPAPA AAPAAVAPPP
PPGNSINGTP LPPVPPAAAE VPPSGGAVPA APSAFGGNGT GGPSLAVAHY DPATGEYLTP
DGRLERQTNS ALRAPKSWQD LLPT
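
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the demo input is only the first line of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Concatenate whitespace-separated sequence blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a short demo input
first_line = "ATGCTGACCC GCTTTGTACG AACCCAGCTG ATCATCTTCA CCATCGCCTC GATCGTCGGC"
seq = join_blocks(first_line)
print(len(seq), gc_fraction(seq))
```

Applied to the full gene sequence, the joined length can be compared with the listed gene length and the GC fraction with the listed GC content.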