Gene Mmcs_3670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3670 
Symbol 
ID4112502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3928388 
End bp3929413 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content61% 
IMG OID638032810 
Productvirulence factor MCE-like protein 
Protein accessionYP_640833 
Protein GI108800636 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGAT CGACTGGCAC GCTGATCAAG TTCCTCATCT TCGGCGTCAT CATGGTGGTG 
CTGACCGCCT TCCTGTTCTT GGTGTTCAGT GACTCGAGGA CCGGTGCGAC CGAGAAGTAT
TCGGCTGTCT TCGAAGATGC GTCGCGGCTG AAGGCGGGCG AGAGTGTGCG GATCGCCGGC
ATTCGGGTCG GCACCGTCAA GAGCGTGTCG CTGCGGGCCG ACAGAAAAGT CGTGGTCGAG
TTCGACACCG ATAAGAACAC CAAGCTGACC ACCAGCACCA AAGCGGCGAT CCGCTATCTC
AATCTGGTCG GCGATCGGTA CGTCGAACTC ATCGACAGCC CCGGTTCAAC GAGAATTCTC
CCGGCCGGCT CCGAGATTCC CTTGGCTCGC ACCGCACCGG CACTCGACCT CGACGTACTG
CTCGGCGGCC TCAAACCGGT TATCCGGGGC CTCAATCCAG AGGATGTGAA CGGCCTCACC
ACGTCGCTTG TCCAGATCCT GCAGGGTCAA GGCGGAACAC TCGATTCGTT GTTCTCGAAG
TCGTCGTCCT TCACCAACTC ACTCGCCGAC AACAACCAGG TGATCGAGCA GTTGATCGAC
GAGCTGCGAA CGCTGCTGGA CACGCTGTCC AAAGACGGCG AGGAGTTCTC CGGCGCGATC
GACAGACTGG ATCAGCTGAT CGAGGGATTG GCCGCGGACC GCGATCCGAT CGGCACCGCC
ATCGAGGCGT TGGACAACGG AACCTCGTCG CTGGCCGACC TTCTCGGCCG GGCACGGCCG
CCGTTGAACA ACACGATCGA CCAGCTGAAT CGGCTCGCTC CGCTGCTGAA TACCGATCTA
CCGCGCCTGG ACGCAACCCT GCAGCGCCTA CCCGAGATCT ACCGCAAGCT CGCCCGGGTG
GGTTCCTATG GCGCGTTCTT CCCCTACTAC ATCTGCGGAA TCACCTTCCG CGCCAGTGAT
CTCGAGGGCC GCACCGTGGT GTTCCCCTGG ATCAAGCAAG AGACGGGAAG GTGTGTGGAT
CAGTAG
 
Protein sequence
MTRSTGTLIK FLIFGVIMVV LTAFLFLVFS DSRTGATEKY SAVFEDASRL KAGESVRIAG 
IRVGTVKSVS LRADRKVVVE FDTDKNTKLT TSTKAAIRYL NLVGDRYVEL IDSPGSTRIL
PAGSEIPLAR TAPALDLDVL LGGLKPVIRG LNPEDVNGLT TSLVQILQGQ GGTLDSLFSK
SSSFTNSLAD NNQVIEQLID ELRTLLDTLS KDGEEFSGAI DRLDQLIEGL AADRDPIGTA
IEALDNGTSS LADLLGRARP PLNNTIDQLN RLAPLLNTDL PRLDATLQRL PEIYRKLARV
GSYGAFFPYY ICGITFRASD LEGRTVVFPW IKQETGRCVD Q