Gene Mmcs_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0120 
Symbol 
ID4108966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp134730 
End bp136316 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content69% 
IMG OID638029245 
Productvirulence factor MCE-like protein 
Protein accessionYP_637297 
Protein GI108797100 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.265453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACGC TCGAGGGATC GAACCGCCTC CGCGGCGGGT TGATGGGCAT CATCATCCTG 
GTGCTCGTCG TCGGGGTCGG ACAGAGCTTC GCCAGCGTGC CTATGCTGTT CGCCAAGCCG
AGGTACTTCG CGCAGTTCGG CGACACCGGC GGCATCAACC CCGGCGACAA GGTCCGCATC
GCCGGCGTCG ACGTCGGCGA GGTGCTCAAG ACCGAGATCG AGGGCGACAA GGTCGTCGTC
GGCTTCACGC TGGGCGGTAC GCAGATCGGC AGTGACAGCC GCGCGGCGAT TCGCACGGAT
ACGATTCTGG GCCGAAAGAA CATCGAGATC GAACCGCGCG GGTCGGAGGC GCTGCAGTCC
AACGGCGTCC TGCCGCTGGG TCAGACGACC ACCCCGTACC AGATCTACGA CGCCTTCTTC
GACCTCACCA AATCGGCGTC CGGCTGGGAC ACCAAGTCGG TGCGCGAATC GCTCAACGTG
CTGTCGGAGA CGATCGACCA GACCTATCCG CACCTGAGCG CGGCGCTCGA CGGGGTGGCC
CGCTTCTCCG ACACCATCGG TAAGCGCGAC GAGCAGCTCA AACAGCTGCT GGCCAATGCC
AACAAGATCG CCGGGGTGCT GGGTGAGCGC AGCGGTCAGG TGAATGCGCT GCTGGTCAAC
GCGCAGACCC TGCTGGCCGC GATCAACGAG CGCAGCTACG CCGTCGGCCA GCTGCTGGAG
CGGGTGTCGG CGTTCTCGGA GCAGGTCGAG GGCTTCATCG ACGACAACCC GAACCTCAAC
AGGGTGCTCG AGCAGCTGCG CGTGATCAGC GACATCCTCG TCGAGCGCAA ATTCGACCTG
GTCGACGTGC TGACCACGCT GAGCAAGTTC ACCGCATCGC TGGCCGAGGC CTTCGCGTCC
GGCCCGTACT TCAAGGTCAT GCTGGTCAAC CTGGCGCCCT ACTGGATCCT GCAGCCGTAC
GTCGACGCGG CGTTCAAGAA GCGCGGCATC GACCCGCAGA AGTTCTGGCG TGACGCCGGT
CTGCCGGCCT ACCAGTTCCC GGATCCGAAC GGGGTGCGTC AGCCCAACGG TGCGCCGCCG
CCTGCGCCGG CGACCCTGCA GGGCACTCCG GAGTACCCGA ATCCCGCGGT GCCGAGGGGC
GACCCATGTT CCTACACCCC GCCGGCCGAC GGCCTGCCCA CACCGGGCAA TCCGCTGCCG
TGTGCGGACC TGAGCGTCGG CCCGTTCGGC GACAATCCCT ACGGTCCGAA CTACCACGGC
CGGCCCAACG TGGCGACGTC GGAGCCGAAC CCGAACGGCA TGTTGCCCAC TCCCGGCGTC
GCCAGTTCCG GGGTGCCGGG TCAGCAGGCG CCCGTGGTGC CGGGTACGCC CGTGCCGCTG
CCGCCGGCCC CGCCGGGTGC GCGCAACGAG CCGCCGGGAC CGTTCCCCGG TCCGACGGCC
GTAGGCGGTC AGGTGAACAA CGTGCCGCCG CCGCCGGCGC TGCCGGGTCC GCCGCCGCCG
CCGGGGCCGG GCCAGCAGCT GTCGCCCGCC CAGACGGGTC CGCTGCCGGG CAATCCACCG
TTCCTCCCGC CGGGATCTCA ACAATGA
 
Protein sequence
MRTLEGSNRL RGGLMGIIIL VLVVGVGQSF ASVPMLFAKP RYFAQFGDTG GINPGDKVRI 
AGVDVGEVLK TEIEGDKVVV GFTLGGTQIG SDSRAAIRTD TILGRKNIEI EPRGSEALQS
NGVLPLGQTT TPYQIYDAFF DLTKSASGWD TKSVRESLNV LSETIDQTYP HLSAALDGVA
RFSDTIGKRD EQLKQLLANA NKIAGVLGER SGQVNALLVN AQTLLAAINE RSYAVGQLLE
RVSAFSEQVE GFIDDNPNLN RVLEQLRVIS DILVERKFDL VDVLTTLSKF TASLAEAFAS
GPYFKVMLVN LAPYWILQPY VDAAFKKRGI DPQKFWRDAG LPAYQFPDPN GVRQPNGAPP
PAPATLQGTP EYPNPAVPRG DPCSYTPPAD GLPTPGNPLP CADLSVGPFG DNPYGPNYHG
RPNVATSEPN PNGMLPTPGV ASSGVPGQQA PVVPGTPVPL PPAPPGARNE PPGPFPGPTA
VGGQVNNVPP PPALPGPPPP PGPGQQLSPA QTGPLPGNPP FLPPGSQQ