Gene Mmcs_3668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3668 
Symbol 
ID4112500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3925816 
End bp3927300 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content66% 
IMG OID638032808 
Productvirulence factor MCE-like protein 
Protein accessionYP_640831 
Protein GI108800634 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCAC CCCGTAAGCG CCTGGCGGCC TGGACGGCGG TCCTGTTGGC CCTCGTGTTG 
GTCGCAGGCG CGGTTTTCCT GGTGCGAAAG CTGTACTTCG GGCCGAACAC CATCACGGCG
TACTTCCCCA CCGCGACGGC AATCTATCCC GGTGACGAGG TACGGGTGTC GGGCGTCAAG
GTCGGCAAGA TCGAGGCCAT CACCCCCGAG GGCACCGAGA CGAAAATGCT TCTCAGCGTC
GACCGCGACG TGCCGGTACC GGCAGACGCC AAGGCGGTGA TCGTCGCGCA GAATCTGGTG
GCGGAACGCT TCGTCGCGCT CACGCCCGCC TACCGCACCG GCGACGGACC GAAGATGCCC
GACGGCGGGG TGATCCCCAG CGACCGAACC GCGGTTCCCG TGGAGTGGGA CGAAGTGAAG
GAACAGCTGA CGCGCCTGGC TACCGCGCTG GGGCCGGAGG CCGGAGTGTC CGACACGTCC
GTCTCGCGGT TCATCGACAG CGCGGCCGAT GCGTTGGATG GCAACGGCGA GAAACTCCGG
GATACCCTGG CCCAACTCTC CGGAGTGGCC CGGGTTTTCG CCGAGGGCAG CGGGAACATC
GTCGACATCA TCAAGAACCT GCAGATCTTC GTCACCGCCC TACGCGACAG CAAGCAGCAG
ATCGTCCAGT TCGAGAATCG ACTGGCCACC CTGACCAGTG TGTTGGACGA CAGCAGGTCC
GATCTGGACG CGGCTCTGTC TGAACTGTCG GTCGCCCTGG GCGAGGTGCA GCGCTTCGTC
GCGGGCACCC GCGATCTGAC CGCCGAACAG ATTCAGCGAC TGGCCAACGT CACTCAGGTC
CTGGTCGACA ACCGCACGGC ATTGGAGAAT GTCCTCCACA TCACCCCCAA TGCGATCGCC
AATTTCCAGA ACATCTACTA CCCGAACGGC GGAGCGGTGA CGGGTGCGTT CTCCCTTGTC
AACTTTGCCA ACCCGGTGCA CTTCGTCTGC GGCCTGATCG GCGGCGTCGC CAACACCACC
GCACCCGAGA CGGCGAAACT CTGCGCGCAG TACCTCGGTC CCGCATTGCG CCTGCTCAAC
TTCAACAACA TTCCTCTGCC GATCAACGCC TACTTGAGGC CCGCGGTCAA TCCGGAGCGG
ATCATCTACA CCGACCCGAA GCTCGCACCT GGGGGAGCAG GACCGGGCGA TCCGCCCGAG
CCGTTCCCGT CGGTGTCCGC CTACCTCGGA GCCGGGGACA TCCCGCCGCC TCCGGGCTGG
AATCAACCGC CGGGTCCTCC CGGGCTTTAT ACGCCGGACG ACGACGCGCC GGCCATCCCA
TCACCGGCGT TGTACCCCGG CGCCCCGATC CCAGCGCCGC CGAACGTGTT GAGCAACATC
GCGCCGGGAC CGCAGACAGT CGATGGGATG CTGCTTCCAC CCACCCCGGC GCCGGCCAAC
CCGCCGCCGC CCAGCGGGCC GCCGCTGCCT GCGGAGGCTC CGTGA
 
Protein sequence
MTAPRKRLAA WTAVLLALVL VAGAVFLVRK LYFGPNTITA YFPTATAIYP GDEVRVSGVK 
VGKIEAITPE GTETKMLLSV DRDVPVPADA KAVIVAQNLV AERFVALTPA YRTGDGPKMP
DGGVIPSDRT AVPVEWDEVK EQLTRLATAL GPEAGVSDTS VSRFIDSAAD ALDGNGEKLR
DTLAQLSGVA RVFAEGSGNI VDIIKNLQIF VTALRDSKQQ IVQFENRLAT LTSVLDDSRS
DLDAALSELS VALGEVQRFV AGTRDLTAEQ IQRLANVTQV LVDNRTALEN VLHITPNAIA
NFQNIYYPNG GAVTGAFSLV NFANPVHFVC GLIGGVANTT APETAKLCAQ YLGPALRLLN
FNNIPLPINA YLRPAVNPER IIYTDPKLAP GGAGPGDPPE PFPSVSAYLG AGDIPPPPGW
NQPPGPPGLY TPDDDAPAIP SPALYPGAPI PAPPNVLSNI APGPQTVDGM LLPPTPAPAN
PPPPSGPPLP AEAP