Gene Mkms_3743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3743 
Symbol 
ID4611678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3962870 
End bp3963895 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content61% 
IMG OID639793424 
Productvirulence factor Mce family protein 
Protein accessionYP_939727 
Protein GI119869775 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.91638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGAT CGACTGGCAC GCTGATCAAG TTCCTCATCT TCGGCGTCAT CATGGTGGTG 
CTGACCGCCT TCCTGTTCTT GGTGTTCAGT GACTCGAGGA CCGGTGCGAC CGAGAAGTAT
TCGGCTGTCT TCGAAGATGC GTCGCGGCTG AAGGCGGGCG AGAGTGTGCG GATCGCCGGC
ATTCGGGTCG GCACCGTCAA GAGCGTGTCG CTGCGGGCCG ACAGAAAAGT CGTGGTCGAG
TTCGACACCG ATAAGAACAC CAAGCTGACC ACCAGCACCA AAGCGGCGAT CCGCTATCTC
AATCTGGTCG GCGATCGGTA CGTCGAACTC ATCGACAGCC CCGGTTCAAC GAGAATTCTC
CCGGCCGGCT CCGAGATTCC CTTGGCTCGC ACCGCACCGG CACTCGACCT CGACGTACTG
CTCGGCGGCC TCAAACCGGT TATCCGGGGC CTCAATCCAG AGGATGTGAA CGGCCTCACC
ACGTCGCTTG TCCAGATCCT GCAGGGTCAA GGCGGAACAC TCGATTCGTT GTTCTCGAAG
TCGTCGTCCT TCACCAACTC ACTCGCCGAC AACAACCAGG TGATCGAGCA GTTGATCGAC
GAGCTGCGAA CGCTGCTGGA CACGCTGTCC AAAGACGGCG AGGAGTTCTC CGGCGCGATC
GACAGACTGG ATCAGCTGAT CGAGGGATTG GCCGCGGACC GCGATCCGAT CGGCACCGCC
ATCGAGGCGT TGGACAACGG AACCTCGTCG CTGGCCGACC TTCTCGGCCG GGCACGGCCG
CCGTTGAACA ACACGATCGA CCAGCTGAAT CGGCTCGCTC CGCTGCTGAA TACCGATCTA
CCGCGCCTGG ACGCAACCCT GCAGCGCCTA CCCGAGATCT ACCGCAAGCT CGCCCGGGTG
GGTTCCTATG GCGCGTTCTT CCCCTACTAC ATCTGCGGAA TCACCTTCCG CGCCAGTGAT
CTCGAGGGCC GCACCGTGGT GTTCCCCTGG ATCAAGCAAG AGACGGGAAG GTGTGTGGAT
CAGTAG
 
Protein sequence
MTRSTGTLIK FLIFGVIMVV LTAFLFLVFS DSRTGATEKY SAVFEDASRL KAGESVRIAG 
IRVGTVKSVS LRADRKVVVE FDTDKNTKLT TSTKAAIRYL NLVGDRYVEL IDSPGSTRIL
PAGSEIPLAR TAPALDLDVL LGGLKPVIRG LNPEDVNGLT TSLVQILQGQ GGTLDSLFSK
SSSFTNSLAD NNQVIEQLID ELRTLLDTLS KDGEEFSGAI DRLDQLIEGL AADRDPIGTA
IEALDNGTSS LADLLGRARP PLNNTIDQLN RLAPLLNTDL PRLDATLQRL PEIYRKLARV
GSYGAFFPYY ICGITFRASD LEGRTVVFPW IKQETGRCVD Q