Gene Mkms_3744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3744 
Symbol 
ID4611679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3963892 
End bp3965088 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content63% 
IMG OID639793425 
Productvirulence factor Mce family protein 
Protein accessionYP_939728 
Protein GI119869776 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.648515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.690685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGGGA GCGCAGTACG TCCGCTGACA GGGCTGGCCT TGCTCGTGGC CATCGGCCTG 
ATCATCGCGC TGGCCATCGG TCTGTTCGCG GGAACATTCA CCCGAACCGT TCCCGTAACG
GTCGTGTCCG ACCGGGCCGG CCTCGTGATG AACCCCGATG CCGAGGTCAA GATGCGCGGC
GTCGAAGTCG GGCGGGTCGA GACGATCGAG CGGCGGCCCG ACGGCAAGGC GGTTCTCCAT
CTGGCGATGA ATCCGTCACA GCTCCACCTC ATTCCGTCCA ACGTGAACGT CGACATCGCC
TCGACGACTG TCTTCGGGGC CAAGTTCGTG CGGTTCTTCC CGCCGGAGGA CCCCTCTCCG
GAGAAGTTGC GGAAGGGGCA GGTGATCGAA GGCCAGCATG TCACCGTCGA GGTCAATACG
GTGTTCCAGC AGCTCGTGGA GGTGCTGGAC AAGATCGATC CCGCGAAGCT CAACGAGACG
CTGGGTGCGA TCTCGTCGGC ATTCAACGGC CGCGGCGAGA AGTTCGGTCA GACGCTGGTC
GATCTGAATG CGGTGCTCGC CAAGATCGAA CCGAGCCTGC CGAATCTGGC CCAAGACATC
GAGGCCTCCG TGCCCACGCT GACGGCCTAC GGAGACGCGG CGCCGGACCT GATCTCTGCG
GTGCAGAACA CGACACAGTT CAGCAACACG ATCGTCGACC AGCAGCAGAA CCTCGACGAG
TTCCTGGTCA GCGCAATCGG TCTGGCCGAC ATCGGCAACG AGGTGATCGG TGGCAACCGC
GAGGCCCTCG GTGACGTCCT GGAGAATCTC GTGCCCACCG CCGAGTTGCT CAACACCTAC
AAAAAGTCGT TGTGGTGCGG CATCGGCGGC CTCATCCCGT TCGCGAAATC AGGACCGCAG
TACTCGGGCA TCATGGTCTC GGCCGGCCTG ACACTGGGCG TCGAGCGGTA CCGGTATCCC
GGTGACCTGC CGAAGGTCGC GGCCAAGAGC GGGGGGCGTG ACTATTGCAA GGAACTCGGA
CTGCCCGAAC TACCCCCTGA GTTCGTGCCG CCGATGGTCG TCGGCGACGT CGGCTCCAAT
CCGGCGCAAT ATGGCAACGC GGGAATCCTG CTGAACTCCG AGGGACTCAA GAACTGGCTC
TTCGGACCAC TCGACGGGCC GCCTCGAAAC ACCGCACAGA TTGGAATGCC GGGATGA
 
Protein sequence
MKGSAVRPLT GLALLVAIGL IIALAIGLFA GTFTRTVPVT VVSDRAGLVM NPDAEVKMRG 
VEVGRVETIE RRPDGKAVLH LAMNPSQLHL IPSNVNVDIA STTVFGAKFV RFFPPEDPSP
EKLRKGQVIE GQHVTVEVNT VFQQLVEVLD KIDPAKLNET LGAISSAFNG RGEKFGQTLV
DLNAVLAKIE PSLPNLAQDI EASVPTLTAY GDAAPDLISA VQNTTQFSNT IVDQQQNLDE
FLVSAIGLAD IGNEVIGGNR EALGDVLENL VPTAELLNTY KKSLWCGIGG LIPFAKSGPQ
YSGIMVSAGL TLGVERYRYP GDLPKVAAKS GGRDYCKELG LPELPPEFVP PMVVGDVGSN
PAQYGNAGIL LNSEGLKNWL FGPLDGPPRN TAQIGMPG