Gene Mkms_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0129 
Symbol 
ID4615535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp142070 
End bp143656 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content69% 
IMG OID639789805 
Productvirulence factor Mce family protein 
Protein accessionYP_936137 
Protein GI119866185 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.54103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.12821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACGC TCGAGGGATC GAACCGCCTC CGCGGCGGGT TGATGGGCAT CATCATCCTG 
GTGCTCGTCG TCGGGGTCGG ACAGAGCTTC GCCAGCGTGC CTATGCTGTT CGCCAAGCCG
AGGTACTTCG CGCAGTTCGG CGACACCGGC GGCATCAACC CCGGCGACAA GGTCCGCATC
GCCGGCGTCG ACGTCGGCGA GGTGCTCAAG ACCGAGATCG AGGGCGACAA GGTCGTCGTC
GGCTTCACGC TGGGCGGTAC GCAGATCGGC AGTGACAGCC GCGCGGCGAT TCGCACGGAT
ACGATTCTGG GCCGAAAGAA CATCGAGATC GAACCGCGCG GGTCGGAGGC GCTGCAGTCC
AACGGCGTCC TGCCGCTGGG TCAGACGACC ACCCCGTACC AGATCTACGA CGCCTTCTTC
GACCTCACCA AATCGGCGTC CGGCTGGGAC ACCAAGTCGG TGCGCGAATC GCTCAACGTG
CTGTCGGAGA CGATCGACCA GACCTATCCG CACCTGAGCG CGGCGCTCGA CGGGGTGGCC
CGCTTCTCCG ACACCATCGG TAAGCGCGAC GAGCAGCTCA AACAGCTGCT GGCCAATGCC
AACAAGATCG CCGGGGTGCT GGGTGAGCGC AGCGGTCAGG TGAATGCGCT GCTGGTCAAC
GCGCAGACCC TGCTGGCCGC GATCAACGAG CGCAGCTACG CCGTCGGCCA GCTGCTGGAG
CGGGTGTCGG CGTTCTCGGA GCAGGTCGAG GGCTTCATCG ACGACAACCC GAACCTCAAC
AGGGTGCTCG AGCAGCTGCG CGTGATCAGC GACATCCTCG TCGAGCGCAA ATTCGACCTG
GTCGACGTGC TGACCACGCT GAGCAAGTTC ACCGCATCGC TGGCCGAGGC CTTCGCGTCC
GGCCCGTACT TCAAGGTCAT GCTGGTCAAC CTGGCGCCCT ACTGGATCCT GCAGCCGTAC
GTCGACGCGG CGTTCAAGAA GCGCGGCATC GACCCGCAGA AGTTCTGGCG TGACGCCGGT
CTGCCGGCCT ACCAGTTCCC GGATCCGAAC GGGGTGCGTC AGCCCAACGG TGCGCCGCCG
CCTGCGCCGG CGACCCTGCA GGGCACTCCG GAGTACCCGA ATCCCGCGGT GCCGAGGGGC
GACCCATGTT CCTACACCCC GCCGGCCGAC GGCCTGCCCA CACCGGGCAA TCCGCTGCCG
TGTGCGGACC TGAGCGTCGG CCCGTTCGGC GACAATCCCT ACGGTCCGAA CTACCACGGC
CGGCCCAACG TGGCGACGTC GGAGCCGAAC CCGAACGGCA TGTTGCCCAC TCCCGGCGTC
GCCAGTTCCG GGGTGCCGGG TCAGCAGGCG CCCGTGGTGC CGGGTACGCC CGTGCCGCTG
CCGCCGGCCC CGCCGGGTGC GCGCAACGAG CCGCCGGGAC CGTTCCCCGG TCCGACGGCC
GTAGGCGGTC AGGTGAACAA CGTGCCGCCG CCGCCGGCGC TGCCGGGTCC GCCGCCGCCG
CCGGGGCCGG GCCAGCAGCT GTCGCCCGCC CAGACGGGTC CGCTGCCGGG CAATCCACCG
TTCCTCCCGC CGGGATCTCA ACAATGA
 
Protein sequence
MRTLEGSNRL RGGLMGIIIL VLVVGVGQSF ASVPMLFAKP RYFAQFGDTG GINPGDKVRI 
AGVDVGEVLK TEIEGDKVVV GFTLGGTQIG SDSRAAIRTD TILGRKNIEI EPRGSEALQS
NGVLPLGQTT TPYQIYDAFF DLTKSASGWD TKSVRESLNV LSETIDQTYP HLSAALDGVA
RFSDTIGKRD EQLKQLLANA NKIAGVLGER SGQVNALLVN AQTLLAAINE RSYAVGQLLE
RVSAFSEQVE GFIDDNPNLN RVLEQLRVIS DILVERKFDL VDVLTTLSKF TASLAEAFAS
GPYFKVMLVN LAPYWILQPY VDAAFKKRGI DPQKFWRDAG LPAYQFPDPN GVRQPNGAPP
PAPATLQGTP EYPNPAVPRG DPCSYTPPAD GLPTPGNPLP CADLSVGPFG DNPYGPNYHG
RPNVATSEPN PNGMLPTPGV ASSGVPGQQA PVVPGTPVPL PPAPPGARNE PPGPFPGPTA
VGGQVNNVPP PPALPGPPPP PGPGQQLSPA QTGPLPGNPP FLPPGSQQ