Gene Mkms_4703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4703 
Symbol 
ID4616118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4927284 
End bp4928342 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content66% 
IMG OID639794395 
Productvirulence factor Mce family protein 
Protein accessionYP_940684 
Protein GI119870732 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.293736 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGAC CCGACAGCGC CAATCCGCTC CGGACCGGCA TCTTCGGCAT CGTCCTGGTG 
ACCTGCCTGG TGTTGGTGTC ATTCGGCTAC ACCGGTCTGC CGTTCTTCCC TCAGGGCAAG
TCCTACGAGG CCTACTTCAC CGACGCCGGC GGCATCATCC CGGGCAACGA CGTGAACGTC
TCGGGCATCA CGGTCGGCAA GGTCGACGGC GTAGAACTCG CCGGCGACGT CGCCAAGGTG
AACTTCACCG TCGACCGCAA GGTCCGCGTC GGAGACCAAT CGATGGTCGC GATCAAGACC
GACACGGTGC TCGGCGAGAA GTCCCTGGCC GTCACCCCGC AGGGCGCCGG GTCCTCGACG
GTGATTCCGC TGGGGCGCAC CACCACCCCG TACACGCTGA ACACCGCGCT GCAGGATCTC
GGCAACAACG TGGGCGAGTT GGACAAGCCG CGCTTCGAGC AGGCGCTGGC CACACTGACC
GACTCGCTGC GCGAGGCCAC CCCGCAACTG CGGGGCGCGC TGGACGGCAT CACGAACCTG
TCGCGCAGCA TCAACGCCCG CGACGAGGCG CTCGAACAGC TCCTCGGCCA TGCGAAGCGG
GTCTCAGACA CGCTGGCGCA GCGCGCCGGT CAGGTCAACC AGCTGATCGT CGACGGCAAC
CTGCTGTTCG CCGCGCTCGA CGAGCGGCGC CAAGCGCTGA GCAATCTGAT CGCCGGGATC
GACGATGTCG CCGAACAGCT CTCGGGCTTC GTCAACGACA ACCGCCGCGA ATTTGGGCCC
GCGCTCGAGA AACTCAACCT GGTGATGGAC AACCTGCTGG AGCGCCGTGA GCACATCAAG
GAGGCGCTGC GCAGGCTGCC GCCGTACGCC ACCACACTCG GCGAGGTCGT CGGTTCGGGC
CCCGGTTTCC AGGTCAACCT GTACGGTCTG CCACCCGCGC CGCTGGCCGA GGTGCTGCTG
GACACCTACT TCCAGCCCGG CAAACTGCCG GACAGCCTCT CCGACTACCT GCGCGGCTAT
ATCTCCGAGC GCATGATCAT CAGGCCGAAG TCGCCATGA
 
Protein sequence
MARPDSANPL RTGIFGIVLV TCLVLVSFGY TGLPFFPQGK SYEAYFTDAG GIIPGNDVNV 
SGITVGKVDG VELAGDVAKV NFTVDRKVRV GDQSMVAIKT DTVLGEKSLA VTPQGAGSST
VIPLGRTTTP YTLNTALQDL GNNVGELDKP RFEQALATLT DSLREATPQL RGALDGITNL
SRSINARDEA LEQLLGHAKR VSDTLAQRAG QVNQLIVDGN LLFAALDERR QALSNLIAGI
DDVAEQLSGF VNDNRREFGP ALEKLNLVMD NLLERREHIK EALRRLPPYA TTLGEVVGSG
PGFQVNLYGL PPAPLAEVLL DTYFQPGKLP DSLSDYLRGY ISERMIIRPK SP