Gene Mjls_4998 details

Gene Information

Locus tag: Mjls_4998
Symbol:
ID: 4880696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5236654
End bp: 5237712
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 67%
IMG OID: 640142308
Product: virulence factor Mce family protein
Protein accession: YP_001073253
Protein GI: 126437562
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
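
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5236654
end_bp = 5237712
gene_length_bp = 1059
protein_length_aa = 352

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("whole number of codons:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True for this record: 1059 bp is 353 codons, one of which is the stop codon.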


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.408593
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAGAC CCGACAGCAC CAATCCGCTC CGCACCGGAA TCTTCGGCAT CGTCCTGGTG 
ACCTGCCTGG TGCTGGTGTC GTTCGGCTAC ACCGGCCTGC CGTTCTTCCC GCAGGGCAAG
TCGTACGAGG CGTACTTCAC CGACGCCGGC GGGATCACGC CCGGCAACGA CGTGAACGTC
TCCGGTATCA CGGTCGGCAA GGTCGACGGC GTGGAACTTG CCGGCGACGC GGCCAAGGTG
AATTTCACCG TCGACCGCGA GGTGCGGGTC GGGGACCAGT CGATGGTCGC GATCAAGACC
GACACCGTGC TCGGTGAGAA ATCGCTGTCG GTCACGCCGC AGGGCGCCGG CTCGTCGACG
GTGATCCCGT TGGGGCGCAC CACAACCCCG TACACGCTCA ACACCGCGCT GCAGGACCTC
GGCCAGAACG TCGGCGAGTT GGACAAGCCG CGGTTCGAGC AGGCGCTGGC CACACTGACC
GACTCGCTGC GCGAAGCCAC CCCTGCGCTG CGCGGCGCGC TCGACGGCAT CACCGATCTG
TCCCGCAGCA TCAACGAGCG CGACGAGGCG CTCGAGCAGT TGCTCGGCCA CGCCAAGCGG
GTCTCGGACA CGCTGGCGCA GCGGGCCGGT CAGGTCAACC AGCTGATCAC CGACGGCAAC
CTGCTCTTCG CCGCACTCGA CGAGCGGCGC CAGGCGCTGA GCAACCTGAT CGCCGGGATC
GACGATGTGT CAGAACAACT GTCGGGGTTC GTCAACGACA ACCGCCGCGA GTTCCGGCCC
GCGCTCGAGA AGCTCAACCT GGTGATGGAC AACCTGCTCG AGCGTCGCGA GCACATCGGG
GAGGCCCTGC GCAGACTGCC GCCGTACGCG ACCGCGCTCG GCGAGGTCGT CGGTTCGGGA
CCCGGCTTCC AGATCAACCT GTACGGCCTG CCGCCTGCCC CCATCGCCGA AGTCCTGCTG
GACACCTACT TCCAGCCCGG CAAACTGCCG GACAGCCTGT CGGACCTGCT TCGCGGATAC
ATCTCGGAGC GCATGATCAT CAGGCCGAAG TCGCCATGA
 
Protein sequence
MARPDSTNPL RTGIFGIVLV TCLVLVSFGY TGLPFFPQGK SYEAYFTDAG GITPGNDVNV 
SGITVGKVDG VELAGDAAKV NFTVDREVRV GDQSMVAIKT DTVLGEKSLS VTPQGAGSST
VIPLGRTTTP YTLNTALQDL GQNVGELDKP RFEQALATLT DSLREATPAL RGALDGITDL
SRSINERDEA LEQLLGHAKR VSDTLAQRAG QVNQLITDGN LLFAALDERR QALSNLIAGI
DDVSEQLSGF VNDNRREFRP ALEKLNLVMD NLLERREHIG EALRRLPPYA TALGEVVGSG
PGFQINLYGL PPAPIAEVLL DTYFQPGKLP DSLSDLLRGY ISERMIIRPK SP
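
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The sketch below shows one possible way to strip the layout, estimate GC content, and translate the gene sequence. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Amino acids for the standard codon assignments, in TCAG order
# (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(bases): aa
    for bases, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def clean(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First line of the gene sequence, copied from the record above.
first_line = "ATGGCTAGAC CCGACAGCAC CAATCCGCTC CGCACCGGAA TCTTCGGCAT CGTCCTGGTG"
dna = clean(first_line)

print(len(dna))
print(translate(dna))
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence through the same helpers should give a protein that can be compared with the listed protein sequence, followed by a trailing `*` for the stop codon.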