Gene Mjls_4165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4165 
Symbol 
ID4879871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4399682 
End bp4401145 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content62% 
IMG OID640141474 
Productvirulence factor Mce family protein 
Protein accessionYP_001072428 
Protein GI126436737 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.450079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATCA CCAGAAGAAT CTGGATCCAG TTGGGGGTCT TCCTCGCGGT TGCGCTGACC 
GCCTTCTCGA TCATGGCATT CAATTACATG AAGTTGCCGA ACCTGTTGTT CGGCATCGGT
CGCTACTCGG TCACACTGCA GCTGCCGGAG GCCGGCGGAC TGTACGAACG GGGCAATGTG
ACCTACCGCG GCACCGAGGT AGGGCAGGTC AAGAGCGTCC GCCTGACCGA GAGTGGTGAC
GTGGAGGCGG AGTTGTCTCT GCAGTCCGAT GTCAAGATCC CGGCGAACCT CATAGCTGAG
GTGCACAGTC AGAGCGCGGT CGGCGAGCAG TACGTCGCAC TGCTCCCGCA AGGTGACGGC
GGTCCGGTGC TGAAGAACGG TGACGTGATA TCGCAGGAGA GGACGACGGT TCCGCCGGAT
ATCAATTCGC TGCTCGACGC TACCAACCGC GGCCTGGAGG CCATCCCCGG CGACAATCTG
AAGACCGCGG TAGACGAGGC CTATACGGCA GTCGGTGGTC TCGGACCGGA GATCAACCGA
TTCGTCAAGG GCTCTACCGC CTTGGCGATC GACGCGCGCA AGAACCTGGA TGACCTGACC
AATGTGGTCG ACAATGTCGC TCCGATTCTG GATACGCAGA CCGACACGTC GGATTCGATC
CAGGCGTGGG CCTCTCACCT CGCTGGCGTC ACCAAGCAAC TGCAATCCAA CGATGCTGCT
GTGCAGGGGA TCCTGCATAA CGGGCCGGGG GCGGCCGACG AGGCGCGAGC GCTTTTCGAC
CGGCTGCAAC CCACGCTGCC GATCGTGCTG GCCAACCTGG TGAGCATCGA ACCGGTGCTG
GTCACCTACC GGGACAACCT CGAACAGCTA CTCGTGTTGT TGCCCCAGGC CACTTCGATC
ATGCAGGCGA TCGGTGTGCC GAACCGGCAC ACCAAGATGG ATTTCGAAGG CGCGTTCCTG
GCGTTCAACC TGAATGTCAA CATTCCGCCG CCGTGTACCA CAGGATTCCT GCCGGTTCAG
CAGATGCGGC CCGCTGCGGA GCTCGACTCG CCCGAGCGCC CCGCCGAGGA TCTGTACTGC
CGCATTCCGC AGGACTCGAT GTTCAACGTG CGTGGTGCGC GAAACACACC GTGTGTAACG
CGGCCGGGCA AGCGCGCCCC CACGGTGAAG ATGTGCGAGA GCGACGAAGA GTATGTTCCG
CTGAACGACG GTCACAACTG GAAGGGCGAC CCCAACGCCA CAACGAGCGG GCAGGATATT
CCTCAGCCTC CCGCGGGAAG TTTGCCGAAC ACGCCAGCAC CTACACTCGC CCCTGCGCCG
CCGATTGCGG CGGCCGACTA TGACCCGGCC ACCGGCACGT ACGTCGGACC AGACGGGCAC
GTCTACACAC AATCGAATCT GGCCAGAAGT GCTAACGAGG AACAGACATG GCAACAAATG
CTGATACCCC CGACGGGGCA GTAA
 
Protein sequence
MHITRRIWIQ LGVFLAVALT AFSIMAFNYM KLPNLLFGIG RYSVTLQLPE AGGLYERGNV 
TYRGTEVGQV KSVRLTESGD VEAELSLQSD VKIPANLIAE VHSQSAVGEQ YVALLPQGDG
GPVLKNGDVI SQERTTVPPD INSLLDATNR GLEAIPGDNL KTAVDEAYTA VGGLGPEINR
FVKGSTALAI DARKNLDDLT NVVDNVAPIL DTQTDTSDSI QAWASHLAGV TKQLQSNDAA
VQGILHNGPG AADEARALFD RLQPTLPIVL ANLVSIEPVL VTYRDNLEQL LVLLPQATSI
MQAIGVPNRH TKMDFEGAFL AFNLNVNIPP PCTTGFLPVQ QMRPAAELDS PERPAEDLYC
RIPQDSMFNV RGARNTPCVT RPGKRAPTVK MCESDEEYVP LNDGHNWKGD PNATTSGQDI
PQPPAGSLPN TPAPTLAPAP PIAAADYDPA TGTYVGPDGH VYTQSNLARS ANEEQTWQQM
LIPPTGQ