Gene Mjls_3946 details

Gene Information

Locus tag: Mjls_3946
Symbol:
ID: 4879655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 4169252
End bp: 4170916
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 67%
IMG OID: 640141258
Product: extracellular solute-binding protein
Protein accession: YP_001072212
Protein GI: 126436521
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
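
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names gene_length and protein_length are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4169252, 4170916)
print(length_bp)                  # 1665
print(protein_length(length_bp))  # 554
```

Both results agree with the Gene Length and Protein Length fields of the record.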


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.436412
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0739925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTTGC GACGCCTGAT CTCGGCCGCC CTCGTTGCCA CCCTCACCCT CGCCGCGTGT 
TCCAGCGGTG ACGAGGAGAC CCCGTCGGCC GGCGGTAGCG CGGAGGTGGG CGCCACCAAC
GACGTCAATC CTCAGGACGT GTCCAACCTG CGCCAGGGCG GCAACCTGCG CTTGGCGCTG
ACCGCATTCC CGGCGAACTT CAACAGCCTG CACATCGACG GCAACGTCGC CGATGCGGCC
GGGATGCTCA GGGCCACGAT GCCGCGTGCG TTCCGGATCG CCGCGGACGG TTCGGCGACG
CTGAACACCG ACTACTTCAC CGGCGCCGAG ATCACCGGCA CCGACCCGCA GGTCGTCACC
TGGACGATCA ACCCGAAGGC GGTGTGGAGC GACGGCACCC CGATCACCTG GGAGGACATC
GCCGCCCAGG TGAACGCCAC CAGCGGTAAG GAGGGGTTCG CCTTCGCCAG CCCCAACGGC
TCCGACCGCG TCGCGTCGGT GACCCGCGGG GCCGACGACC GGCAGGCCGT CATGACCTTC
GCCAAGCCCT ACGCGGAATG GCGCGGCATG CTGTCGGGCA ACACCATGCT GCTGCCCAAG
AGCATGACCG CGACGCCCGA GGCGTTCAAC CGCGGCCAAC TCGACGCGCC GGGCCCGTCG
GCGGGTCCGT TCATCGTGTC GAATCTGGAC CGTACGGCCC AGCGGATCAC GCTGACCCGC
AACCCGAAAT GGTGGGGTGA GCCGCCACTT CTGGACAGCA TCACCTACTC GGTGCTCGAC
GACGCCGCCC GAATCCCCGC GCTGCAGAAC AACGCGCTGG ACGCCACCGG TCTGGCGACC
CTCGACGAGT TGACGATCGC GCGGCGCACC AACGGTGTGG CGATCCGGCG GGCGCCGGGC
AACAGCTGGT ACCACTTCAC GTTCAACGGG GCACCGGGGT CGATCCTGGC CGATAAGGCG
CTGCGGCAGG CGATCGCGAA AGGCATTGAC CGCCAGACCA TTGCGGCGGT CACCCAGCGC
GGCCTCGCCG ACGATCCGGT GCCGCTCAAC AACCACATCT TCGTCGCGGG CCAGCAGGGC
TACCAGGACA ACAGCGGTGT GGTGGCGTTC GATCCCGAGA AGGCCAAACA GGAACTCGAT
GCGCTCGGAT GGCGGCTCAA CGGGCAGTTC CGGGAGAAGG ACGGCCGCCA GCTGGTGATC
CGCGACGTGC TGTTCGACGC GCTGAGCACC CGTCAGTTCG GCCAGATCGC GCAGAACAAC
CTCGCCCAGA TCGGTGTGAA ACTCGAACTG GACGCCAAGG GTGCGGCCGG CTTCTTCACC
GACTACATCA ACACCGGCGA CTTCGACATC GCGCAGTTCT CGTGGGTGGG TGACGCGTTC
CCGCTCTCGG GTCTGACGCA GATCTACGCC TCCAACGGGG AGAGCAACTT CGGCAAGATC
GGCAGCCCGC AGATCGACGC GAAGATCGAG GAGGCGCTGG AGGAACTGGA TCCGGCGAAG
GCTCAGCAGA AGGCCAACGA GGTCGACAAA CTGCTGTGGG ACGAGGTGTT CAGCCTGCCG
TTGACGCAGT CGCCGGGCAA CGTGGCGGTG CGTGCCAACC TCGCCAACTT CGGGGCGTTC
GGCCTCGCGG ACGCCGACTA CTCGAAGATC GGCTTCGTGA AGTAG
 
Protein sequence
MTLRRLISAA LVATLTLAAC SSGDEETPSA GGSAEVGATN DVNPQDVSNL RQGGNLRLAL 
TAFPANFNSL HIDGNVADAA GMLRATMPRA FRIAADGSAT LNTDYFTGAE ITGTDPQVVT
WTINPKAVWS DGTPITWEDI AAQVNATSGK EGFAFASPNG SDRVASVTRG ADDRQAVMTF
AKPYAEWRGM LSGNTMLLPK SMTATPEAFN RGQLDAPGPS AGPFIVSNLD RTAQRITLTR
NPKWWGEPPL LDSITYSVLD DAARIPALQN NALDATGLAT LDELTIARRT NGVAIRRAPG
NSWYHFTFNG APGSILADKA LRQAIAKGID RQTIAAVTQR GLADDPVPLN NHIFVAGQQG
YQDNSGVVAF DPEKAKQELD ALGWRLNGQF REKDGRQLVI RDVLFDALST RQFGQIAQNN
LAQIGVKLEL DAKGAAGFFT DYINTGDFDI AQFSWVGDAF PLSGLTQIYA SNGESNFGKI
GSPQIDAKIE EALEELDPAK AQQKANEVDK LLWDEVFSLP LTQSPGNVAV RANLANFGAF
GLADADYSKI GFVK
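
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, not the method used to produce this record. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names clean, translate and gc_percent are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGACGTTGC GACGCCTGAT CTCGGCCGCC"))  # MTLRRLISAA
print(translate("GGCTTCGTGA AGTAG"))                  # GFVK
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Passing the full gene sequence to translate and gc_percent gives values that can be compared with the Protein Length and GC content fields.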