Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mjls_3946 |
Symbol | |
ID | 4879655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. JLS |
Kingdom | Bacteria |
Replicon accession | NC_009077 |
Strand | + |
Start bp | 4169252 |
End bp | 4170916 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640141258 |
Product | extracellular solute-binding protein |
Protein accession | YP_001072212 |
Protein GI | 126436521 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.436412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0739925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTGC GACGCCTGAT CTCGGCCGCC CTCGTTGCCA CCCTCACCCT CGCCGCGTGT TCCAGCGGTG ACGAGGAGAC CCCGTCGGCC GGCGGTAGCG CGGAGGTGGG CGCCACCAAC GACGTCAATC CTCAGGACGT GTCCAACCTG CGCCAGGGCG GCAACCTGCG CTTGGCGCTG ACCGCATTCC CGGCGAACTT CAACAGCCTG CACATCGACG GCAACGTCGC CGATGCGGCC GGGATGCTCA GGGCCACGAT GCCGCGTGCG TTCCGGATCG CCGCGGACGG TTCGGCGACG CTGAACACCG ACTACTTCAC CGGCGCCGAG ATCACCGGCA CCGACCCGCA GGTCGTCACC TGGACGATCA ACCCGAAGGC GGTGTGGAGC GACGGCACCC CGATCACCTG GGAGGACATC GCCGCCCAGG TGAACGCCAC CAGCGGTAAG GAGGGGTTCG CCTTCGCCAG CCCCAACGGC TCCGACCGCG TCGCGTCGGT GACCCGCGGG GCCGACGACC GGCAGGCCGT CATGACCTTC GCCAAGCCCT ACGCGGAATG GCGCGGCATG CTGTCGGGCA ACACCATGCT GCTGCCCAAG AGCATGACCG CGACGCCCGA GGCGTTCAAC CGCGGCCAAC TCGACGCGCC GGGCCCGTCG GCGGGTCCGT TCATCGTGTC GAATCTGGAC CGTACGGCCC AGCGGATCAC GCTGACCCGC AACCCGAAAT GGTGGGGTGA GCCGCCACTT CTGGACAGCA TCACCTACTC GGTGCTCGAC GACGCCGCCC GAATCCCCGC GCTGCAGAAC AACGCGCTGG ACGCCACCGG TCTGGCGACC CTCGACGAGT TGACGATCGC GCGGCGCACC AACGGTGTGG CGATCCGGCG GGCGCCGGGC AACAGCTGGT ACCACTTCAC GTTCAACGGG GCACCGGGGT CGATCCTGGC CGATAAGGCG CTGCGGCAGG CGATCGCGAA AGGCATTGAC CGCCAGACCA TTGCGGCGGT CACCCAGCGC GGCCTCGCCG ACGATCCGGT GCCGCTCAAC AACCACATCT TCGTCGCGGG CCAGCAGGGC TACCAGGACA ACAGCGGTGT GGTGGCGTTC GATCCCGAGA AGGCCAAACA GGAACTCGAT GCGCTCGGAT GGCGGCTCAA CGGGCAGTTC CGGGAGAAGG ACGGCCGCCA GCTGGTGATC CGCGACGTGC TGTTCGACGC GCTGAGCACC CGTCAGTTCG GCCAGATCGC GCAGAACAAC CTCGCCCAGA TCGGTGTGAA ACTCGAACTG GACGCCAAGG GTGCGGCCGG CTTCTTCACC GACTACATCA ACACCGGCGA CTTCGACATC GCGCAGTTCT CGTGGGTGGG TGACGCGTTC CCGCTCTCGG GTCTGACGCA GATCTACGCC TCCAACGGGG AGAGCAACTT CGGCAAGATC GGCAGCCCGC AGATCGACGC GAAGATCGAG GAGGCGCTGG AGGAACTGGA TCCGGCGAAG GCTCAGCAGA AGGCCAACGA GGTCGACAAA CTGCTGTGGG ACGAGGTGTT CAGCCTGCCG TTGACGCAGT CGCCGGGCAA CGTGGCGGTG CGTGCCAACC TCGCCAACTT CGGGGCGTTC GGCCTCGCGG ACGCCGACTA CTCGAAGATC GGCTTCGTGA AGTAG
|
Protein sequence | MTLRRLISAA LVATLTLAAC SSGDEETPSA GGSAEVGATN DVNPQDVSNL RQGGNLRLAL TAFPANFNSL HIDGNVADAA GMLRATMPRA FRIAADGSAT LNTDYFTGAE ITGTDPQVVT WTINPKAVWS DGTPITWEDI AAQVNATSGK EGFAFASPNG SDRVASVTRG ADDRQAVMTF AKPYAEWRGM LSGNTMLLPK SMTATPEAFN RGQLDAPGPS AGPFIVSNLD RTAQRITLTR NPKWWGEPPL LDSITYSVLD DAARIPALQN NALDATGLAT LDELTIARRT NGVAIRRAPG NSWYHFTFNG APGSILADKA LRQAIAKGID RQTIAAVTQR GLADDPVPLN NHIFVAGQQG YQDNSGVVAF DPEKAKQELD ALGWRLNGQF REKDGRQLVI RDVLFDALST RQFGQIAQNN LAQIGVKLEL DAKGAAGFFT DYINTGDFDI AQFSWVGDAF PLSGLTQIYA SNGESNFGKI GSPQIDAKIE EALEELDPAK AQQKANEVDK LLWDEVFSLP LTQSPGNVAV RANLANFGAF GLADADYSKI GFVK
|
| |