Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbis_1850 |
Symbol | |
ID | 9168344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobispora bispora DSM 43833 |
Kingdom | Bacteria |
Replicon accession | NC_014165 |
Strand | - |
Start bp | 2134964 |
End bp | 2136718 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003652455 |
Protein GI | 296269823 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGC GACTGGCCAG GCCCGCGGTC ACCGCCGCCC TAGCCGCCCT CACGCTGGGC GTCTCCGCCT GCGGCGGTGG CGGCGGGGGA CAGAGCACCG GCGGTGGATC CGCCTCGTCC GGCGACGGGT TCAACGCGGC TCTGGAGAAG GTGGTCAACC CTTCGGACAA GAAGGGCGGC ACCCTGAAGA TGGCGATCAC CGAGGCGTGG GACTCCATCG ACCCCGGTGA CACCTACTAC GCCCTCTCGT GGAACCTGCT CCGGCTCTAC GGGCGGCCGC TCGTCACCTA CAAGCCCGCC CCGGGCGCGG AGGGCCGCGA GCTCGTCCCC GACCTGGCCG AGAGCCTCGG CACCCCGTCC GACGGCGGCA AGACCTGGAC CTACCGGCTC CGCGAGGGCC TCAAGTTCGA GGACGGCACC CCGATCACGT CCAAGGACGT CAAGTACGCG GTGCTCCGCT CGCTCGACAA GAAGACCTTC GTCAACGGCC CGACGTACTT CAACGACTGG CTCGACCTGC CGAAGGACTT CGTCAGCGTC TACGAGACGC CGGACGTCAA CACCGACCAG GCGATCGAGA CGCCGGACGA CCGGACGATC GTCTTCCACC TGAAGAAGCC GTACGCGGGC TTCGACAACT TCGCGGCGCT GCCGTCCACC GTCCCGGTCC CCAAGGACAA GGACACCGGG GTCAAGTACC GGCAGCACCC GATCGCCTCC GGGCCGTACA TGTTCGAGAA GGTCGAGGAG GGCAAGCAGT ACACCCTGGT GCGCAACCCG CACTGGGACC CGAACACCGA CCCGATCCGC AAGGCCCTCC CGGACCGCAT CGAGATCTCC CTCGGCGTCG AGGCGAACGA CCTGGACAAC CGCCTCATCT CGGGCGACGT CCACGTCGAC CTCGCCGGCA CCGGCGTGCA GGCGGCCGCC CTCGGCACCG TCCTGGGCGA CCCGGCGCTC AAGGCGCGGG CGGACAACCC GACCAGCGCC CGCACCTGGT TCATCTCGAT CCTCGACACC CCGCCGCTCG ACAACGTCGA GTGCCGCAAG GCGATCATCT ACGCGGCCGA CCGGGTCGGG CTGCAGAACG CCTACGGCGG CGAGCTCCAG GGCGAGATCG CCACGGGCCT GATGCCGCCG TCGATCGACG GATGGCAGAA GCTCGACCTG TACCCCACGG GCACCGGTGA CCTGGAGAAG GCGAAGGCCG CCCTCCAGGC CTGCGGCCAC CCCGACGGCT TCGAGACCGT CATGGTCTAC CGCTCGGACC GGCCGAAGGA GCAGGCCGCG GCCGAGTCGC TCCAGCAGGC GCTCGCCCGG GTCGGCATCA AGCTCACCCT GAAGGGCTAC CCGACCAGCG ACTACTTCGC CAGCTACGCC GGCAAGCCCG AGTTCACCCG GAAGAACAAC GTCGGCCTCG CCGCCCACGG GTGGGCCGCG GACTGGAACG ACGGCTTCGG CTTCCTGTCG CAGATCGTCG ACAGCCGGAC CATCCGCGAG TCCGGCAACT ACAACCTCAG CGTGAAGAGC AAGGAGATCG ACGAGCTGAT CGACAAGGCC ATGGCCGAGC CGGACAAGGC CAAGCGCGAC GCCATCTGGG GTGAGATCGA CCGGAAGGTC ATGGAGGGCG CGTACGTCCT GCCGGCGGTG TGGGCCAAGG CGCTGCTCCT GCGGGGCAAG GGCGTGACCA ACGTGTTCGT CACGGGCGCC TTCGACATGT ACGACTACCT GAACATGGGC GTCGAGCAGC CGTAA
|
Protein sequence | MKKRLARPAV TAALAALTLG VSACGGGGGG QSTGGGSASS GDGFNAALEK VVNPSDKKGG TLKMAITEAW DSIDPGDTYY ALSWNLLRLY GRPLVTYKPA PGAEGRELVP DLAESLGTPS DGGKTWTYRL REGLKFEDGT PITSKDVKYA VLRSLDKKTF VNGPTYFNDW LDLPKDFVSV YETPDVNTDQ AIETPDDRTI VFHLKKPYAG FDNFAALPST VPVPKDKDTG VKYRQHPIAS GPYMFEKVEE GKQYTLVRNP HWDPNTDPIR KALPDRIEIS LGVEANDLDN RLISGDVHVD LAGTGVQAAA LGTVLGDPAL KARADNPTSA RTWFISILDT PPLDNVECRK AIIYAADRVG LQNAYGGELQ GEIATGLMPP SIDGWQKLDL YPTGTGDLEK AKAALQACGH PDGFETVMVY RSDRPKEQAA AESLQQALAR VGIKLTLKGY PTSDYFASYA GKPEFTRKNN VGLAAHGWAA DWNDGFGFLS QIVDSRTIRE SGNYNLSVKS KEIDELIDKA MAEPDKAKRD AIWGEIDRKV MEGAYVLPAV WAKALLLRGK GVTNVFVTGA FDMYDYLNMG VEQP
|
| |