Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbis_0842 |
Symbol | |
ID | 9167327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobispora bispora DSM 43833 |
Kingdom | Bacteria |
Replicon accession | NC_014165 |
Strand | - |
Start bp | 956229 |
End bp | 957527 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003651459 |
Protein GI | 296268827 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.516605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCACTC GCCACGTGGG ACGCCGGATC GGCCGGCTCG CCGCCCCCAT GCTCGCCGCC ACCCTCATCG CCACCGCGGC GGCCTGCGGC GGCGACGGCT CCACCCAGAC CGGCTCCGGC GGCCAGGAGA AGATCAAGCT GACGGTGGGC CTGTTCGGCG ACTTCGGCTT CGAGCCGCTG TACGAGGAGT TCAAGAAGAC CCACCCGAAC ATCGAGATCG AGGAGCGCCA GGCGTCCTTC GCCGACCACC ACACCAACCT CGCCGCCCAC CTCGCCACCG GTGCGGGCGC CGCGGACGTC GAGGCCATCG AGGTCGGCTA CATCAGCCAG TTCACCGCCC AGCCGGACAA GTTCCACGAC CTGCGCGAGT ACGGGCTCGA CAAGCGGCAG GGGGAGTACC TCCCCTGGAA GTGGCAGCAG GGCGTCGCCC CGAACGGCGC GCTCATCGGC CTCGGCACCG ACGTCGGCGG CCTGGCCATG TGCTACCGGA CCGACCTGTT CGAGAAGGCC GGCCTCCCCA CGGACCGGGA CGAGGTCTCC GCGCTCTGGC CCACGTGGGA GGACTTCATC GAGACCGGCA AGAAGTTCAT GAAGTCCGCG CCCAAGGGCA CGGCGTTCAT CGACAGCCCG GGCGAGATCC TCCGCGCGAT CATCGCGCAG GCCCCGGTCG GCGTGTACGA CCAGAACGAC AACATCGTCG TGGCGACCAA CCCGGACGTG AAGCGCGCCT GGGACCTGTC GGTCCAGATG ATCCAGGCCG GCCTCTCCGC GAAGATCGCC GCGTTCACCC CCGAGTGGAA CACCGGCTTC AGCAAGGGCA CCTTCGCCAC CGTGGTCTGC CCCGCCTGGA TGACCGCGTA CATCCAGGAC CAGGCCAAGA ACGCGGCGGG CAAGTGGGAC ATCGCCGCGA TCCCCGGCGG CGCGGGCAAC TCCGGCGGCA GCCACCTGAC CGTGCCCAAG CAGAGCAAGC ACCCCAAGGA GGCGGCCGAG CTGGTCGACT TCCTCACCTC CGCGGAGAGC CAGGCCAAGG TCTTCAAGAC CACCGGCAAC TTCCCGTCCA TCCCGTCGCT CTACGACCAG CCGGACATCC AGAACTTCAC CAAGGACTTC TTCAACGGCG CCCCGGTCGG GAAGATCTAC TCCGAGGCCG CGAAGAAGCT CCAGCCGCAG CACCTCGGCC CGCGTGAGGG CGACGTGCGC ACCGCGATCG GCAACGGCCT CGGCCGCGTC GAGCAGGGCA AGCAGACGCC CGAGGAGGCC TGGGCCCAGG TCCTCAAGGA CGTCGAGAAG ATCAAGTAA
|
Protein sequence | MGTRHVGRRI GRLAAPMLAA TLIATAAACG GDGSTQTGSG GQEKIKLTVG LFGDFGFEPL YEEFKKTHPN IEIEERQASF ADHHTNLAAH LATGAGAADV EAIEVGYISQ FTAQPDKFHD LREYGLDKRQ GEYLPWKWQQ GVAPNGALIG LGTDVGGLAM CYRTDLFEKA GLPTDRDEVS ALWPTWEDFI ETGKKFMKSA PKGTAFIDSP GEILRAIIAQ APVGVYDQND NIVVATNPDV KRAWDLSVQM IQAGLSAKIA AFTPEWNTGF SKGTFATVVC PAWMTAYIQD QAKNAAGKWD IAAIPGGAGN SGGSHLTVPK QSKHPKEAAE LVDFLTSAES QAKVFKTTGN FPSIPSLYDQ PDIQNFTKDF FNGAPVGKIY SEAAKKLQPQ HLGPREGDVR TAIGNGLGRV EQGKQTPEEA WAQVLKDVEK IK
|
| |