Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbis_1891 |
Symbol | |
ID | 9168385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobispora bispora DSM 43833 |
Kingdom | Bacteria |
Replicon accession | NC_014165 |
Strand | + |
Start bp | 2187836 |
End bp | 2189179 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003652496 |
Protein GI | 296269864 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.276259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACA AGTCGCGCCG CGGCAGAAGG CTCGCTGTCG CGGTGGCGGC GACGCTGGCC CTGGGCGTCA CCGCCGCCTG CGGCTCCGAC AACAGCAGCT CGCGCTCGTC CTCGGACGAT TCGTCCAAGG GTCCGGTCAC CATCACCGTG CAGACCTTCG GCGGTGGCAA GAACTTCGGG TTGCAGGAAG CGATCGAGAA GTGGAACGCC ACCCACACCG ATATCAAGAT CAAGCACGAG AACCTGACCG ACCAGTTCGA GCAGATCTAC TGGCCGCAGC TCATCCAGCA GCTCCAGTCC GGCGCCGGCG CGGGTGACGT CGTCGGCATC GAGGAGGGCG GCATGGGCCT CGCCAAGGCG CACCCCGAGT GGTGGGTCGA CCTCGGCGAG TACGGCCTGA CCAACCGCCA GTCGGACTTC CCCGCCTGGA AGTGGGAGAA CGGCATCACC ACCGACGGGA AGCTGTTCGC CCTCGGCACC GACATCGGCG GCATGAGCCT GTGCTACCGC CGCGACCTCT TCGAGAAGGC CGGGCTGCCG TCCGACCGCG AGGAGGTCAG CAAGCTCTGG CCCACCTGGG ACGACTTCAT CGAGGTCGGC AAGAAGTTCC AGGAGAAGAT CAAGGACACC AAGTTCGTCG ACGGCACCAA CACGCTCTTC AACGTCGTGC TGGTGCAGGA GGCGGCGAAG CACGGCAACG TCACCTACTT CGACCGCAAC AACAACCTGA TCATAGACCA GAACCCGGCG GTGAAGACCG CGTTCGACTT CGCCGTGAGG ATCAGCAAGG AGGGCCTCAC CGCGAAGCTG CGCAACTTCA CCGACGAGTG GTTCACCGGG ATGAAGAAGT CGCAGTTCGC CACGCTCGGC TGCCCGTCCT GGATGCTCGG CGTGGTGAGC GGTGACAACG CGGCCGGCCC CGAGCTGAAG GGCAAGTGGG ACGTGGCCGC GGTGCCCGGC GGCGGCGGCA ACTGGGGCGG CTCGTGGCTC GCCGTGCCGA AGCAGACCAA GCACCCGGCC GAGGCCGCCA AGGTGGTGGA CTACCTCACC AGCAAGGAAG CCCAGGTGAG CGTGTTCACC TACGCGGGGA ACATGCCGAG CAACATCCAG GCTCAGCAGG ACCCGGCGGT CCAGAACGCC ACGAACGAGT ACTTCAACAA CGCCCCGATC GGCGAGATCT TCGCCAAGTC GGCCTCTTCC CTGCAGCCCG TCTTCCTCGG GATCAAGCAC GCGCAGGTGA AGAACGCCGT CGAGTCGGTC ATCTCGGCCA TCGACGACGG CTCGGTGCCG TACGACCAGG CCTGGGAGAA GATCGTCGAC GACGCCAAGA AGGCCGCGAG CTGA
|
Protein sequence | MIDKSRRGRR LAVAVAATLA LGVTAACGSD NSSSRSSSDD SSKGPVTITV QTFGGGKNFG LQEAIEKWNA THTDIKIKHE NLTDQFEQIY WPQLIQQLQS GAGAGDVVGI EEGGMGLAKA HPEWWVDLGE YGLTNRQSDF PAWKWENGIT TDGKLFALGT DIGGMSLCYR RDLFEKAGLP SDREEVSKLW PTWDDFIEVG KKFQEKIKDT KFVDGTNTLF NVVLVQEAAK HGNVTYFDRN NNLIIDQNPA VKTAFDFAVR ISKEGLTAKL RNFTDEWFTG MKKSQFATLG CPSWMLGVVS GDNAAGPELK GKWDVAAVPG GGGNWGGSWL AVPKQTKHPA EAAKVVDYLT SKEAQVSVFT YAGNMPSNIQ AQQDPAVQNA TNEYFNNAPI GEIFAKSASS LQPVFLGIKH AQVKNAVESV ISAIDDGSVP YDQAWEKIVD DAKKAAS
|
| |