Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_15280 |
Symbol | |
ID | 7313121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 1634267 |
End bp | 1635622 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643611970 |
Product | glycoside hydrolase family 1 |
Protein accession | YP_002509272 |
Protein GI | 220932364 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000000225524 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA TAATATTTCC AGAAGATTTT ATCTGGGGAG CAGCTACGTC ATCCTACCAG ATAGAAGGAG CTTTTAATGA GGATGGAAAG GGAGAATCCA TCTGGGACAG ATTTAGCCAT ACCCCGGGAA AAATTGAAAA CGGTGACACC GGTGATATAG CCTGTGATCA TTATCATCTG TACCGGGAAG ATATTGAATT AATGAAAGAG ATAGGGATTA GGTCATACCG TTTTTCTACT TCCTGGCCCC GGATTCTTCC GGAAGGTAAA GGCAGGGTAA ACCAGAAAGG TCTTGATTTT TATAAAAGAC TGGTTGACAA TCTTCTTAAG GCCAATATCA GACCCATGAT AACCCTATAC CACTGGGATT TACCCCAGGC ATTACAGGAT AAAGGTGGCT GGACCAACAG GGATACAGCC AAATATTTCG CTGAATATGC CAGGCTTATG TTTGAAGAGT TTAACGGCCT GGTGGACCTC TGGGTTACCC ATAATGAACC CTGGGTAGTT GCCTTTGAGG GTCATGCTTT TGGTAACCAT GCCCCCGGTA CTAAAGATTT TAAAACGGCC CTTCAGGTCG CCCATCACCT GTTATTATCC CATGGAATGG CTGTTGATAT CTTCAGGGAG GAAGACCTGC CCGGGGAGAT TGGTATTACT CTCAACTTAA CCCCTGCTTA CCCGGCCGGT GACAGTGAGA AGGATGTTAA GGCAGCTTCT TTACTTGATG ACTATATTAA TGCATGGTTT TTATCTCCAG TGTTCAAGGG CAGTTACCCG GAGGAATTAC ACCACATCTA TGAACAGAAT TTAGGGGCCT TTACAACCCA ACCGGGTGAT ATGGATATAA TAAGCAGGGA TATTGACTTC CTGGGCATTA ATTACTACTC CAGGATGGTG GTCAGGCATA AACCGGGAGA TAATTTGTTT AATGCTGAAG TTGTAAAAAT GGAGGATAGG CCATCTACAG AGATGGGCTG GGAGATTTAT CCCCAGGGAC TTTATGATAT TTTAGTGAGG GTTAATAAAG AATATACCGA TAAGCCCCTT TACATAACAG AAAACGGGGC AGCTTTTGAT GACAAATTAA CAGAGGAAGG TAAGATCCAT GATGAGAAGA GGATTAACTA CCTGGGGGAT CATTTTAAGC AGGCATATAA AGCCCTTAAA GATGGAGTTC CCCTCAGAGG TTATTATGTG TGGTCATTGA TGGATAATTT TGAATGGGCC TATGGCTATA GCAAGCGCTT TGGTCTCATT TATGTTGATT ATGAAAATGG TAACAGACGC TTTTTAAAAG ATAGTGCCCT ATGGTATCGG GAGGTCATCG AAAAAGGCCA GGTTGAAGCT AACTAA
|
Protein sequence | MAKIIFPEDF IWGAATSSYQ IEGAFNEDGK GESIWDRFSH TPGKIENGDT GDIACDHYHL YREDIELMKE IGIRSYRFST SWPRILPEGK GRVNQKGLDF YKRLVDNLLK ANIRPMITLY HWDLPQALQD KGGWTNRDTA KYFAEYARLM FEEFNGLVDL WVTHNEPWVV AFEGHAFGNH APGTKDFKTA LQVAHHLLLS HGMAVDIFRE EDLPGEIGIT LNLTPAYPAG DSEKDVKAAS LLDDYINAWF LSPVFKGSYP EELHHIYEQN LGAFTTQPGD MDIISRDIDF LGINYYSRMV VRHKPGDNLF NAEVVKMEDR PSTEMGWEIY PQGLYDILVR VNKEYTDKPL YITENGAAFD DKLTEEGKIH DEKRINYLGD HFKQAYKALK DGVPLRGYYV WSLMDNFEWA YGYSKRFGLI YVDYENGNRR FLKDSALWYR EVIEKGQVEA N
|
| |