Gene Information |
Locus tag | Hore_20600 |
Symbol | |
ID | 7314384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2228098 |
End bp | 2229372 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643612504 |
Product | glycoside hydrolase family 1 |
Protein accession | YP_002509800 |
Protein GI | 220932892 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
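
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute a residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2228098
end_bp = 2229372

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is one contiguous run of codons.
# Every codon encodes one residue except the terminal stop codon.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1275 bp) and Protein Length (424 aa), and the length is an exact multiple of three.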
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAG ACATTCTAAG ACAAAGCAAG AAAGCATCAG ATTTTCGTTG GATAACAGGT ATTGAAGATA CCTTTGTAGG TCAGGTCAAA CCGGGGAAGA GGAGACTTGA TGAATACGAA TTGACCCAGC ATTATATCTT CTGGAAAGAA GATATAGATT TAATTAAGGA ATCGGGTTTT GAGATGACGA GATATGGTAT ACCGTGGTAT CGGGTAAACC CTGCTGATGG TGTTTTTGAT TGGAGCTGGA CTGATCAGGT CCTGGACTAT CTGGTAAATG TCAATAATAT TAGCCCCATT ATAGATTTAA TGCATTACGG GACACCACTA TGGTTAGATA ATGAATTTAT TAATCCTAAT TACCCCAGTA AAGTAGCCAG TTATGCCAGG GAATTTGCTG CCCGGTATAA GGATATTATT TATTATACTC CACTAAATGA ACCTTATATT AATGCTAAAT TTTGTGGGGA AACGGGTTTC TGGCCCCCGT ATTTAAGTGG TAACAGTGGT TTTGTTCAGG TTATGAAGAA TTTATGTAAA GGAATTATCT ACACAGTTAG AGAGATAAAG AAAGTAAATC CTGATTCCAA AATGATTCAT GTAGAAGCAA CCGGTGATTA TTTAACAGAT GATAAGTCTC TGAAAGATCG GGTTAAATTT GAAAAGGAAA GACATTTTTT AATGTTTGAT TTAATAACAG GTAGAGTTAA TAAAAATCAC TATTTATATA ATTATTTGAA AGAAAACGGG TTTACTAACG AGGATTTTAA ATGGTTTCAA GTAAATCGGA TAACCATTGA TATAATGGGA TTAAATTATT ATCCTGAATT ATCTGTTAAC CGGGTATTTA AGGACAATGA CGATATAAAG ACCGGTTTAA TCTGGGGTGG GGCTAAAGGG CTGGAAAAAG TATTAAAGGA ATATTATAAA AGGTATAAAA GACCGGTAGT CATAACTGAA ACAAGTACTA ATGGTACAGT CAATGATAGA ATAAACTGGT TGAAAGATTC AGTAGGGCTG GTCAGGGATT TAAGGAAAGA AGGGATTCCC TTGATCGGTT ATACCTGGTG GCCACTTTTT GACCTGGTAA ACTGGGATTA TATGGAAGGT CATAAACCCG TTGAAGAATA CCTTGAAGAA ATGGGTCTAT GGAGTCTTGA GATACAATTT AACGGGGTGT TAAAAAGGGT AAAAACCCCT GTGGTTGAAG TGTTTAAACA AATTGCTAAA GGAGATAAAC AGGAGATTGT TGGAGATATT GCTTTGAAAA AATAG
|
Protein sequence | MSQDILRQSK KASDFRWITG IEDTFVGQVK PGKRRLDEYE LTQHYIFWKE DIDLIKESGF EMTRYGIPWY RVNPADGVFD WSWTDQVLDY LVNVNNISPI IDLMHYGTPL WLDNEFINPN YPSKVASYAR EFAARYKDII YYTPLNEPYI NAKFCGETGF WPPYLSGNSG FVQVMKNLCK GIIYTVREIK KVNPDSKMIH VEATGDYLTD DKSLKDRVKF EKERHFLMFD LITGRVNKNH YLYNYLKENG FTNEDFKWFQ VNRITIDIMG LNYYPELSVN RVFKDNDDIK TGLIWGGAKG LEKVLKEYYK RYKRPVVITE TSTNGTVNDR INWLKDSVGL VRDLRKEGIP LIGYTWWPLF DLVNWDYMEG HKPVEEYLEE MGLWSLEIQF NGVLKRVKTP VVEVFKQIAK GDKQEIVGDI ALKK
|
| |
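
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is a hedged example, not the database's own tooling. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built in TCAG order; amino acid assignments are the same
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
head = "ATGAGCCAAG ACATTCTAAG ACAAAGCAAG"
print(translate(head))  # matches the first ten residues of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the 36% GC content. Those full-length results are not asserted here.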