Gene Hore_04820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04820 
Symbol 
ID7314461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp518050 
End bp519348 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content42% 
IMG OID643610905 
ProductBeta-glucosidase 
Protein accessionYP_002508235 
Protein GI220931327 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0266886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTAA AACTACCACA GGATTTACTG CTCGGGGCAG CAACTTCAGC CCTGCAGATT 
GAAGGAGGAG ATAAAAATAA TAACTGGTAT CAGTGGTGTG AAGAGGGACA CATTAAAGAT
GGGAGTCATT GCCTTAATGC CAACGATCAC TGGAATAGAT ACCGCGAAGA TATTGAACTT
ATAAAAAAAC TGGGTCTTGA AACCTATAGA ATGGGGCTGG AGTGGAGTCG AATAGAGCAT
CAGCCCGGTA AATTTAGTAA GGAAGGAATC GAACATTACC GTGATGAAAT TACCCTTCTC
CTTGAAAATG GGGTTGTTCC CCTGGTGACC CTTCATCATT TTTCTCACCC CCTGTGGCTT
GTTAATAAAG GGGGCTGGGG GAATAAAAAG GTAGTTGATT ATTTTAAGCG GTATACAGAG
TATGTAGTCG AAAATCTGGG GGATCTGGTG AGTGACTGGA TTACCATTAA TGAACCCAAT
GTCTTTCTCT ATAATGGATA TGTTGAAGGT ATCTGGCCTC CGGGAAAAAA CAATATTTTT
TCTATGTTCA GGGCCATGAA GAATATGATA AAAGCCCATA TAGTCTCCTA TAAGACTATT
CATCAGGTCA GGTCTAAACA TAATTTTGAG GGAGAAACAA GGGTTGGAGT TGCCAACCAT
GTCAGACTGT TTGACCCGGC TGGAAATAAA AAAATACATG GAATACCGGC CCGCCTCCTT
GATTACTTTT TTCACCGCCT GGTTATGGAA GGAATGGCCA GGGGAAAGTT TATGTTTCCC
ATCGGTACCG GGGGACACCC CCTGGGGGAG GGGAGGTATT ATGACTTTAT CGGGATTAAT
TATTATACCA GGGATATTAT TAAGTTTACC CTGAATCCGG CCTCCCTGTT TGCCAGGATG
GAAGTTAAGG AAGGAGCAGA TACCAGTGAC CTCGGCTGGG AAATATATCC TGTGGGCCTG
AAGAGGGTCT GTCGTAAATA TTATGAGGAA TATCAGGCCC CTGTATTTAT TACCGAAAAC
GGTATTTGTG ATAAGGGGGA TACCAAAAGA GGGCACTTTA TCTATGACCA TTTAAAAGAA
GTAGTAAAGC TGATTAATGA AGGTATTCCC GTTGAGAGGT ATTATTACTG GACTTTGATA
GATAACTTTG AATGGATTGA AGGTGAAAGT GCCCGGTTTG GCCTGATCCA TAATGATTTT
AAAACTCAGA AACGATCCAT CAGGATCAGT GGTTATTTTT ATGGGAAAAT ATGCAAGACA
AAAGAGATTA CCCCCGAAAT GGAGAGAATT TATCTTTAA
 
Protein sequence
MSLKLPQDLL LGAATSALQI EGGDKNNNWY QWCEEGHIKD GSHCLNANDH WNRYREDIEL 
IKKLGLETYR MGLEWSRIEH QPGKFSKEGI EHYRDEITLL LENGVVPLVT LHHFSHPLWL
VNKGGWGNKK VVDYFKRYTE YVVENLGDLV SDWITINEPN VFLYNGYVEG IWPPGKNNIF
SMFRAMKNMI KAHIVSYKTI HQVRSKHNFE GETRVGVANH VRLFDPAGNK KIHGIPARLL
DYFFHRLVME GMARGKFMFP IGTGGHPLGE GRYYDFIGIN YYTRDIIKFT LNPASLFARM
EVKEGADTSD LGWEIYPVGL KRVCRKYYEE YQAPVFITEN GICDKGDTKR GHFIYDHLKE
VVKLINEGIP VERYYYWTLI DNFEWIEGES ARFGLIHNDF KTQKRSIRIS GYFYGKICKT
KEITPEMERI YL