Gene Hore_20600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_20600 
Symbol 
ID7314384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2228098 
End bp2229372 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content36% 
IMG OID643612504 
Productglycoside hydrolase family 1 
Protein accessionYP_002509800 
Protein GI220932892 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAAG ACATTCTAAG ACAAAGCAAG AAAGCATCAG ATTTTCGTTG GATAACAGGT 
ATTGAAGATA CCTTTGTAGG TCAGGTCAAA CCGGGGAAGA GGAGACTTGA TGAATACGAA
TTGACCCAGC ATTATATCTT CTGGAAAGAA GATATAGATT TAATTAAGGA ATCGGGTTTT
GAGATGACGA GATATGGTAT ACCGTGGTAT CGGGTAAACC CTGCTGATGG TGTTTTTGAT
TGGAGCTGGA CTGATCAGGT CCTGGACTAT CTGGTAAATG TCAATAATAT TAGCCCCATT
ATAGATTTAA TGCATTACGG GACACCACTA TGGTTAGATA ATGAATTTAT TAATCCTAAT
TACCCCAGTA AAGTAGCCAG TTATGCCAGG GAATTTGCTG CCCGGTATAA GGATATTATT
TATTATACTC CACTAAATGA ACCTTATATT AATGCTAAAT TTTGTGGGGA AACGGGTTTC
TGGCCCCCGT ATTTAAGTGG TAACAGTGGT TTTGTTCAGG TTATGAAGAA TTTATGTAAA
GGAATTATCT ACACAGTTAG AGAGATAAAG AAAGTAAATC CTGATTCCAA AATGATTCAT
GTAGAAGCAA CCGGTGATTA TTTAACAGAT GATAAGTCTC TGAAAGATCG GGTTAAATTT
GAAAAGGAAA GACATTTTTT AATGTTTGAT TTAATAACAG GTAGAGTTAA TAAAAATCAC
TATTTATATA ATTATTTGAA AGAAAACGGG TTTACTAACG AGGATTTTAA ATGGTTTCAA
GTAAATCGGA TAACCATTGA TATAATGGGA TTAAATTATT ATCCTGAATT ATCTGTTAAC
CGGGTATTTA AGGACAATGA CGATATAAAG ACCGGTTTAA TCTGGGGTGG GGCTAAAGGG
CTGGAAAAAG TATTAAAGGA ATATTATAAA AGGTATAAAA GACCGGTAGT CATAACTGAA
ACAAGTACTA ATGGTACAGT CAATGATAGA ATAAACTGGT TGAAAGATTC AGTAGGGCTG
GTCAGGGATT TAAGGAAAGA AGGGATTCCC TTGATCGGTT ATACCTGGTG GCCACTTTTT
GACCTGGTAA ACTGGGATTA TATGGAAGGT CATAAACCCG TTGAAGAATA CCTTGAAGAA
ATGGGTCTAT GGAGTCTTGA GATACAATTT AACGGGGTGT TAAAAAGGGT AAAAACCCCT
GTGGTTGAAG TGTTTAAACA AATTGCTAAA GGAGATAAAC AGGAGATTGT TGGAGATATT
GCTTTGAAAA AATAG
 
Protein sequence
MSQDILRQSK KASDFRWITG IEDTFVGQVK PGKRRLDEYE LTQHYIFWKE DIDLIKESGF 
EMTRYGIPWY RVNPADGVFD WSWTDQVLDY LVNVNNISPI IDLMHYGTPL WLDNEFINPN
YPSKVASYAR EFAARYKDII YYTPLNEPYI NAKFCGETGF WPPYLSGNSG FVQVMKNLCK
GIIYTVREIK KVNPDSKMIH VEATGDYLTD DKSLKDRVKF EKERHFLMFD LITGRVNKNH
YLYNYLKENG FTNEDFKWFQ VNRITIDIMG LNYYPELSVN RVFKDNDDIK TGLIWGGAKG
LEKVLKEYYK RYKRPVVITE TSTNGTVNDR INWLKDSVGL VRDLRKEGIP LIGYTWWPLF
DLVNWDYMEG HKPVEEYLEE MGLWSLEIQF NGVLKRVKTP VVEVFKQIAK GDKQEIVGDI
ALKK