Gene Hore_04180 details


Gene Information

Locus tag: Hore_04180
Symbol:
ID: 7314093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 440197
End bp: 442053
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 43%
IMG OID: 643610841
Product: beta-N-acetylhexosaminidase
Protein accession: YP_002508171
Protein GI: 220931263
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
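
The start, end, gene length and protein length values in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (neither is stated in the record). The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 440197
END_BP = 442053
GENE_LENGTH_BP = 1857
PROTEIN_LENGTH_AA = 618


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds_length(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert residues_from_cds_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```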


Plasmid Coverage information

Num covering plasmid clones: 68
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAGAC TAGCCGGCTT TAGTCTTCTG TACATGGTGC TGTTGATTTG TTTTATAGGG 
GGAACTGTTT TTGCGTCTAC TGAATTAGTC GAAAATGAAG AGTTTAAGAT AATACCAGAT
TATATGAACC CCTTTTTATC TATTGAGGAA AAGGTTGACA AACTTCTATC TGTGATGACC
CTGGAAGAAA AAATAGGACA GATGACCCAG GCCGAACGAA GATATATAAC ACCGGATGAA
GTTTATCAGT ACAAAATTGG TTCTATTTTG AGTGGTGGGG GTTCGACACC ATTTTCTAAT
ACACCTGAAG CCTGGGCTAA CATGTATGAC AGATTTCAGA AGTGGGCCAT GAAGACCAGG
CTAAAGATAC CAATAATCTA TGGTGTAGAC GCAGTCCATG GACATAATAA CCTCAGGGGG
GCGACCATCT TTCCCCATAA CATTGGCCTT GGGGCCACCC GGGATCCTGA ACTGGTGGAA
AAGGTAGGTA GAATTACTGC CAAAGAGGTT TCAGCCACTG GACCTGACTG GAATTTTGGT
CCCTGTGTGG CAGTGGCCCG GGATGAGAGA TGGGGCAGAA CCTACGAAAG TTTCGGAGAG
CATCCAGAAT TACAAAAATT ACTGGCCGGG GCCTATGTCA GGGGGTTACA GGGTCCAGAG
GCAGAGATGG ATGGAGAATA TGTGGTGGCC TGTGCCAAGC ATTATGTTGG TGATGGTGGA
ACTGAATGGG GAAGTGGTGA TGGAGGATAT TTAATTGACC GTGGCGACGT TACTGTTGAT
GAAAAAACCT TACGTGAAAT CCACCTTCCA GGTTATATTG AAGCTATTGA AGAAGGTGTC
GGTACCATTA TGGTATCATT TAACAGCTAT CAGGGAGTAA AAATGCATGC CCATAAATAC
CTGATTACTG ATGTCTTAAA AGGTGAGCTG GGTTTTGACG GATTTGTTGT TTCGGACTGG
AACGGAATCA ATGAGATATC AGGCTACAGT TATTATGAAA AAGTAGTTAA GTCAGTTAAT
GCCGGAATTG ATATGTTTAT GGTGCCTGAT AGCTGGAAGA AATTTATTTA TAACCTTAAG
CAGGCTGTAG AAAATGGAGA TGTAAGTGAA GAGAGGATTA ATGATGCGGT ACGGAGAATC
TTAACCGTCA AATTCAAAGC AGGTTTATTT GAAAAACCCT TTACTGATCG TAGCCATATC
TCCCTGATTG GCTCAGAAGA ACACCGTGAG GTAGCCCGGG AAGCAGTTCG AAAATCCCTG
GTTCTATTGA AAAATGAAAA TGTTCTACCC CTGGATAAGG ATAGTAAAAT TTATGTAGGT
GGTTCCAATG CCGAAGACAT TGGGAGTCAG TGTGGGGGCT GGACTATAAC CTGGCAGGGA
CGTTCCGGTG ATATTACTGA AGGGACCACA GTTCTGGAAG GTATTGAAGC AGCTATTGCT
GGCCGGGGTC AGGTTGTAAA TGATTTAAAT CAAGCTGATG TAGCGGTAAT AGTAGTAGGA
GAAGACCCTT ATGCTGAAGG CCGGGGGGAT AATGGAAGGC TGGAATTGAA ACAGGAAGAT
ATCAGCCTGC TAGAAAAGGT CACCGGGGCC GGAAAACCGG TTGTAGTAGT TATGATTTCC
GGTAGACCTT TGATTATAAG TGATTATATC GATGACTGGG ATGCTTTTGT AATGGCCTGG
TTACCTGGCA CAGAAGGTCA GGGTATAGCT GATGTGTTAT TCGGTGATTA TAATTTTACT
GGTAGATTAC CTGTTTCCTG GCCAGAAGAT GTTTCTCAGT TACCCATAAA TTATGGGGAT
GATGATTATG ACCCCTTATT CGAATATGGT ACTGGCCTTA AAATGGACCT TGAGTAA
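
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be analysed. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are illustrative, and only the first line of the sequence is included for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, as printed in this record.
first_line = (
    "ATGAAGAGAC TAGCCGGCTT TAGTCTTCTG TACATGGTGC TGTTGATTTG TTTTATAGGG"
)
seq = clean_sequence(first_line)
print(len(seq), "bases;", round(gc_percent(seq), 1), "% GC")
```

Applying the same two functions to the full listing gives a figure that can be compared with the GC content reported in the record.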
 
Protein sequence
MKRLAGFSLL YMVLLICFIG GTVFASTELV ENEEFKIIPD YMNPFLSIEE KVDKLLSVMT 
LEEKIGQMTQ AERRYITPDE VYQYKIGSIL SGGGSTPFSN TPEAWANMYD RFQKWAMKTR
LKIPIIYGVD AVHGHNNLRG ATIFPHNIGL GATRDPELVE KVGRITAKEV SATGPDWNFG
PCVAVARDER WGRTYESFGE HPELQKLLAG AYVRGLQGPE AEMDGEYVVA CAKHYVGDGG
TEWGSGDGGY LIDRGDVTVD EKTLREIHLP GYIEAIEEGV GTIMVSFNSY QGVKMHAHKY
LITDVLKGEL GFDGFVVSDW NGINEISGYS YYEKVVKSVN AGIDMFMVPD SWKKFIYNLK
QAVENGDVSE ERINDAVRRI LTVKFKAGLF EKPFTDRSHI SLIGSEEHRE VAREAVRKSL
VLLKNENVLP LDKDSKIYVG GSNAEDIGSQ CGGWTITWQG RSGDITEGTT VLEGIEAAIA
GRGQVVNDLN QADVAVIVVG EDPYAEGRGD NGRLELKQED ISLLEKVTGA GKPVVVVMIS
GRPLIISDYI DDWDAFVMAW LPGTEGQGIA DVLFGDYNFT GRLPVSWPED VSQLPINYGD
DDYDPLFEYG TGLKMDLE
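
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table in the standard NCBI ordering (bases T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model. The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned nucleotide sequence, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGAGACTAGCCGGCTTTAGTCTTCTG"))  # MKRLAGFSLL
```

The printed residues match the first 10 residues of the protein sequence listed above.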