Gene Hore_20590 details

Gene Information

Locus tag: Hore_20590
Symbol:
ID: 7314383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2225388
End bp: 2228066
Gene Length: 2679 bp
Protein Length: 892 aa
Translation table: 11
GC content: 36%
IMG OID: 643612503
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_002509799
Protein GI: 220932891
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
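
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(2225388, 2228066)
print(length, protein_length_aa(length))  # prints "2679 892"
```

Both results agree with the Gene Length and Protein Length values listed in the record.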


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATGTA TTATTAAAGA ATTAAATGGA CTATGGAACT TTACGACAGA CCCAGAAAAA 
ATTGGAGAAC AGAATGAATG GTATATTAAC GGTTTTGAAG GTGAGTCCGT TAAGGTCCCC
GGGGCCTGGC AAACTTACAA TAAAGAGATG GTTACTTATA CAGGCTATGC CTGGTACTCA
AGGGAGTTTA ATATAAATGA AGCCATCCAC GAATTTAAGA GATTTTTCTT AAAGTTTGAA
GCGGTAGATT ATGTGTCTGA TGTATGGATA AATGGTAAAT ACCTGGGTTC CCACGAGGGA
GGGTATACCC CTTTCGATTT TGAAGTTACC GAGCACCTTA AAGAAGGGGA TAATTTAATT
GTGGTCAGGG TTTTTGATCC TGATGATAAT GATGAGATTC CCCATGGCAA ACAGGGAAGC
TGGTATACAA GGGTAAGTGG TATCTGGCAG GATGTATTAT TAGTTGGATA TGACAAAACC
TTTATTGAAA ATGTTCTTAT CACCCCTGAT ATTGATAATA AACGGGCTAT AACCCAAGTT
AAAATAAAAG ATAGAGATAG GCTAGTCAAC CCCAGAATAG AATTTAAAAT AGACCCCAGG
GTAAATGGTA ATAAAGTAGA GGGAAGTAAA ACACATACTT ATAAGTATTT CCTGGATGAA
CAGGAGTTAT ATGAACTGGA AATTTTAGAT TTATATTTAT GGGGTCCAGA ATCACCGGCC
TTATACGACA TGACAGTTAT ATTAAAGGAT GGCAATAGTA TAATTGACAG TTATCAAACT
TACTTTGGAA TGAGGAAAAT AGAATATAAA AATGGGATGG TATATTTAAA TCATAAACCA
TTATATATAA GAGGGGCCCT GGACCAGGCC TTCTGGCCTA AAACCATCTA CCGACCGGAA
AGTGAAGAGT TGATAAAAGA GGAGATATTA AAGGCTAAAG AGATGGGTTT TAACCTTTTA
AGAAAACATA TAAAGACAGA AGACCCCAGA TATCTTTACT GGGCCGATTA TCTTGGAATG
TTAATTTGGG AAGAACCGGC TAACTATGCC AGCTGGACCC CGCAGGCAAG GAAGAGGTTT
AAAAAGGAAT TTACCAGAAT GGTTAAAAGA GACTATAACC ATCCTTCAAT TATTGCCTGG
AGTATATACA ATGAAGAGTG GGGACTGGAA TGGAAACTAA AAGAAAATAA GGATATGCAA
AAGTGGGTTG AAGGTTTTTA TGAATATGCC AGGGAGCTGG ATCCTACCCG GTTAATCTGT
GACAATTCAG GATGGGCCCA TGTCAAAACA GATATAAATG ATTACCACCG ATATTTTGCT
GTACCGGAAA ATCATAAAGA ATGGCAGGAA GATCTGGATA ATTATATTAT AAAAAAACCT
GGAGCAAATT ATGTTGACGG GTATAAATAT AACGGAGAAC CTTTAATTGT TTCCGAGTTT
GGTATGTGGG GCTTACCGGA AATAAGTAAA ATCGAAGAGG CATATGAGGA GTTGCCTGAA
TGGTACCGGG GTAATTCAAA ATTATTTTCG GAGGACTTTA AAATACCTGC TACCCTGAAA
GAAAATTATA AAAAATATAA TCTTAATAAA ATATTCAAAA GTTATGATGA GCTATGTTAT
TTAACCCAGG AGAGGCAATT TAGAGGGGTA AAGAGTATTA TAGAGGAGAT GAGAAAAAGA
AGTGAGATTG CAGGATATGT AGTAACAGAA CTGACAGATA TTGAATGGGA AACTAACGGA
TTTCTGGATT ACTTTAGAAA TCCTAAATTT AGAAATAATT GTATTAACCA TTTTAATGGC
CAGGTGATTC TTGCTATTAA TATAAATAAA CATAACTTCT GGTCAGGTGA AGAATGCTCC
TTTACTCCCA TAGTTATAAA CAATAGTGAT AAAAAGATTC AGGGAGTTTT TAGATGGTAT
TTGGAAGAAA GCGAATTGAA GGGGATCTTT CCTGTTAAGA TACCGGCTTA TTCAAACCAG
CGTCTTGATG AAATTAAATT TAACTTCCCT GGTGACTGGA CAGGATCCAG AGGGGTTAAA
TTGAGGGTTG AGCTGGAAGA GGAAAAAGAG GTAGTTACTT CAAATTATGA AGAATTAACG
GTAACCAATC GAAGAGAAAT AAAGAAAACA GGGAAGTCAT TACAGGTAAA GGGTTTATCT
CATGAATTTA AAAATAAACT TAAGAACAAT GGATTTGAAT TGAAAACTAA TAATTCCCTT
GTACTTACTG ATAATTTGAC TGAAGAAGTA CTAAAAGAAG TGCGTAATGG GACCCGGGTA
GTTTTTCTGG CTGAAAATGG CAACAGGATT CAGGATAAGG GTTATATTAA TTTTACCAAA
TTGCCTGGTG GCGAGAGCTG GGATAGGGCT GCTACCTTTA ATTTCATAAA TACAGATATT
TTTGATGGTA TTCCCCTTTT AAAAATTTCG GGTTGGGAAC TTGAAGATAT CTATCCAGAT
TACAAGGTCA AAAACCTGGT TGATTTAAAT TGTACAGAAA TAATAAGTGG TAATTTTGCC
GGATGGTTAG GTGACTTTGG GGCAACTACC TTTGTTATGA ACTGGGGCAG GGGTCAGGTA
TTAGTTACAA CATTAAAATT AATTTCTAAT TATCATACTC ACCCTATTGC CAGTCTTTTA
TTAAATAAAC TTATAAATTA TTTTCAAGAA CACAAATAA
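
The GC content field of the record can in principle be recomputed from the gene sequence. The sketch below shows one way to do this, assuming the sequence is pasted as plain text with the space-separated 10-base groups used above. The helper name `gc_content` is hypothetical, and only the first row of the sequence is used here for brevity, so its result is not expected to match the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "ATGGGATGTA TTATTAAAGA ATTAAATGGA CTATGGAACT TTACGACAGA CCCAGAAAAA"
print(f"{gc_content(first_row):.1f}%")
```

Passing the full 2679 bp sequence instead of `first_row` gives the value to compare against the GC content field.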
 
Protein sequence
MGCIIKELNG LWNFTTDPEK IGEQNEWYIN GFEGESVKVP GAWQTYNKEM VTYTGYAWYS 
REFNINEAIH EFKRFFLKFE AVDYVSDVWI NGKYLGSHEG GYTPFDFEVT EHLKEGDNLI
VVRVFDPDDN DEIPHGKQGS WYTRVSGIWQ DVLLVGYDKT FIENVLITPD IDNKRAITQV
KIKDRDRLVN PRIEFKIDPR VNGNKVEGSK THTYKYFLDE QELYELEILD LYLWGPESPA
LYDMTVILKD GNSIIDSYQT YFGMRKIEYK NGMVYLNHKP LYIRGALDQA FWPKTIYRPE
SEELIKEEIL KAKEMGFNLL RKHIKTEDPR YLYWADYLGM LIWEEPANYA SWTPQARKRF
KKEFTRMVKR DYNHPSIIAW SIYNEEWGLE WKLKENKDMQ KWVEGFYEYA RELDPTRLIC
DNSGWAHVKT DINDYHRYFA VPENHKEWQE DLDNYIIKKP GANYVDGYKY NGEPLIVSEF
GMWGLPEISK IEEAYEELPE WYRGNSKLFS EDFKIPATLK ENYKKYNLNK IFKSYDELCY
LTQERQFRGV KSIIEEMRKR SEIAGYVVTE LTDIEWETNG FLDYFRNPKF RNNCINHFNG
QVILAININK HNFWSGEECS FTPIVINNSD KKIQGVFRWY LEESELKGIF PVKIPAYSNQ
RLDEIKFNFP GDWTGSRGVK LRVELEEEKE VVTSNYEELT VTNRREIKKT GKSLQVKGLS
HEFKNKLKNN GFELKTNNSL VLTDNLTEEV LKEVRNGTRV VFLAENGNRI QDKGYINFTK
LPGGESWDRA ATFNFINTDI FDGIPLLKIS GWELEDIYPD YKVKNLVDLN CTEIISGNFA
GWLGDFGATT FVMNWGRGQV LVTTLKLISN YHTHPIASLL LNKLINYFQE HK
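
The protein sequence corresponds to the gene sequence read codon by codon. Below is a minimal sketch of that translation. It assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (to my knowledge the two differ only in the permitted start codons, which this sketch does not handle). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases such as N raise KeyError.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above (20 codons).
first_row = "ATGGGATGTA TTATTAAAGA ATTAAATGGA CTATGGAACT TTACGACAGA CCCAGAAAAA"
print(translate(first_row))  # prints "MGCIIKELNGLWNFTTDPEK"
```

The output matches the first 20 residues of the protein sequence listed above.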