Gene Hore_23190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_23190 
Symbol 
ID7314202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2534773 
End bp2536878 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content46% 
IMG OID643612771 
Productalpha-glucosidase 
Protein accessionYP_002510059 
Protein GI220933151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.243429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGAA GGAGTATGAT CGGAAGGAGT ATTACTATAA CAGGTTTTAT CATAATAACA 
GTAATTGCTG TTTTTGTACT GGCTACAGGA GTATTTGCTA AAGGTCATGA TCAGGGAGAT
GAATCTGTGG TTACATCTCC TGATGGGAGG ATAAAGGTCA GGTTCATCCT TGATAAAGGA
GTACCCCACT ATTCGGTATC CTACGGAGAC ACTGTCCTCA TCAGGCCATC TTCACTGGGG
TTCCATTTCA AAGAGAAAAA ACCCCTTGAT GATAACTTTA AGATAATAGA TGTCAGGAGG
AATGTCTTTT ATAATACCTG GCGTCCGGTC TGGGGGCAGA CGTCAAAGAT AACCAATTAC
TACAATGAGC TGGTGATATA TCTCAAAGAA GAGGTGCCCC CCTACCGGAA GATGAATCTT
GTTTTCCGGG TCTATAATGA TGGGGTCGGC TTCCGGTATA TTATCCCCGG GCAGGAGTCC
CTGGAGACAA TAAATATTAT GTCAGAGGAT ACTGAGTTCC GTCTATCCAG TAATAATACT
ACATGGTGGA TCCACAATGA CTGGGACAGT TACGAGTATC AGTATCTGGA GACACCGTTA
AACCATGTAA TGTCTGCCAG TACTCCGGTT ACCATGAAAA CCCCCGGGGG AATTTACCTA
AGTATTCATG AAGCTGCCCT GGTCGACTAT GCCGGAATGG CCCTCAAGCG TGATCTTGAC
CGGGATTACA CCCTGGTAAG TGAGCTCTGT CCCTGGCCCG ATGGTGTCAA AGTTAAGGGA
CGGACACCCC TGAAAACACC GTGGCGGACC ATTCAGATAG GAGCAGCCCC CGGTGACCTG
TTAGAATCTA ATCTAATTTT GAACTTAAAT GACCCCTGTG CCCTTGAGGA TACTTCCTGG
ATTCAACCCA TGAAGTACGT GGGAATCTGG TGGGAGATGC ATATCGGGAA GTCAACCTGG
GAAGCAGGGC CGAGACACGG AGCTACCACG GAGCGGGCCA AATACTATAT TGATTTTGCC
GGTAAACACG GTATCGGTGG AGTACTGGTA GAGGGATGGA ATCTGGGCTG GGGTGGAACC
TGGGATGATC AGGACTATAC TACCCCTTAC CCCGACTTTG ACCTTGTAGA GGTTGCTAAA
TATGCTGAAG AACGTGGGGT TGAATATATT GCCCATAACG AAACAGGTGG CAATGTCATC
AATTATATTA ACCAGATAGA GGAGGCATAT AGTCTCTATA ATAGCCTTGG TATACATGCC
ATCAAGACCG GATATGTTGC TGATAATGGC ATGATTAAAC CCAGGGGTCA GCATCACCAC
GGCCAGTGGA TGGTTAACCA CTATCTGGAT GTAATTAAAA AGGCGGCAGA GTATGAAATT
ATGATTGATG CCCATGAACC GATTAAGCCG ACCGGTCTTT ACCGAACTTA CCCTAACTTT
ATAACCCGGG AGGGGGTTCA GGGCATGGAA TATAATGCCT GGAGTGCTGG AAATAAACCT
GAACATACTA CCATAATTCC CTTTACCAGA ATGCTGGCCG GGCCCATTGA TTATACCCCG
GGTATATTTG ATATAACCTT CGATGAATAC AGGTTTTTAA ACCGGGTTCA TACCACAAGG
GCCAAACAGC TGGCACTCTA TGTTGTTATC TTCAGTCCCC TCCAGATGGT AGCTGATCTC
CCGGAAAATT ATCTTGATGA TAATGGCAAT CCCCTGCCTG AATTTAAATT TATTCAGGAT
GTACCGGTTA CCTGGGACGA AACCCTGGTG CTTAATGCCA GGATAGGTGA TTATGTCACC
ATTGCCCGGC GCCGGGGTCA GGAATGGTAT GTAGGGAGTA TTACCGATGA AAAGCCGAGA
AGACTCATGG TTCCCCTGGC TTTCCTTGAG GATGGGCAAA AATATGTAGC TGAAATTTAT
GAGGATGGTC CGGAGGCTGA TTTAAAACAT AATCCGACCC AGGTGGCCAT CAGAAGGGTT
ATTGTTGACT CCAATGATAC CCTGGTTGCC GATATGGTAG AAAGCGGGGG CCAGGCCATC
AGACTTTATC CAGCCAGGAA TGAAGATGTT AATAAACTGC CGGAATTTAA TCAAAAGAAG
AACTAA
 
Protein sequence
MTGRSMIGRS ITITGFIIIT VIAVFVLATG VFAKGHDQGD ESVVTSPDGR IKVRFILDKG 
VPHYSVSYGD TVLIRPSSLG FHFKEKKPLD DNFKIIDVRR NVFYNTWRPV WGQTSKITNY
YNELVIYLKE EVPPYRKMNL VFRVYNDGVG FRYIIPGQES LETINIMSED TEFRLSSNNT
TWWIHNDWDS YEYQYLETPL NHVMSASTPV TMKTPGGIYL SIHEAALVDY AGMALKRDLD
RDYTLVSELC PWPDGVKVKG RTPLKTPWRT IQIGAAPGDL LESNLILNLN DPCALEDTSW
IQPMKYVGIW WEMHIGKSTW EAGPRHGATT ERAKYYIDFA GKHGIGGVLV EGWNLGWGGT
WDDQDYTTPY PDFDLVEVAK YAEERGVEYI AHNETGGNVI NYINQIEEAY SLYNSLGIHA
IKTGYVADNG MIKPRGQHHH GQWMVNHYLD VIKKAAEYEI MIDAHEPIKP TGLYRTYPNF
ITREGVQGME YNAWSAGNKP EHTTIIPFTR MLAGPIDYTP GIFDITFDEY RFLNRVHTTR
AKQLALYVVI FSPLQMVADL PENYLDDNGN PLPEFKFIQD VPVTWDETLV LNARIGDYVT
IARRRGQEWY VGSITDEKPR RLMVPLAFLE DGQKYVAEIY EDGPEADLKH NPTQVAIRRV
IVDSNDTLVA DMVESGGQAI RLYPARNEDV NKLPEFNQKK N