Gene Hore_03670 details

Gene Information

Locus tag: Hore_03670
Symbol:
ID: 7314042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 377424
End bp: 380498
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table: 11
GC content: 37%
IMG OID: 643610793
Product: glycoside hydrolase family 31
Protein accession: YP_002508123
Protein GI: 220931215
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
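
The start, end, and length fields above are related by simple arithmetic. The sketch below reproduces the listed gene and protein lengths, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon. The record does not state either assumption explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Coordinates copied from the record above.
    n_bp = gene_length(377424, 380498)
    print(n_bp, "bp")                   # 3075 bp
    print(protein_length(n_bp), "aa")   # 1024 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.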


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.510709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCTGG ACAACTATAA TGTTGTTTTC AATATTCTTG AAGGTGAAGT GTTGCATATT 
CAAATTCAGG ATAAAAATAA TAAATCACCC TTTTTAAGGG TAACAGGGGG AGAGAAAGTT
CTTAAATCAA TCGATTATCA TCAGGTTGAT AGTCGGGAGA TAATCCCTGT TTCAGAAAGG
TATAATCTTG AAATAAATAA AGACAGAGAA AGTTTTAAAA TATTTGATAA AACCCGTCAG
AAGATTGTAT TAGAGAGTTA TGACAGCTTT TTCCGCAAGG TAGATGAGAA GCAAAAAGGC
CTTTCTCTGT CCATTAAAGA AAATGAGCTT TTTACAGGGC TGGGGCAGGA TGTAGAGGGC
AAGTTATATA TAGAGGATAT TGAAAGAAGA TGCTGGAATG AGTGGAATGG GTACGATTAT
CTTGGATCCA ACTCGGTACC TTTTTATTTA TCCAATCAGG GTTATGGCCT TTATTTAGAT
ACAACCTATC CTTCCCGTTT TGTTTTCGGA AAAGGAGAGA TCCAGCCAAA ACCAATTGCC
CATGAAGTAA TGGCTGAAAC TCCTTTTGAC TGGGAAGAGA AGGCCTGTGT AAATGCTGAA
GATCAATTAA CTATTCTGGG CTGGGAAGAG GAACAGTTGG ATTTTTATAT CTTATTGGGA
GATATGGCTG AAATTGAACA AAAGTATTAC CAGCTGACAG GTAAACCCAG TCTTTTGCCC
AAATGGTCAT TGGGATATAT CCAGTGTAAA AATAGATATA AGAGTGAAGA AGAAATTTTA
CACATTGCCA GAAAAATGAG GGAAAAGGGA ATACCCTGTG ATGTTATTGT TATTGACTGG
CTCTGGTTTA AAGAATTTGG AGATTTATAC TGGGATGGGG AAAACTACTC AGAAAAGATG
GCTGAAACCA TCAAAAAACT TAAGGATATG GGTATTAAAA TTTTACTGGC TGTCCATCCT
TTTGTCGATT ATTCCAGTAA AAATTATAAA GAATTGAGTG AAAAAGGTTG CCTGTCAAAG
GTACCCGAGG GGGGGCGGCC TTATTTCGAC CATACTAATC CTGCTACCAA GGAGGCCATG
TGGAAGTTTT ATCAAAAGCT ATATGATGAA GGTGTAGCTG GATGGTGGAC AGATATGGGT
GAACCAGAAT CAGACTTACC TGGTACACAA GGATATGCCG GTAAAAGAGA GGTCTACCAT
AATGTCTATA CCCTGCTATG GAGTAAAAAT ATCATGGAAG CCCAGCGGGA AAATACCGGA
AGCCGAAATT TCTGTCTGGC CCGTACCAAT GCCCTGGGGA TACAGAATTA CAATACCGCA
TACTGGACAG GGGATATCTT TGCTACCTGG GAAATTTATC GTCGTAACAT TAAGGCCCTT
CAGACTGTCT CTGTGTCCGG ACAACCATAT GTTTGTACTG ATATAGGTGG TTTCCATACC
GATGAACGCT TTACTCCGGA ATTATATGTT CGCTGGTTGC AGTGGGGTGT ATTTGCCGGG
TTATTCAGGG TCCATGGTGT TAAACCCGAA AATGAACCCT GGTCTCTGGG AGAATCTAAT
GAAAAGATCA TTAAGAAGAT AATCGAATTT AGATACAGGT TTATTCCCTA TATTTATGAA
AAGATGTATC AAATGCAGCA AAACGGCGAA GCATTTATTA GACCCCTGAT TTATGATTAC
CCACAGGATG AAAAAGCTAT AGAAAGGGAG TATCAGTATC TCTTTGGGGA TATACTGGTC
TGCCCTGTAG TTGAACCTGA TGTCAGGGAG ATCGATGTAT ATCTGCCGGC TGGAAAATGG
TATGATTTCT ATAAAGGGAC AATGTATTAT GGAGGAGAAA CCTATAAGGC ATATGCACCT
ATTGACAGGA TACCTCTTTA TGTAAAAGAC GGCAGTATTA TTCTTACTAC AGAACCGGAA
GAAGATGTAA AAGATATTTA TGATAAATAT CAGTTGTTAA TCTATGGAGA GGGTTCAACA
ACAGAATACA TTTATGAAGA TGACGGAACA AGTTATGGTT ATGAAGATGG TAGTTATAAT
CTCATAAAAC TGGAGAAAAA AGATAACCAG TTAACAGTAA GCACCCTGCA GCATAAATAT
AAAGAAGAAA ATACAAACAG GGAATTAGAA ATTGTATATT ATAAAAACAA TAAAAAATAT
ACCAGGACAG TTGACTATAC CATTGGTGAG ACTATTAACA TATCTTTACA GGAAGGAAAA
GAAAGTAATA TACAAGTGGT TAATGTAGAA ATGGATGTTG ATGTCACCCA TTCAGAGTAT
AATGGTGACT TTTATGCCAA CTTATCTATT AAAAACAATT TAAATGAACC ACAGAATTTA
AGAATTAGAA TTGAAAAACC AGACCATTAT TATGTAAAAG CACAAATCGA TCTACCTGAA
TCCCTTGATT TCTCCAGTAA TGTTAAAGTG GAAGAGTCTG CAGAATATCT ATATCGTAAA
ATTCAGGTTG AAGATACTTA TACTTCAGTT TTCCCCTTTA AACCTTTTAA AGATAAAATG
CCTCAGCAAG AGAAGATTGA GGTAGCTATA GAAGATCAGA GGACAGGTAA GGTTCTGGAT
AAAAAGATAA TAACCCTGGG AAATGGTTAT TTGAAAAACT GGCGCTATGC TGTTTCCAAA
TATGAGGAGA TTGATAAGGA TGACTTAAAT TTTGCACCGG CCTTAGATTC AAACCCCTGG
GGTTATATTT ACCTGTATAA ATACCTGAAT ATGCAGGAAA ACGGTATAAA CCCTGTTGAT
TTTATAGAAA TTATCCAGAA AATTGGTTAT GGTTATGCCC GGGTAAATAT TCTGTCACCG
GAAAATAAAA AAGCTTACTT ACGTATCAGA GCAGATGAAG GGTCTACTTT CTATCTCAAT
GGAGAAAAAA TACATGAAAA TTCCCGATAT ACTATCGAAG AAGATATTTT AATTGAACTT
GAAAAAGGTG TAAATTTACT GGAAGCTAAT GTTCAGTGGA AATCTCCCCG TCCTTTTACT
GGAAGGGAAT TTGGTCTGTC AGCCCAGGTT CTTACCCTGG ACAAAGAGAT AGATGAGACT
GTCAAGAGTT TCTAA
 
Protein sequence
MHLDNYNVVF NILEGEVLHI QIQDKNNKSP FLRVTGGEKV LKSIDYHQVD SREIIPVSER 
YNLEINKDRE SFKIFDKTRQ KIVLESYDSF FRKVDEKQKG LSLSIKENEL FTGLGQDVEG
KLYIEDIERR CWNEWNGYDY LGSNSVPFYL SNQGYGLYLD TTYPSRFVFG KGEIQPKPIA
HEVMAETPFD WEEKACVNAE DQLTILGWEE EQLDFYILLG DMAEIEQKYY QLTGKPSLLP
KWSLGYIQCK NRYKSEEEIL HIARKMREKG IPCDVIVIDW LWFKEFGDLY WDGENYSEKM
AETIKKLKDM GIKILLAVHP FVDYSSKNYK ELSEKGCLSK VPEGGRPYFD HTNPATKEAM
WKFYQKLYDE GVAGWWTDMG EPESDLPGTQ GYAGKREVYH NVYTLLWSKN IMEAQRENTG
SRNFCLARTN ALGIQNYNTA YWTGDIFATW EIYRRNIKAL QTVSVSGQPY VCTDIGGFHT
DERFTPELYV RWLQWGVFAG LFRVHGVKPE NEPWSLGESN EKIIKKIIEF RYRFIPYIYE
KMYQMQQNGE AFIRPLIYDY PQDEKAIERE YQYLFGDILV CPVVEPDVRE IDVYLPAGKW
YDFYKGTMYY GGETYKAYAP IDRIPLYVKD GSIILTTEPE EDVKDIYDKY QLLIYGEGST
TEYIYEDDGT SYGYEDGSYN LIKLEKKDNQ LTVSTLQHKY KEENTNRELE IVYYKNNKKY
TRTVDYTIGE TINISLQEGK ESNIQVVNVE MDVDVTHSEY NGDFYANLSI KNNLNEPQNL
RIRIEKPDHY YVKAQIDLPE SLDFSSNVKV EESAEYLYRK IQVEDTYTSV FPFKPFKDKM
PQQEKIEVAI EDQRTGKVLD KKIITLGNGY LKNWRYAVSK YEEIDKDDLN FAPALDSNPW
GYIYLYKYLN MQENGINPVD FIEIIQKIGY GYARVNILSP ENKKAYLRIR ADEGSTFYLN
GEKIHENSRY TIEEDILIEL EKGVNLLEAN VQWKSPRPFT GREFGLSAQV LTLDKEIDET
VKSF
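
The sequences above are laid out in space-separated blocks of ten characters. A minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated is given below. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, not in codon meanings). The helper names are hypothetical and are not part of the source database.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; '*' marks a stop codon.
# Assumption: translation table 11 uses these same assignments.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    fragment = clean_sequence("ATGCACCTGG ACAACTATAA TGTTGTTTTC")
    print(translate(fragment))  # MHLDNYNVVF
```

Applied to the full gene sequence, the same functions can be used to compare the computed GC percentage and translation against the GC content field and the protein sequence listed in this record.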