Gene Information |
Locus tag | Hore_03670 |
Symbol | |
ID | 7314042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 377424 |
End bp | 380498 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643610793 |
Product | glycoside hydrolase family 31 |
Protein accession | YP_002508123 |
Protein GI | 220931215 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
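The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive start and end coordinates and assumes that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but does not encode a residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Hore_03670.
length_bp = gene_length(377424, 380498)
print(length_bp)                  # 3075
print(protein_length(length_bp))  # 1024
```

With the record's values, 380498 - 377424 + 1 gives 3075 bp, and 3075 / 3 - 1 gives 1024 aa, matching the Gene Length and Protein Length rows.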
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.510709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCTGG ACAACTATAA TGTTGTTTTC AATATTCTTG AAGGTGAAGT GTTGCATATT CAAATTCAGG ATAAAAATAA TAAATCACCC TTTTTAAGGG TAACAGGGGG AGAGAAAGTT CTTAAATCAA TCGATTATCA TCAGGTTGAT AGTCGGGAGA TAATCCCTGT TTCAGAAAGG TATAATCTTG AAATAAATAA AGACAGAGAA AGTTTTAAAA TATTTGATAA AACCCGTCAG AAGATTGTAT TAGAGAGTTA TGACAGCTTT TTCCGCAAGG TAGATGAGAA GCAAAAAGGC CTTTCTCTGT CCATTAAAGA AAATGAGCTT TTTACAGGGC TGGGGCAGGA TGTAGAGGGC AAGTTATATA TAGAGGATAT TGAAAGAAGA TGCTGGAATG AGTGGAATGG GTACGATTAT CTTGGATCCA ACTCGGTACC TTTTTATTTA TCCAATCAGG GTTATGGCCT TTATTTAGAT ACAACCTATC CTTCCCGTTT TGTTTTCGGA AAAGGAGAGA TCCAGCCAAA ACCAATTGCC CATGAAGTAA TGGCTGAAAC TCCTTTTGAC TGGGAAGAGA AGGCCTGTGT AAATGCTGAA GATCAATTAA CTATTCTGGG CTGGGAAGAG GAACAGTTGG ATTTTTATAT CTTATTGGGA GATATGGCTG AAATTGAACA AAAGTATTAC CAGCTGACAG GTAAACCCAG TCTTTTGCCC AAATGGTCAT TGGGATATAT CCAGTGTAAA AATAGATATA AGAGTGAAGA AGAAATTTTA CACATTGCCA GAAAAATGAG GGAAAAGGGA ATACCCTGTG ATGTTATTGT TATTGACTGG CTCTGGTTTA AAGAATTTGG AGATTTATAC TGGGATGGGG AAAACTACTC AGAAAAGATG GCTGAAACCA TCAAAAAACT TAAGGATATG GGTATTAAAA TTTTACTGGC TGTCCATCCT TTTGTCGATT ATTCCAGTAA AAATTATAAA GAATTGAGTG AAAAAGGTTG CCTGTCAAAG GTACCCGAGG GGGGGCGGCC TTATTTCGAC CATACTAATC CTGCTACCAA GGAGGCCATG TGGAAGTTTT ATCAAAAGCT ATATGATGAA GGTGTAGCTG GATGGTGGAC AGATATGGGT GAACCAGAAT CAGACTTACC TGGTACACAA GGATATGCCG GTAAAAGAGA GGTCTACCAT AATGTCTATA CCCTGCTATG GAGTAAAAAT ATCATGGAAG CCCAGCGGGA AAATACCGGA AGCCGAAATT TCTGTCTGGC CCGTACCAAT GCCCTGGGGA TACAGAATTA CAATACCGCA TACTGGACAG GGGATATCTT TGCTACCTGG GAAATTTATC GTCGTAACAT TAAGGCCCTT CAGACTGTCT CTGTGTCCGG ACAACCATAT GTTTGTACTG ATATAGGTGG TTTCCATACC GATGAACGCT TTACTCCGGA ATTATATGTT CGCTGGTTGC AGTGGGGTGT ATTTGCCGGG TTATTCAGGG TCCATGGTGT TAAACCCGAA AATGAACCCT GGTCTCTGGG AGAATCTAAT GAAAAGATCA TTAAGAAGAT AATCGAATTT AGATACAGGT TTATTCCCTA TATTTATGAA AAGATGTATC AAATGCAGCA AAACGGCGAA GCATTTATTA GACCCCTGAT TTATGATTAC CCACAGGATG AAAAAGCTAT AGAAAGGGAG TATCAGTATC TCTTTGGGGA TATACTGGTC TGCCCTGTAG TTGAACCTGA TGTCAGGGAG ATCGATGTAT ATCTGCCGGC TGGAAAATGG 
TATGATTTCT ATAAAGGGAC AATGTATTAT GGAGGAGAAA CCTATAAGGC ATATGCACCT ATTGACAGGA TACCTCTTTA TGTAAAAGAC GGCAGTATTA TTCTTACTAC AGAACCGGAA GAAGATGTAA AAGATATTTA TGATAAATAT CAGTTGTTAA TCTATGGAGA GGGTTCAACA ACAGAATACA TTTATGAAGA TGACGGAACA AGTTATGGTT ATGAAGATGG TAGTTATAAT CTCATAAAAC TGGAGAAAAA AGATAACCAG TTAACAGTAA GCACCCTGCA GCATAAATAT AAAGAAGAAA ATACAAACAG GGAATTAGAA ATTGTATATT ATAAAAACAA TAAAAAATAT ACCAGGACAG TTGACTATAC CATTGGTGAG ACTATTAACA TATCTTTACA GGAAGGAAAA GAAAGTAATA TACAAGTGGT TAATGTAGAA ATGGATGTTG ATGTCACCCA TTCAGAGTAT AATGGTGACT TTTATGCCAA CTTATCTATT AAAAACAATT TAAATGAACC ACAGAATTTA AGAATTAGAA TTGAAAAACC AGACCATTAT TATGTAAAAG CACAAATCGA TCTACCTGAA TCCCTTGATT TCTCCAGTAA TGTTAAAGTG GAAGAGTCTG CAGAATATCT ATATCGTAAA ATTCAGGTTG AAGATACTTA TACTTCAGTT TTCCCCTTTA AACCTTTTAA AGATAAAATG CCTCAGCAAG AGAAGATTGA GGTAGCTATA GAAGATCAGA GGACAGGTAA GGTTCTGGAT AAAAAGATAA TAACCCTGGG AAATGGTTAT TTGAAAAACT GGCGCTATGC TGTTTCCAAA TATGAGGAGA TTGATAAGGA TGACTTAAAT TTTGCACCGG CCTTAGATTC AAACCCCTGG GGTTATATTT ACCTGTATAA ATACCTGAAT ATGCAGGAAA ACGGTATAAA CCCTGTTGAT TTTATAGAAA TTATCCAGAA AATTGGTTAT GGTTATGCCC GGGTAAATAT TCTGTCACCG GAAAATAAAA AAGCTTACTT ACGTATCAGA GCAGATGAAG GGTCTACTTT CTATCTCAAT GGAGAAAAAA TACATGAAAA TTCCCGATAT ACTATCGAAG AAGATATTTT AATTGAACTT GAAAAAGGTG TAAATTTACT GGAAGCTAAT GTTCAGTGGA AATCTCCCCG TCCTTTTACT GGAAGGGAAT TTGGTCTGTC AGCCCAGGTT CTTACCCTGG ACAAAGAGAT AGATGAGACT GTCAAGAGTT TCTAA
|
Protein sequence | MHLDNYNVVF NILEGEVLHI QIQDKNNKSP FLRVTGGEKV LKSIDYHQVD SREIIPVSER YNLEINKDRE SFKIFDKTRQ KIVLESYDSF FRKVDEKQKG LSLSIKENEL FTGLGQDVEG KLYIEDIERR CWNEWNGYDY LGSNSVPFYL SNQGYGLYLD TTYPSRFVFG KGEIQPKPIA HEVMAETPFD WEEKACVNAE DQLTILGWEE EQLDFYILLG DMAEIEQKYY QLTGKPSLLP KWSLGYIQCK NRYKSEEEIL HIARKMREKG IPCDVIVIDW LWFKEFGDLY WDGENYSEKM AETIKKLKDM GIKILLAVHP FVDYSSKNYK ELSEKGCLSK VPEGGRPYFD HTNPATKEAM WKFYQKLYDE GVAGWWTDMG EPESDLPGTQ GYAGKREVYH NVYTLLWSKN IMEAQRENTG SRNFCLARTN ALGIQNYNTA YWTGDIFATW EIYRRNIKAL QTVSVSGQPY VCTDIGGFHT DERFTPELYV RWLQWGVFAG LFRVHGVKPE NEPWSLGESN EKIIKKIIEF RYRFIPYIYE KMYQMQQNGE AFIRPLIYDY PQDEKAIERE YQYLFGDILV CPVVEPDVRE IDVYLPAGKW YDFYKGTMYY GGETYKAYAP IDRIPLYVKD GSIILTTEPE EDVKDIYDKY QLLIYGEGST TEYIYEDDGT SYGYEDGSYN LIKLEKKDNQ LTVSTLQHKY KEENTNRELE IVYYKNNKKY TRTVDYTIGE TINISLQEGK ESNIQVVNVE MDVDVTHSEY NGDFYANLSI KNNLNEPQNL RIRIEKPDHY YVKAQIDLPE SLDFSSNVKV EESAEYLYRK IQVEDTYTSV FPFKPFKDKM PQQEKIEVAI EDQRTGKVLD KKIITLGNGY LKNWRYAVSK YEEIDKDDLN FAPALDSNPW GYIYLYKYLN MQENGINPVD FIEIIQKIGY GYARVNILSP ENKKAYLRIR ADEGSTFYLN GEKIHENSRY TIEEDILIEL EKGVNLLEAN VQWKSPRPFT GREFGLSAQV LTLDKEIDET VKSF
|
| |
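The sequences above are displayed in space-separated groups of ten characters. A minimal sketch of how such a grouped gene sequence might be cleaned, summarized, and translated follows. It uses the codon assignments shared by the standard code and translation table 11; the helper names are hypothetical, and alternative start codons (which table 11 permits) are not handled because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    bases = clean(seq)
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence in this record.
print(translate("ATGCACCTGG ACAACTATAA TGTTGTTTTC"))  # MHLDNYNVVF
```

The first three groups of the gene sequence translate to MHLDNYNVVF, which is the first group of the protein sequence shown in the record. Applying `gc_content` to the full gene sequence would be one way to check the GC content row.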