Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tter_1089 |
Symbol | |
ID | 8645584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobaculum terrenum ATCC BAA-798 |
Kingdom | Bacteria |
Replicon accession | NC_013525 |
Strand | + |
Start bp | 1171464 |
End bp | 1173851 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003322827 |
Protein GI | 269926204 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCTCTG AGACCTTCAA ATATAAAGAC GGTTCACTCC CAATCGATCA GAGAATAGAT GATCTTTTAT CCAGGATGAG CATCGATGAA AAGATTGCTC AGCTAGGCTG CATCTGGAGT ACTGACTTAA TACGAGAAGG GCGATTTGAT CCTGATTATG CGATTAGTCA AATTCCAAAT GGTATTGGAC AAATTACACG CATCGGCGCA GCTACCGGTC TTCGTCCTAA TGAAAGTGCA AATCTTATGA ATTCTATTCA GAAGGTTGTG ATTGAGAGGA CGAGATTAGG CATACCAGTC TTTATTCATG AGGAGTCCGT CGGAGGCTTT TGTCATAGAG ATGCCACTGT CTTTCCCCAG GCTTTAGGGC TGGCTTGTTC TTGGAACCCC GAGCTAATAG AAAAAGTTGC TCAGGTGATT AGAGAGCAAA TGTTGGCTGT GGGAGCCAGG CTAGCACTAG CTCCGGTGCT TGATGTGGCT AGAGATCCTC GTTGGGGAAG GGTCGAGGAA ACTTATGGTG AAGATCCTGT TCTGGTGGGG ACACTGGGCA CAGCCTATAT AAAGGGTCTG CAAGGGGATG ATCTTGCTCA AGGGGTTGCA GCCACAGGTA AGCACTTCCT GGCTTATTCT TTCTCGCTAG GAGGAAGAAA TTGGGGACCT GTTCATGTAG GTCCACGCGA ACTTAGAGAG GTTTATGCCG AACCATTCGC AGCAGCGATA AGGGATGCTG GTTTATCAGT AATTATGAAT TCGTACGCTT CTGTTGATGG TTTACCCTGT GCCGGAAGTA AGTCTATCCT TACCGACCTT TTGCGCAAGG AGTTAGGGTT TAGAGGGTCC GTAGTCGCTG ACTACTTCTC CGTTGAGATG TTGCGATCTT TTCACAAGGT TGCTGCTGAT AAGTCAGAAG CCGCCTGCAT AGCTCTAAAT GCCGGATTGG ACATGGAGCT GCCTGCTTTG GATTGCTTCG GCGAACCCCT AAAGAAGGCT ATCGAAGATG GAAGCATCAA AATTGAGTTG ATAGATGCTG CAGTTAGAAG AGTGCTTGAG CTTAAGTTCC GATTAGGCTT GTTCGAGAAT CCATATGTTG ATGCAGGGGT TGCTAGCTCG AAGTTCCAGA CACCAGAGCA GAGACAACTG GCTTATCAGG CTGCAGCAGA GTCGGTAGTG TTACTAAAGA ACGATGGTGT GTTGCCGATA TCGAAGGATG ATGTGAAGTC AATAGCAGTC ATAGGTCCCG CAGCAGACGA TAAGAGGCTC CTACAAGGGG ATTACCACTA TCCAGCACAC CTTGAGTCTT TGTTCGAGTC TCAATCAGAT ACGGAGTCCT TAGGGTTGCT ATCAGAAGAG CCAGCACCCA CTCCTGCTGG TCAACTTAAT CTAGGAAACT TTGCTCCTGG TCCTTACTAC ACTCCGCACG TTACTCCACT ACAGGCAATA AGAGATAAGC ACCCCGATAT CGACGTAATT TATGAAAAGG GTTGCGATAT TTTGGGAGAT GACAGATCTG GGTTTGCTGC TGCAGTTAAT GCTGCCAGTA ACGCTGATGT TTCTATAGTT TTTGTGGGTG GCAAGAGTGG ATTGAAGAGA CCAGCTACTT CCGGAGAGGC AAATGACGCA ACATCCTTAT CTTTGACAGG CGTGCAAGCC GATTTGGTCA GAGCTATAGC TGAAGCTGCT AGGAAGCTGG TCGTAGTCGT AATAAGTGGT AGAGTTCATA CGCTGGAAGA CCTGGTTGAT TCAACTAATG CCTTAATCTT CTGCGTGCCT CCTGGGGAGG AGGGGGGCAA TGCTATAGTT GATGTGCTGT TTGGGAGTGT TTGCCCGAGC GGGAAACTTC CTGTTAGCTT CCCTCGTAGG GTGGGCCAAG TACCTGATTA CTTTGGTCAA AGGAATGGTG GGGACAAGGC GATGTTCTTT GGAGATTACA TAGACTCCAC TGTAGATCCT CTGTTCCCAT TTGGCTATGG CTTGTCCTAT ACACGCTTTG AGTACAGCCA GCCAAATATC GAAGTTGGTG ACACCACCAA GCCCACAGCC ATCTCCTTTG AAATTAGGAA TGTAGGGGAA TATACGGGGA GTGAGGTTGT TCAGCTTTAC TGTCAAGATG TTGTAGCTTC TGTTTCCCGT CCGACAAACA TGTTACTAGG ATTCACAAAA GTGAGATTGG ATCCCGGTCA ATCCAAGAAG TTAACCTTTA TAGTCCATCC CTCTAGATTA GCTTTTTATA ACGAGGCTAT GCAATTTGTT ACTGAGCCTG GACAATATAT CTTTCGTGTT GGATCCTCCT CGGTAGATAT TAGACATGAA CTTGATGTAA CACTACCAGG AGAGACTACT TACTATAACC AGAGGGATGT AGTCGCCACT ACTGTTGTCG TTGAATAG
|
Protein sequence | MSSETFKYKD GSLPIDQRID DLLSRMSIDE KIAQLGCIWS TDLIREGRFD PDYAISQIPN GIGQITRIGA ATGLRPNESA NLMNSIQKVV IERTRLGIPV FIHEESVGGF CHRDATVFPQ ALGLACSWNP ELIEKVAQVI REQMLAVGAR LALAPVLDVA RDPRWGRVEE TYGEDPVLVG TLGTAYIKGL QGDDLAQGVA ATGKHFLAYS FSLGGRNWGP VHVGPRELRE VYAEPFAAAI RDAGLSVIMN SYASVDGLPC AGSKSILTDL LRKELGFRGS VVADYFSVEM LRSFHKVAAD KSEAACIALN AGLDMELPAL DCFGEPLKKA IEDGSIKIEL IDAAVRRVLE LKFRLGLFEN PYVDAGVASS KFQTPEQRQL AYQAAAESVV LLKNDGVLPI SKDDVKSIAV IGPAADDKRL LQGDYHYPAH LESLFESQSD TESLGLLSEE PAPTPAGQLN LGNFAPGPYY TPHVTPLQAI RDKHPDIDVI YEKGCDILGD DRSGFAAAVN AASNADVSIV FVGGKSGLKR PATSGEANDA TSLSLTGVQA DLVRAIAEAA RKLVVVVISG RVHTLEDLVD STNALIFCVP PGEEGGNAIV DVLFGSVCPS GKLPVSFPRR VGQVPDYFGQ RNGGDKAMFF GDYIDSTVDP LFPFGYGLSY TRFEYSQPNI EVGDTTKPTA ISFEIRNVGE YTGSEVVQLY CQDVVASVSR PTNMLLGFTK VRLDPGQSKK LTFIVHPSRL AFYNEAMQFV TEPGQYIFRV GSSSVDIRHE LDVTLPGETT YYNQRDVVAT TVVVE
|
| |