Gene Information |
Locus tag | Tpet_0898 |
Symbol | |
ID | 5171160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | + |
Start bp | 917576 |
End bp | 919744 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640563416 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001244492 |
Protein GI | 148270032 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
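The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in Protein Length; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields for Tpet_0898.
length_bp = gene_length(917576, 919744)
print(length_bp)                  # compare with the Gene Length field
print(protein_length(length_bp))  # compare with the Protein Length field
```

Under these assumptions both results agree with the Gene Length (2169 bp) and Protein Length (722 aa) fields.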
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGAA AGATCGATGA AATCCTTTCA CAGCTGACTA TTGAAGAAAA AGTGAAACTT GTAGTGGGGG TTGGTCTTCC AGGACTTTTT GGAAATCCAC ATTCCAGAGT GGCAGGTGCA GCTGGAGAAA CGCATCCTGT TCCGAGGCTT GGAATTCCTT CTTTCGTTCT GGCCGACGGT CCCGCGGGCC TCAGAATAAA TCCCACAAGA GAGAACGACG AAAACACCTA TTACACAACA GCGTTTCCTG TTGAAATCAT GCTCGCTTCC ACCTGGAACA AAGATCTTCT GGAAGAAGTA GGAAAAGCTA TGGGAGAAGA AGTCAGGGAA TACGGTGTCG ATGTGCTTCT TGCACCTGCG ATGAACATTC ACAGGAACCC TCTTTGTGGA AGGAATTTCG AGTATTATTC AGAAGATCCT GTCCTTTCCG GTGAAATGGC TTCAGCCTTT GTCAAGGGAG TTCAATCTCA AGGGGTGGGA GCCTGCATAA AACACTTTGT CGCGAACAAC CAGGAAACGA ACAGGATGGT AGTGGACACG ATCGTGTCCG AGCGAGCCCT CAGAGAAATA TATCTGAAAG GTTTTGAAAT TGCCGTCAAG AAAGCAAGAC CCTGGACCGT GATGAGCGCT TACAACAAAC TGAATGGAAA ATACTGTTCA CAGAACGAAT GGCTTTTGAA GAAGGTTCTC AGGGAAGAAT GGGGATTTGA CGGTTTCGTG ATGAGCGACT GGTACGCGGG AGACAACCCT GTAGAACAGC TCAAGGCCGG AAACGATATG ATCATGCCTG GAAAAGCGTA TCAGGTGAAC ACGGAAAGAA GAGATGAAAT AGAAGAAATC ATGGAGGCGT TGAAGGAGGG AAGACTCAGT GAGGAAGTCC TGAACGAATG TGTGAGAAAC ATCCTCAAAG TTCTTGTGAA CGCGCCTTCC TTTAAAGGGT ACAGGTACTC GAACAAACCG GACCTCGAAT CTCACGCGAA AGTTGCCTAC GAAGCAGGTG TGGAGGGTGT TGTCCTTCTT GAGAACAACG GTGTTCTTCC ATTCGATGAA AGTATCCATG TCGCCGTCTT TGGCACCGGT CAAATCGAAA CAATAAAGGG AGGAACGGGA AGTGGAGACA CCCATCCGAG ATACACGATC TCTATCCTTG AAGGCATAAA AGAAAGAAAC ATGAAGTTCG ACGAAGAACT CACCTCCATC TATGAGGATT ACATCAAAAA GATGAGAGAA ACAGAGGAAT ATAAACCCAG AACTGACTCC TGGGGAACGG TTATAAAACC GAAACTTCCA GAGAACTTTC TCTCAGAAAA AGAGATAAAG AAGGCTGCGA AGAAAAACGA TGCTGCAGTT GTTGTAATCA GTAGGATCTC CGGTGAGGGA TACGACAGAA AGCCGGTGAA AGGTGACTTC TACCTCTCCG ATGACGAGCT GGAGCTCATA AAAACAGTCT CAAGGGAATT CCACGAACAG GGTAAGAAGG TTGTGGTTCT TCTCAACATC GGAAGTCCCA TTGAAGTTGC AAGCTGGAGA GATCTTGTGG ATGGAATCCT TCTCGTCTGG CAAGCAGGAC AGGAGATGGG AAGAATAGTG GCCGATGTTC TTGTGGGAAG GGTAAACCCC TCCGGAAAAC TTCCAACGAC CTTCCCGAAG GATTACTCGG ACGTTCCATC CTGGACGTTC CCAGGAGAGC CAAAGGACAA TCCGCAAAGA GTGGTGTACG AGGAAGACAT CTACGTGGGA TACAGGTACT ACGACACCTT TGGTGTGGAA CCTGCCTACG AGTTCGGCTA CGGCCTCTCT 
TACACAAAGT TTGAATACAA AGATTTAAAG ATCGCTATCG ACGGAGATAT ACTCAGAGTG TCGTACACGA TCACAAACAC CGGGGACAGA GCTGGAAAGG AAGTCTCACA GGTTTATGTC AAAGCTCCAA AAGGGAAAAT AGACAAACCC TTCCAGGAGC TGAAAGCGTT CCACAAAACA AAACTTTTGA ACCCGGGTGA ATCCGAAAAG ATCTTTCTGG AAATTCCTCT TAGAGATCTT GCGAGTTTCG ATGGGAAAGA ATGGGTTGTC GAGTCAGGAG AATACGAGGT CAGGGTCGGT GCATCTTCGA GGGATATAAG GTTGAGAGAT ATTTTTCTGG TTGAGGGAGA GAAGAGATTC AAACCATGA
|
Protein sequence | MMGKIDEILS QLTIEEKVKL VVGVGLPGLF GNPHSRVAGA AGETHPVPRL GIPSFVLADG PAGLRINPTR ENDENTYYTT AFPVEIMLAS TWNKDLLEEV GKAMGEEVRE YGVDVLLAPA MNIHRNPLCG RNFEYYSEDP VLSGEMASAF VKGVQSQGVG ACIKHFVANN QETNRMVVDT IVSERALREI YLKGFEIAVK KARPWTVMSA YNKLNGKYCS QNEWLLKKVL REEWGFDGFV MSDWYAGDNP VEQLKAGNDM IMPGKAYQVN TERRDEIEEI MEALKEGRLS EEVLNECVRN ILKVLVNAPS FKGYRYSNKP DLESHAKVAY EAGVEGVVLL ENNGVLPFDE SIHVAVFGTG QIETIKGGTG SGDTHPRYTI SILEGIKERN MKFDEELTSI YEDYIKKMRE TEEYKPRTDS WGTVIKPKLP ENFLSEKEIK KAAKKNDAAV VVISRISGEG YDRKPVKGDF YLSDDELELI KTVSREFHEQ GKKVVVLLNI GSPIEVASWR DLVDGILLVW QAGQEMGRIV ADVLVGRVNP SGKLPTTFPK DYSDVPSWTF PGEPKDNPQR VVYEEDIYVG YRYYDTFGVE PAYEFGYGLS YTKFEYKDLK IAIDGDILRV SYTITNTGDR AGKEVSQVYV KAPKGKIDKP FQELKAFHKT KLLNPGESEK IFLEIPLRDL ASFDGKEWVV ESGEYEVRVG ASSRDIRLRD IFLVEGEKRF KP
|
| |
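The gene and protein sequences above are printed in space-separated blocks of ten characters, and the record lists translation table 11. The sketch below shows one way to strip that formatting and translate the coding strand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), so a standard codon table is built by hand here. The helper names are hypothetical, and only short excerpts from the start and end of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        # Codons with ambiguous bases are reported as X (an illustrative choice).
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence shown above.
first_blocks = "ATGATGGGAA AGATCGATGA AATCCTTTCA"
print(translate(first_blocks))  # compare with the first block of the protein sequence

# Last fifteen bases of the gene sequence, which are in frame.
last_bases = "AGATTC AAACCATGA"
print(translate(last_bases))    # compare with the end of the protein sequence
```

Passing the complete Gene sequence field to `translate` should reproduce the Protein sequence field if the record is internally consistent.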