Gene Information |
Locus tag | Tpen_1494 |
Symbol | |
ID | 4601403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1444122 |
End bp | 1446227 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639774269 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_920894 |
Protein GI | 119720399 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
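
The Start bp, End bp, Gene Length, and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. Both readings are assumptions inferred from the numbers in this record, not stated by it. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose
    final codon is a stop codon (so it contributes no residue)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tpen_1494.
length_bp = gene_length(1444122, 1446227)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the coordinates listed for Tpen_1494 this gives 2106 bp and 701 aa, matching the Gene Length and Protein Length fields.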
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.643815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAGGGGAA GGGGCCCTGG GCTCTCGCTC GAGGAAAAGG CTAGCCTCGT CGTCGGCTGG GGCTCTTCCA GGCGCCTACC GGGAGCCGCC GGGGAGACGC GGCCCGTCAG GGTACCGTCC ATAGTGACTG CCGACGGCCC CTCTGGGCTC AGGGTTGAGC CTAGCGGGGG GCGCAGGTGG TTTGCGACAG CCTTCCCGGT GCCGACAATG CTGGCTGCGA CGTGGAACCC CGAGGTCGTG GAGAGGGTGG GCAGGGCTAT AGGAGAGGAG TGCAGGGCTT ACGGCGTGGA CGTGCTCTTG GCTCCCGGGG TGAACATGCA CAGGCACCCG CTGTGCGGCA GGAACTTCGA GTACTTCAGC GAAGACCCTC TTCTCAGCGG GGAGATGGCG GCCGCGTACG TTAGGGGCGT GCAGTCCGTG GGCGTCGGTG CGACGTTGAA GCACTTCGCG GCTAACGACC AGGAGACTAA CAGGACGGTT ATAGACACCG TGGTCTCCGA GAGGGCTCTC CGCGAGATCT ACCTCAAGCC CTTCGAGATA GCGGTGAAGA AGGCTAAGCC GTGGTGCGTG ATGAGCTCCT ACAACAAGCT CAACGGGAAG TACTCCTCCC AGAACGAGTG GCTCCTAACC AGGGTGCTCA GAGAGGAATG GGGGTTCGAC GGGGTAGTGA TGACGGACTG GGGGGCCGGG GACGACTCGG TGGAGCAGGT CAACGCTGGG AACGACCTGA TAATGCCCGG GAGCGACGAG GCCGTCGAGA AGCTCTTGGA GGCCGCTAGG AGCGGCAGGC TGCGGCTAGA GGCTCTCGAA GCCTCCGCGG AGAGGGTTCT CAGGCTTGTG AGGAAGTCGC TAACGTACAG GGGCTACAGG CCTAGAGGCG CGCCGGACCT AGATGGGCAC GCTAGGGTAG CCTACGAGGC GGCATCCGAG GGGGTCGTGC TCCTCAAGAA CGAGGGAGCG CTACCCCTGG GCCCCGGCGC CAGGGTGGCT TTATTCGGGA CGGGGCAGGT TGAGACGCTG AAGGGCGGGA TGGGGAGTGG TCACACGCAC CCGAGGTACG TGGTCACGGT ACTGGAGGGG TTGAAGAGCG GGGGGTTGCT GGTCGACGAG GAGCTCTCCT CGATCTACGA GCGCTACGTG CGCGAAGCGA GGGGGGAGGA GTTCCTCGAG AAGCTCTACC TCGACGAGGT GTACGCGGAC CCGCTACCCC AGGACATCGT CAGCGAGGGC GACGCGGCTA GGTTCGCCGA GAGGAACGAC GCGGCGGTGG TCGTGCTCTA CAGGGTCTCC GGGGAGGGGT GGGACAGGCG CCCGGTTAGG GGCGACTTCT ACCTCACGGA GAGCGAGGAA AGGTTGCTGA GGCTCGTATC GCGGGAGTTC CGCGGGCGGG GTAAGAAGGT GGTAGTCGTG CTCAACGTGT GCGGCCCAAT CGAGGTTGCG AGCTGGAGGG ACCTCGTCGA CGCGATCCTC GTAGTCTGGC TACCGGGGCA GGAGGCCGGC AGGGTGGTAG CAGACGTGCT GGCAGGCAGA GTAAACCCGT CCGGGAAGCT CCCTATGACG TGGCCGCGGG ACTGGACGGA CGTCCCGGCG GCGAAGGCGC CCGAGTGCTA CCCGGGGCTA CCCGTGGAGG ATCCCCGCAG AGTCGTGTAC TGCGAGGGGG TCTACGTCGG CTACCGCTAC TACGATACCT TCGGCGTGGA GCCAGCCTAC GAGTTCGGGT ACGGGCTCAG CTACACGAAG TTCGAGTACA GGGGTCTCCG CGTCGCCTTG TCGCGCGGAG CCCTCAAGGT GTCCTTCGAA 
GTCGTGAACG CTGGGAGCCG CCCGGGCAAG GAGGTAGCGC AGGTCTACGT GCGGGCCCCT CGGGGCAGGA TCGACAAGCC GTTCCAAGAG CTGAAGGCAT TCAGGAAGAC GAGGCTACTG GAACCCGGGG AGGCGGAGAG GATAAAGCTG AGGGTTAGCC TGAGGGACCT AGCCAGCTTC GACGAGCGCG AGAAGGTATG GGTAGTCGAG CCGGGCGAGT ACGAAGTCAG AGTGGGCTCG TCTTCGAGGG ACATCAGGCT TACTGAGCAC TTCGAGGTAA AGCAGGAGCT GAGATTCGCG CCCTGA
Protein sequence | MRGRGPGLSL EEKASLVVGW GSSRRLPGAA GETRPVRVPS IVTADGPSGL RVEPSGGRRW FATAFPVPTM LAATWNPEVV ERVGRAIGEE CRAYGVDVLL APGVNMHRHP LCGRNFEYFS EDPLLSGEMA AAYVRGVQSV GVGATLKHFA ANDQETNRTV IDTVVSERAL REIYLKPFEI AVKKAKPWCV MSSYNKLNGK YSSQNEWLLT RVLREEWGFD GVVMTDWGAG DDSVEQVNAG NDLIMPGSDE AVEKLLEAAR SGRLRLEALE ASAERVLRLV RKSLTYRGYR PRGAPDLDGH ARVAYEAASE GVVLLKNEGA LPLGPGARVA LFGTGQVETL KGGMGSGHTH PRYVVTVLEG LKSGGLLVDE ELSSIYERYV REARGEEFLE KLYLDEVYAD PLPQDIVSEG DAARFAERND AAVVVLYRVS GEGWDRRPVR GDFYLTESEE RLLRLVSREF RGRGKKVVVV LNVCGPIEVA SWRDLVDAIL VVWLPGQEAG RVVADVLAGR VNPSGKLPMT WPRDWTDVPA AKAPECYPGL PVEDPRRVVY CEGVYVGYRY YDTFGVEPAY EFGYGLSYTK FEYRGLRVAL SRGALKVSFE VVNAGSRPGK EVAQVYVRAP RGRIDKPFQE LKAFRKTRLL EPGEAERIKL RVSLRDLASF DEREKVWVVE PGEYEVRVGS SSRDIRLTEH FEVKQELRFA P
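
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted initiation codons and is read as methionine when it starts a CDS. The sketch below shows how the protein could be derived from the gene sequence under that table. It assumes the sequence is given 5' to 3' on the coding strand with no ambiguity codes, and `translate` is a hypothetical helper written for this illustration, not a function of any particular library.

```python
# Translation table 11 uses the standard amino acid assignments,
# listed here in NCBI codon order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}
# Initiation codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon still encodes Met as the first residue.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 9 bases of the gene sequence in this record.
print(translate("GTGAGGGGAA GGGGCCCTGG G"))
print(translate("GCGCCCTGA"))
```

The first call reproduces the opening residues of the listed protein (MRGRGPG), and the second shows the final two residues (AP) followed by the TGA stop codon, which is not part of the protein.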