Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1153 |
Symbol | |
ID | 4600959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1092105 |
End bp | 1093166 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639773929 |
Product | glycosidase, PH1107-related |
Protein accession | YP_920554 |
Protein GI | 119720059 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0153392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGGCGA AAATACTGTC AGCTGGGGAT CCTCTCTCGA AGCTCCTCGC GGAGAAGGTA GGGCGCAGAG CAGAGCTTAG GGGGCTGAGG CGCGGCTTGG AGCAGGACGT ATTCGAAAGG CTGGCGTACA TCACTCCGAT CCGCGTAGAG GTAACGAACT ACGCGAGAAG ACCGGTAGCC GTCTTCAACC CGGGTGCCGC GCTGGAGGGC TCGAATGTCG CCATTTTCCC AAGGATGGTC TTCGACTACT ACTGGTACGT TTCATCGGTG GGGAGAATAC GCGTAGGCGT GGACGACCTT CTCTCGGGGA GTATACCCGG AACCCTGAAG GCAGACCTGG TGATATACCC GAGCGAGGAA TGGGAGCTGA GAGGGTGCGA GGACCCGAGG GTCTACGGCG CCGATGGAAA CTACCTCGTG CTCTACACCG GCGTCCTGCC CTTGCAGAAC GGCGTGCTAC CTCTACAGGC CGTTGCCAGG TACTCGGAGG GCCGTGTCGA GAAGCTCGGC TACCTGGCTT TCGAGTATGG AGGCGAAAGG TACGTGGCGC CCTGGAAGGA CAGCGCCATT CTCTCGGAGG GTGGCGGCGA GGCTCTGGCG CTCGTGCGCC CCTCCGTGCC GGTGCCGGGG GGTTTCCTGG AGGCCGGCTG GTTTACGCGC TTCGACCTGG CAGGGCTTAC AGTGGACCCG GGCGAAGCGG TTCCCTTGCT CGTTGCAGAG AGCTTTGAAT ACAAGGTTGG GTGGTCTACG AACGCCTTGA AGCTGTCGAG CGGAGAGTAC CTGGTGGGGT GGCACGGGGT AGGGGTGGAC AACGTCTACA GGAATGGGCT CGCGGTGGTA AGCGAGGAGG GGGAGCTCTT GGAGCTTTCC GAGTACCTCC TGGTCCCGCG GAAAAGCTTA GAGGAGTTCT ACGGCGATAG GCCTGGCGTG GTCTTCGGGT GCGGGTTGCT GAGAATCAAG GAGAAGCTCG TCTGGGTAGG CGGCGTCTCG GACTACGCGG TAGGCGTATT CGCCGTGGAC ATGGACAAGG CGCTGGAGCA CTTGAAGAGA GTCTCGCGCT GA
|
Protein sequence | MKAKILSAGD PLSKLLAEKV GRRAELRGLR RGLEQDVFER LAYITPIRVE VTNYARRPVA VFNPGAALEG SNVAIFPRMV FDYYWYVSSV GRIRVGVDDL LSGSIPGTLK ADLVIYPSEE WELRGCEDPR VYGADGNYLV LYTGVLPLQN GVLPLQAVAR YSEGRVEKLG YLAFEYGGER YVAPWKDSAI LSEGGGEALA LVRPSVPVPG GFLEAGWFTR FDLAGLTVDP GEAVPLLVAE SFEYKVGWST NALKLSSGEY LVGWHGVGVD NVYRNGLAVV SEEGELLELS EYLLVPRKSL EEFYGDRPGV VFGCGLLRIK EKLVWVGGVS DYAVGVFAVD MDKALEHLKR VSR
|
| |