Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1155 |
Symbol | |
ID | 4602157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1094264 |
End bp | 1096378 |
Gene Length | 2115 bp |
Protein Length | 704 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639773931 |
Product | Kojibiose phosphorylase |
Protein accession | YP_920556 |
Protein GI | 119720061 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.162173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGAAGT ACGCGTTTAG CGATAGGGAG AGGCCTATCA GGAAGATAGC CACGCTTACA ACTCTCTCCA ACGGTTTTCT CAGCGTCCGC GGGGACCCGG AGACTGCGCC CTCGGAGTAC GGCACGCTCG TTGCGGGAGT CTACAGCTAT ACGCCCATAT TCTACAGGGA GCTGGTCAAC CTCCCGAGGA TAACCCCCGT ATACGTGGAG CTAGACGGCG TGCCCATGCT ACCCGTCCAG GGTAGCAACG AGTTCCTCCT GGACGCGGAG GGAGGAACTC TGAGCTACAA GGCTATCCTT GGGTCGAGCC TAGGCGAGCT GGAATACGAG AGCCTGAGGC TTACACACAA GAAGTTCAAA GGTATATTCG CGCTACGGTA CCGCCTAGCC TCCCGCAACG CCGAGGGGCG CCTCTGCATC AAACACCCCA TAGAGCTCGA TACCCTCAAC GTATCGTCTC CCCCGGAGGT CAAGGTGAAG CTTTACAAAG TCGAGGAAGT ATCCGCCGAG GGCTCTTCCC CCTCGCTCTC CGTGAGAACA GCCGACAATG CCTACAGGGT GCTCTACGCC CTGATAGTGA GAGGTAGCCC CGCAGAGCCT AAGAGCTACT ACACGGGGAA GGAGGTCGGC TCCTGCTACT GTGTAGACGT GAAGCCGGGC TCCGTGGTCG AAGGGGAAAA GGTTGTGATC GTCGCTCTCA GCAAGGAAGA GCTCGAAAAG TTCAGGGGGA TCGCTACCTC GGAGAGCTTC GGTGGGCTTG TTTCTTCTCA TGTTGGTTTT TGGAGGGGTT TGTGGGGTAG GGTTGGTTTT AGGCTTTACG GTGATTCTGC ACTGGAGGAT GCACTTGTCT TTAACGCTTT CCACTTGCTA CAATTGTACA ATGAGGGTGG AGGGGAGTTT ATGCTTCCAG CTAGGGGTTT ACACGGCTAC GGGTATAGAG GGCATGTTTT CTGGGACTCC GACACCTACT CCCTACCATT CTACCTGCTA CTAGAGCCCG AGGCCGCTAG GAAGATACTG GAGTACAGGT GTAGGTGTCT GGGCGCGGCT AGGGAGTACG CTTCCAGTAC AGGCTTTAGG GGTGCTAGGT ACCCCTGGGA GGGTGTTGAT GACTGTAGGG AGGCTACACC CGTAGAGGTT CCTTTAGACC TCGAGGGCTC CAGGAAGGCC TTCATAGAGA CTGGTAGGCT TGAACAACAC ATAACTGCTG ACGTAGCATA CGCTGTAGAC ATGTACTACG AGTATACGGG TGACGAGGAG TTCATGGAGA GGTGTGGTTT AAGGATTATA TTCGAGACTG CTAGGTTCTG GGCTTCGAGA GTCGAGCTTG GTGGTGACGG CTACTACCAC ATCCGGGGCG TTATAGGCCC CGACGAGTAC CACGTAGGCG TAGACGATAG TTTCTACACG AACGTAATGG CTAGGTACAA CCTCGTCCTC GGCGCTAAGT ACTACGCTTT ATCCCAGTCT AAGCCTGGCT GGCTGAGGGT TGCCGTAGAG GAGGGTGTGA GTAGAGAGGA GGCTGAGGGG TGGCTCGAGG TTGCCGGGAG GGTTAGGGTT CCCTGCGAGC CGGGTGGTTT ATGCGAAGAG TTTGAGGGCT ACTTCAAGCT TAAAGACTTA GAGGTTTCCA ACTGCTTCGG GGATTCCTGT GCTAAAGGCT TAGATGTGGG TTCTACTCGC CTAGTTAAGC AGGCGGATGT TGTTGCTGGC TTGTTTTTGT TGAGGAGGTT TTTCGATAGG AGGGTTTTAG AGGGGAACTA CGAGTACTAC TTGCGTAGAA CTACTCACGC ATCCTCGCTA TCCCTACCTA TGTACGCGGC TATGGCTGCG TACCTGGGTA GAGTGGAGGA AGCTTTAGCG TTGTTGAGGA AAGCTGCCTC CACAGACCTA GAGGATACTT ACGGAAACCT TGAAGACGGC TTCCACGTAG CAGCAGCTGC AGGATCATGG ATGGCACTAC TACTAGGCTT CCTCGGGCTA GAGCCCCGCG GCGGAAAACT GGTCGCGGAG CCACGCCTCC CGGAGGGGCT CGGCGTCGAG CTAAACGTTT GGTTTAGGGG TAAGCTACAC AGAGTAGAGG CTAGGGGCTC CGAGTACAGG ATAACTGAGC TCTAA
|
Protein sequence | MKKYAFSDRE RPIRKIATLT TLSNGFLSVR GDPETAPSEY GTLVAGVYSY TPIFYRELVN LPRITPVYVE LDGVPMLPVQ GSNEFLLDAE GGTLSYKAIL GSSLGELEYE SLRLTHKKFK GIFALRYRLA SRNAEGRLCI KHPIELDTLN VSSPPEVKVK LYKVEEVSAE GSSPSLSVRT ADNAYRVLYA LIVRGSPAEP KSYYTGKEVG SCYCVDVKPG SVVEGEKVVI VALSKEELEK FRGIATSESF GGLVSSHVGF WRGLWGRVGF RLYGDSALED ALVFNAFHLL QLYNEGGGEF MLPARGLHGY GYRGHVFWDS DTYSLPFYLL LEPEAARKIL EYRCRCLGAA REYASSTGFR GARYPWEGVD DCREATPVEV PLDLEGSRKA FIETGRLEQH ITADVAYAVD MYYEYTGDEE FMERCGLRII FETARFWASR VELGGDGYYH IRGVIGPDEY HVGVDDSFYT NVMARYNLVL GAKYYALSQS KPGWLRVAVE EGVSREEAEG WLEVAGRVRV PCEPGGLCEE FEGYFKLKDL EVSNCFGDSC AKGLDVGSTR LVKQADVVAG LFLLRRFFDR RVLEGNYEYY LRRTTHASSL SLPMYAAMAA YLGRVEEALA LLRKAASTDL EDTYGNLEDG FHVAAAAGSW MALLLGFLGL EPRGGKLVAE PRLPEGLGVE LNVWFRGKLH RVEARGSEYR ITEL
|
| |