Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1654 |
Symbol | |
ID | 4601731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1601324 |
End bp | 1602565 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639774427 |
Product | hypothetical protein |
Protein accession | YP_921052 |
Protein GI | 119720557 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2407] L-fucose isomerase and related proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACGTAT ACGCGGTAGC CTTCGCCTCG AGGATCCACG GGGAGGGCTA CTACAGGCAG GCGTACAGCT ACGTTTCGAG CGTTCTCCGC GTACCCGTCT ACCCCGAGGT GGTCGCGGAG CGCGACACCT TGAAGAAAGC CGTCGAGGAG CTTAGGGGCT CCCTCCCCTT AGCCGTAGTC TTAACCGGCG GGACCAGCGG GTTGATACAG GAGTTCGCCT CGGAAGGAGG CTTCAGGGCT GTGGCGCTAT TAGCGCAGGG CGAGCACAAC AGCCTAGCCT CGGCGATCTC TGCGAGAGCG GCTCTCGAAT CCAGGGGGGT CGGTGTAGCT CTCTTCCACT GCGGCTCCTT CTCGGACGGC AACTGCGCCG CCGCGGCGAG CGCGGCTGTC AGAGTTGCCC GAGGAGCAGG CCGGGTTCTG GGGGCGAGGG TGGGCGTGGT GGGCTCTAAG CCTCGCTACG CGGATGTCTT CTCGTCGAGG CTCGGCTGGA CTATCGAAGT GGTGCCCGCC GAGGAGCTTT TCTCCGCCGC AGAGTCCGCT CCCAGAGAGG CTGTGGAGTC CTTCCTTTCC AGGGTGTCGG GGGTCCCGGG CTTCGAGTTG TACCGCTCAA GCCTCGAGCA CGTCGGCGGG GTGTACTACG CGTTGAGGAG GCTCTCCGAG GAGAAAAGGC TCGACGCGGT CGCCGTTGAC TGCTTCCCCT ACCTCGTAGA GCACCGCGTA TCCCCCTGCG TTGCGCTGGC GCTCCTGAAC GCGGACGGCT TCGCGGCGGC CTGCGAGGCT GACCTCTACT CGGCGCTCTT AATGCTCGTC TCAAGGGAGC TTACAGGGTC CTCGGGGTGG ATAGCCAACG CTACGCACTT CGAGGGCAGG GTCGGGGTCT TCTCTCACTG CACGATAGCG TTCGACATCG CCAGGGCTCC CAGCCTGGTA GACCACTTCG AGAGCGGCTA CCCGGTAGCA GTGGCGTCCC AGCTTCAGCC CGGTGAGGTA ACGGTGGCCT CGCTTTCACG GGACCTCTCG GAGGTCTACG TAGCTAGGGG CAGGGTGGTG CGCTCTGGCT TTATCAGCCG AGCGATGTGC AGGACGCAGG CACACGTGGA GTTCGACTTC GACGCGGAGG TAATCCCGCT GGTGGCACCC GCGAACCACC ACCTCGTAAT GCCCGGCGAC GTCGTAAGGG AGGTTAAAAG CGTCTCGAAG CTCCTCGGGC TACGCGTCAA GGAGTACTCA AAGGAGGCTT GA
|
Protein sequence | MNVYAVAFAS RIHGEGYYRQ AYSYVSSVLR VPVYPEVVAE RDTLKKAVEE LRGSLPLAVV LTGGTSGLIQ EFASEGGFRA VALLAQGEHN SLASAISARA ALESRGVGVA LFHCGSFSDG NCAAAASAAV RVARGAGRVL GARVGVVGSK PRYADVFSSR LGWTIEVVPA EELFSAAESA PREAVESFLS RVSGVPGFEL YRSSLEHVGG VYYALRRLSE EKRLDAVAVD CFPYLVEHRV SPCVALALLN ADGFAAACEA DLYSALLMLV SRELTGSSGW IANATHFEGR VGVFSHCTIA FDIARAPSLV DHFESGYPVA VASQLQPGEV TVASLSRDLS EVYVARGRVV RSGFISRAMC RTQAHVEFDF DAEVIPLVAP ANHHLVMPGD VVREVKSVSK LLGLRVKEYS KEA
|
| |