Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0320 |
Symbol | |
ID | 4600971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 285853 |
End bp | 287307 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773080 |
Product | thiamine biosynthesis protein ThiI |
Protein accession | YP_919732 |
Protein GI | 119719237 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGGGC TTCGAGAAGT CTTTGTGGTG CGGCTGGGAG AGATCACAGT GAAGAGTAAG AAGTCCAGGG AGCGTTTCGA AAAGAGGTTG CTGGAGAATA TCCGCGACGC GCTTAGCTCC TCCGGTGTGA GCGGAGAAGT GAGGAGGGAG TACGGGAGGA TCTACGTTTA CGCGCCGAGT AGCTTCGCCG GTGTTCTAAG GAGGGTGTTC GGGATAACGT CCTTCTCGAT GGCTCTTGAG TACGAGTTCA AGGCTTTGGA CGATATCGCG AGCACGGTTT ACAACCTTTA CTGCGATAAG CTGAAAGGTA AAACCTTCGC GGTAAGAGCC AGGAGGACTG GAGACCACCC CTTTACTTCG ATGGACGTTG CGAGAAAGGT TGGCGAGAAG CTTTACCCCT GCTCGAGCGG CGTCGATCTG TCGAACCCAG AGGTTCAGGT TTTCATAGAG GTTAGAGGGA GCAGGGCCTA CTTCTACACC GACGTCGTAA GGGGGTATGG AGGCCTCCCC GTGGGTAGTG AAGGTAAAGT ACTAGCGTTG ATATCCGGTG GCTACGACTC TGCGGTAGCC GCGTGGTACA TGCTCAAGAG AGGCGCGGAG GTACACTACC TGTACTGCAA CATGGCGGGG GACCTAACCA AGAGCCTTGT ACTCTCCGTT GCGAAGAAGC TGGCAGATTC CTGGAGCTAC GGCTATAAAC CGAGGCTCTA CGTCGCCGAC TTCTCCCCGT TGCTTAGAGA GCTTAGGGCG AAAGTCGCCC CTGAGCTTTT CGGGGTAGTT CTAAAAAGGT ACATGTACAG GGTTGCAGAG GCGATTGCAG GTAGGATCGG GGCGATAGGG ATAGTCACAG GGGAGAGCCT TGGACAGGTC TCCTCTCAGA CACTCGAAAA CCTCTACGTA GCCTCCCAAG CCACCTCCAT GCCCATTTAC AGGCCCCTCA TAGGCTTCGA CAAGGAGGAG ATAATCTCCA AGTCCAAGGA GATAGGGACC TACGAGGAGA GCTCGAAGGT AAAGGAAGTC TGTGGAGTGT TCTCCGTGCA TCCCAAGACG AGGTCCAGGC TGGAAGAGGT AGAAAGAGAG GAGAGCAAAC TAGACCCGGC GATCCTGGGG AGGGTTCTAT CCACGGTGGA GGAGATAGAC CTTCGCTCAG CGACGCCGAG CCCCCTCATA GAGGTAGACG TGGATGCCCC GCCCGAGGGG AGCATTATAG TCGACGTGCG CCCGAAGGAG AAGTACGAAG AGGGGCACAT ACCGGGGAGT CTCCACATAG AGTTTACCGA GCTACCCCTC TTCCTCGAAA GGCTCGATAA GAGCAAGACC TACGTCTTTG TATGCGACGA GGGAGGGTTA AGCCGCGAAG CCGCATACAT GCTGAGAAAA GCCGGGTTTA ACGCGTGGAG CCTAAAGGGA GGATTAAGGA GGTTCTCACG CCTCGCTCGG GAGAGCTCCG GGTAG
|
Protein sequence | MEGLREVFVV RLGEITVKSK KSRERFEKRL LENIRDALSS SGVSGEVRRE YGRIYVYAPS SFAGVLRRVF GITSFSMALE YEFKALDDIA STVYNLYCDK LKGKTFAVRA RRTGDHPFTS MDVARKVGEK LYPCSSGVDL SNPEVQVFIE VRGSRAYFYT DVVRGYGGLP VGSEGKVLAL ISGGYDSAVA AWYMLKRGAE VHYLYCNMAG DLTKSLVLSV AKKLADSWSY GYKPRLYVAD FSPLLRELRA KVAPELFGVV LKRYMYRVAE AIAGRIGAIG IVTGESLGQV SSQTLENLYV ASQATSMPIY RPLIGFDKEE IISKSKEIGT YEESSKVKEV CGVFSVHPKT RSRLEEVERE ESKLDPAILG RVLSTVEEID LRSATPSPLI EVDVDAPPEG SIIVDVRPKE KYEEGHIPGS LHIEFTELPL FLERLDKSKT YVFVCDEGGL SREAAYMLRK AGFNAWSLKG GLRRFSRLAR ESSG
|
| |