Gene Information |
Locus tag | Tpen_0458 |
Symbol | |
ID | 4601861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 417353 |
End bp | 418468 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773225 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_919870 |
Protein GI | 119719375 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
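
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `feature_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def feature_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the final codon is a stop codon that is excluded
    from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values copied from the Start bp and End bp fields of this record.
gene_len, protein_len = feature_lengths(417353, 418468)
print(gene_len, protein_len)  # prints: 1116 371
```

Under these assumptions the result agrees with the listed Gene Length (1116 bp) and Protein Length (371 aa).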
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.847424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTCAGC GTCTACGCAG AATAAGACAG CTAGGACCAG CCGACCTAGT ATACCCCGGA GCTGTACACA CACGTTTCTC CCACTCGCTT GGAACGCTGT ACCTCGCAGA GCGTATCGCG AAGAGTGCCG GCATAGAGGA CGAAGGAGAG GTTGAGTCTC TAAGGTTAGC GGCACTCCTA CACGATGTCG GTCACATGCC CTTTTCGCAC GCTCTTTCGA GCAATCACGA GAAGGTTTCG CAGGAGGTAG TAAGAAGCAT GCTGGGAGAC TTGCTGGGAA AAGATCTAAA ACACGGGGTT ATCGATATAC TCGCCGGGAG CTCGAGGCTA TCACCCATCC TAGCCTCGGA GGTCGATGCA GACAGGCTAG ATTACTTGCT CAGGGACTCA AAGCACACCG GCGTGTCCTA CGGCAACGTG GACGTCGACA GGGCTGTACG CTCAGCGAGG CTCGTACGAA CAGAGGCTGG GTGGGCTCTC GGCTTCGTGC ACGGGGCGGA GACCGCAGTA GAGAACATCC TGCTGGCTAG AACTCAGCTT TTCAGAGTGG TGTACTACCA CAGAACAGTA GGAGCATTCG AAGCTGTGTT GCGCGCAGCG TACGCGATGC TGGTCGAAGA AGGGTACCTA CCCTCTCTCG ATGAAGCCCT CGAAGAGAAA GAACTCTGGT GCATGTTCGA CGATTGCATG GTTATCGAGG CCTTGAAGCG CGCGAGAAAG AGCGAGGAGG ACAGGTTGAA AGAGCTCGCC ACCAGCTTCC TTCTCAGAAA GCCACCGAAG CTGGTATTCG AAGCGTACCT CAGCGACGGT CCGGGCGAGG GCTTCAGAAG AGAGCTCTAC GAACACGCGC TTAAAGGGAC GGCAGAAGAA TTACTGGTCG AGAAGTGCGG CGTACCGGAG GGTTGCCTGC TCGTCTACCT AGCGCGTATA GTCCCTATAG GCAACGTGAA CAGGCTTCTC ATACTGCACG GGGAAAGCGC GAAGCCACTC GTTTCTCTGA GCAGCACCCT GCTGAGCTAC CTCAAAGGGG TAAGCCTCTC CCCGCTGAGA GTATACGCCT TCCCAGAATG CCTACGGCAG GCTACCGAGT GCCTCTCACG CCTGGCAGTC TTATGA
|
Protein sequence | MVQRLRRIRQ LGPADLVYPG AVHTRFSHSL GTLYLAERIA KSAGIEDEGE VESLRLAALL HDVGHMPFSH ALSSNHEKVS QEVVRSMLGD LLGKDLKHGV IDILAGSSRL SPILASEVDA DRLDYLLRDS KHTGVSYGNV DVDRAVRSAR LVRTEAGWAL GFVHGAETAV ENILLARTQL FRVVYYHRTV GAFEAVLRAA YAMLVEEGYL PSLDEALEEK ELWCMFDDCM VIEALKRARK SEEDRLKELA TSFLLRKPPK LVFEAYLSDG PGEGFRRELY EHALKGTAEE LLVEKCGVPE GCLLVYLARI VPIGNVNRLL ILHGESAKPL VSLSSTLLSY LKGVSLSPLR VYAFPECLRQ ATECLSRLAV L