Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1118 |
Symbol | |
ID | 4600860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1056272 |
End bp | 1057315 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773894 |
Product | metallophosphoesterase |
Protein accession | YP_920519 |
Protein GI | 119720024 |
COG category | [S] Function unknown |
COG ID | [COG2908] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.315719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAGCG AGGAGTTGGA CGTAGAGGGC TTGAAGGTGC CAGTGTTCCA CCTTACGAGA GACGAGGAGG TTCTCGTGTT CTCAGACGTC CACTTCGGCC TTAGGTTTAA CGGCAGGGAG CTCTCGCTCC ACGAGGAGCT GGCGGAGTTC CTGGAATCCG TCCTGGAGGG AGGCGAGCAT CCCAGGCTGG TGGTACTGCT GGGAGATATA TTCGAGCTGT GGAGCGCCAG CCTCAGGGAT GTTTTCTCTG ATGCTTTCGA CTCGCTTAGG CTTCTATCCA GGCTGGACTC GACGATCGTC TTCGTCCCCG GTAACCACGA CAGGGTGACT ACGAGGCTCC ACTTGGAGAG CCTTAGGGGC GGGGGTGGCT TCGTGATAGC GCCGGAGATC GTACTGCTCG ACATCGACGG GAAGAAAGTT CTGCTGTTCC ACGGGCACCA GCTCGACGGG CTGTTCCTTG CCGTGAAGGG GCTCTGGAAG CTTCAATCGT ACGTCTACAT ATTCTCCGAG TCCCTGATGT CCCTTCCCGG GCCTCTCGAA TGGGTCTTCG CGGCGGTAGC TGCTACCACG GTGCTTCTAC TCATGGTGCT CATACAGGCA ACGTCCCTGC TCATGGAGGC TGCGATCGCC CTGTCCGCCT TGATACTGCT CTCGCCGATG GTGATACTGC TGTGGAGGAA GGCCCAGGAC AAGATCTGGT ACGGCTTCGT GCAACCACTA GCTTCGAGGC TACTCAAGAG CAGGCTACGC GGGAAGTCCC TCCAGTCGCT CGCCGTAAGC AAGCCTCTGA GGAGGCTGAT CGGGTTCCTA GAGTCCTTCC CCTCCGTGGG TAAGCTGGAC CTAGTGGTGT TCGGGCACAC GCACGTGCCG GAGTACCTTG CCGAGGGCGG CAGGTTGATA CTGAATACGG GTAGCTGGGT TCGAAGCAAC TCCTCGACCG GGGTGGATAA CAAGACATAC GTTAGGATAA AGAACGGTAA AGTCATGCTG GCAAAGTGGG AGGGGTCAGA GATAAAGATC TTCGAGGCTA GCCTTGTGCA ATAG
|
Protein sequence | MWSEELDVEG LKVPVFHLTR DEEVLVFSDV HFGLRFNGRE LSLHEELAEF LESVLEGGEH PRLVVLLGDI FELWSASLRD VFSDAFDSLR LLSRLDSTIV FVPGNHDRVT TRLHLESLRG GGGFVIAPEI VLLDIDGKKV LLFHGHQLDG LFLAVKGLWK LQSYVYIFSE SLMSLPGPLE WVFAAVAATT VLLLMVLIQA TSLLMEAAIA LSALILLSPM VILLWRKAQD KIWYGFVQPL ASRLLKSRLR GKSLQSLAVS KPLRRLIGFL ESFPSVGKLD LVVFGHTHVP EYLAEGGRLI LNTGSWVRSN SSTGVDNKTY VRIKNGKVML AKWEGSEIKI FEASLVQ
|
| |