Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1518 |
Symbol | |
ID | 4601114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1465511 |
End bp | 1466449 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774293 |
Product | phosphoesterase, DHHA1 |
Protein accession | YP_920918 |
Protein GI | 119720423 |
COG category | [R] General function prediction only |
COG ID | [COG2404] Predicted phosphohydrolase (DHH superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGGCG TTATACTTGC GCACGGCGAC TCCGACGGCG TGACGGCGGC CGCCATAGCG AAGAGCGTTT ACAGGGACGC AGAGGTCTTC TTCACGCACC CGGTGGGGCT CGCGAAGGAC CTGGAGGAGT TCGCCCGGGG CAGGAGCCCC GTGATCATAC TCGACGTCGC CGTCGACGAG GCCGGCGCGC CGAGGCTAGC GGAGGTTCTC CGCTCGCTGG GCGCAGAGGT CGTCTACGTA GACCACCACC CGGGTGGAGA GGCGCTGGAG AAGATACGCG TACCCGGCGT GAGGGTCGTC CACGAGGAGG GGCCCTGCGC GGCGGAGCTT GCCTACAGGT TCTTCAAGCC TCCCAGAGAG ATGAGCAGGG TGGCGCTCTA CGGCGCTATA GGCGACCACG CCCTGGGAAC CGCGTGGGTC GCGGAGGCTC TCGAGGAGTG GGACTTGAAG ACGCTCTTCT TCGACGCCGG CGTGCTCGTG CTGGCTCTCG AGGCTCTGGG GAGGGACTAC GAGTCGAAGA GGAGGGTCGT GGACCTCCTG GCGGGCGGCG GCGTGCCGTC GCGGGAGAAG AGCCTCGTAG AGCTCGCGGC GAGGCAGAGC CTGCTCAACG AGGAGCTCAG GGTCAGGGTT CGTGAGAAGG CTCTGGTCGT TGGGCAGGTA GCCTACGTCA TGGACCCTGG CGGGAGCCTG GGCACCGCGG CCTTCTACGC GAGGGTGGAG AGAGGCGTGA AGGTAGGCGT AGCCGTAGAG CGCAGAGGGG AAACGTGCGT GATGAGCCTT AGGTCTACCG GCGGAGTGGA CCTCAACGCC CTGCTGAGAA GGATAGCCCC GAGGCACGGT GGGCACGGGG GAGGACACAG ACAGGCGGCC GGCGCCAGGA TCCCGTGCGG CGAGCTCGAG AAGTTCTTGG CGGAGCTTGC GGCCAGCCTC GACCGTTAA
|
Protein sequence | MAGVILAHGD SDGVTAAAIA KSVYRDAEVF FTHPVGLAKD LEEFARGRSP VIILDVAVDE AGAPRLAEVL RSLGAEVVYV DHHPGGEALE KIRVPGVRVV HEEGPCAAEL AYRFFKPPRE MSRVALYGAI GDHALGTAWV AEALEEWDLK TLFFDAGVLV LALEALGRDY ESKRRVVDLL AGGGVPSREK SLVELAARQS LLNEELRVRV REKALVVGQV AYVMDPGGSL GTAAFYARVE RGVKVGVAVE RRGETCVMSL RSTGGVDLNA LLRRIAPRHG GHGGGHRQAA GARIPCGELE KFLAELAASL DR
|
| |