Gene Tpen_0458 details

Gene Information

Locus tag: Tpen_0458
Symbol:
ID: 4601861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 417353
End bp: 418468
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 56%
IMG OID: 639773225
Product: metal dependent phosphohydrolase
Protein accession: YP_919870
Protein GI: 119719375
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID:
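
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(417353, 418468)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1116 bp and 371 aa, which matches the listed lengths.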


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.847424
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTTCAGC GTCTACGCAG AATAAGACAG CTAGGACCAG CCGACCTAGT ATACCCCGGA 
GCTGTACACA CACGTTTCTC CCACTCGCTT GGAACGCTGT ACCTCGCAGA GCGTATCGCG
AAGAGTGCCG GCATAGAGGA CGAAGGAGAG GTTGAGTCTC TAAGGTTAGC GGCACTCCTA
CACGATGTCG GTCACATGCC CTTTTCGCAC GCTCTTTCGA GCAATCACGA GAAGGTTTCG
CAGGAGGTAG TAAGAAGCAT GCTGGGAGAC TTGCTGGGAA AAGATCTAAA ACACGGGGTT
ATCGATATAC TCGCCGGGAG CTCGAGGCTA TCACCCATCC TAGCCTCGGA GGTCGATGCA
GACAGGCTAG ATTACTTGCT CAGGGACTCA AAGCACACCG GCGTGTCCTA CGGCAACGTG
GACGTCGACA GGGCTGTACG CTCAGCGAGG CTCGTACGAA CAGAGGCTGG GTGGGCTCTC
GGCTTCGTGC ACGGGGCGGA GACCGCAGTA GAGAACATCC TGCTGGCTAG AACTCAGCTT
TTCAGAGTGG TGTACTACCA CAGAACAGTA GGAGCATTCG AAGCTGTGTT GCGCGCAGCG
TACGCGATGC TGGTCGAAGA AGGGTACCTA CCCTCTCTCG ATGAAGCCCT CGAAGAGAAA
GAACTCTGGT GCATGTTCGA CGATTGCATG GTTATCGAGG CCTTGAAGCG CGCGAGAAAG
AGCGAGGAGG ACAGGTTGAA AGAGCTCGCC ACCAGCTTCC TTCTCAGAAA GCCACCGAAG
CTGGTATTCG AAGCGTACCT CAGCGACGGT CCGGGCGAGG GCTTCAGAAG AGAGCTCTAC
GAACACGCGC TTAAAGGGAC GGCAGAAGAA TTACTGGTCG AGAAGTGCGG CGTACCGGAG
GGTTGCCTGC TCGTCTACCT AGCGCGTATA GTCCCTATAG GCAACGTGAA CAGGCTTCTC
ATACTGCACG GGGAAAGCGC GAAGCCACTC GTTTCTCTGA GCAGCACCCT GCTGAGCTAC
CTCAAAGGGG TAAGCCTCTC CCCGCTGAGA GTATACGCCT TCCCAGAATG CCTACGGCAG
GCTACCGAGT GCCTCTCACG CCTGGCAGTC TTATGA
 
Protein sequence
MVQRLRRIRQ LGPADLVYPG AVHTRFSHSL GTLYLAERIA KSAGIEDEGE VESLRLAALL 
HDVGHMPFSH ALSSNHEKVS QEVVRSMLGD LLGKDLKHGV IDILAGSSRL SPILASEVDA
DRLDYLLRDS KHTGVSYGNV DVDRAVRSAR LVRTEAGWAL GFVHGAETAV ENILLARTQL
FRVVYYHRTV GAFEAVLRAA YAMLVEEGYL PSLDEALEEK ELWCMFDDCM VIEALKRARK
SEEDRLKELA TSFLLRKPPK LVFEAYLSDG PGEGFRRELY EHALKGTAEE LLVEKCGVPE
GCLLVYLARI VPIGNVNRLL ILHGESAKPL VSLSSTLLSY LKGVSLSPLR VYAFPECLRQ
ATECLSRLAV L
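
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 (as listed in the record), whose codon-to-amino-acid assignments match the standard code, and it assumes that an alternative start codon such as GTG at the first position is read as methionine. The start-codon set and the function name `translate_cds` are assumptions made for this illustration, not part of the database record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed set of initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at position 0 is translated as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGTTCAGC GTCTACGCAG AATAAGACAG"))
```

For the first 30 bases of the gene sequence this prints MVQRLRRIRQ, which agrees with the first ten residues of the protein sequence.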