Gene Tpen_0884 details


Gene Information

Locus tag: Tpen_0884
Symbol:
ID: 4600827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 831696
End bp: 832832
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 58%
IMG OID: 639773662
Product: KH domain-containing protein
Protein accession: YP_920288
Protein GI: 119719793
COG category: [T] Signal transduction mechanisms
COG ID: [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase
TIGRFAM ID:
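
The length fields above follow from the coordinates. As a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon, the arithmetic can be checked as follows (the function names are illustrative, not part of any database API):

```python
# Values copied from the Gene Information fields above.
START_BP = 831696
END_BP = 832832


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1137 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields.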


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.145577
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCTTC CGTTTACACC TGCTACTGAG AAGCAGAAGA TGTTGCTGGG AGCTCTCGAA 
AGCCCGGACG TGGACCTAGT GGGGGTCTTC GGTCCCAGCG GTACGGGGAA AAGCTTCGTC
GTGCTCCTCT ACGGCCTCTC TGCGGTTCGG AGCGGGAAGT ACAAGAGGAT GGTGGTCGTA
AAGCCTCTAG TAAGCCTCTC CAGGTCTAAG GTACTCGACT CCAGCGAGAT GGGTAACCTC
TTCTTCGAGA TAGCCTCCTC CTACGTGGAA GACGTGGCCG GCGACTACGT AGACCTGAAG
GAGCTTAGGG AGATGTTCGA CCAGAGAAAG GTGGTGTTCG TCGACCCGGA CTTCCTCGCA
GGTAGGACTT TCGACAACAG CCTGGTCTTC CTGGACGACG TCCAGTACGC CTCGCCCGAC
CTCGTGACGG AGTGCATCAT TAGAACCGGG AAGAACTCCA AGCTCGTAAT AGCTGGTGAC
CCCATACTCC AGGCGCTCGA AGGAAAGACG AGAAACACTG CCGCTATAGC CCGGGAGTTG
CTCCTCGGGG AGGAGAGGTC CCTCGTTATA AACATGGGTA TAAACGACAT AGTGAGGCCC
GGCTCCAAGA GAGGCTTCAA GCTGGCGCTC GAGTCAAGGT TGAGGCGCAG GGCCCTCAGC
GAAGAGGAGG AGAAGGTGAA AGCGATACTT CAAAGCCATG CCCCTGACGC CGACGTGGTG
ACCGTGGTCT GGCTAAAAGA CCTGAAGGAA AAGTTCGGCG CGCACACAGC CCCCGATGTC
CTGGCGGTCG CGAAGGAGAA TACCCTGAGC AGGCTTATAG GGAAGAAGGG CGAAAGGATA
AACAAGGCCC AGGAGGAGGC GGGGGTCCAG ATAAGGGCTG TCGAGTTGAC GCAGGATCTC
GGGGAGATAG TGAAGGCAAT ACACCCAGTG GGGTGGATCA GGAAGCATAT CACGAGCGTA
GAGATAGAGG GCTCGGAGCT CGCCGTATAC GTAAATCCCG ACGAGTACGG AGCATTCGTA
GGCAAACAAG GGTCCTACAT CAGGTTCTTG GACGCCGCTG TACGGAGGCT TCTGGGGCTC
GGGGTCAGGG GGAAGCACGC CGAGAAGACC GAGCAGCGAG GAGCAAAGAG GAAGTAA
 
Protein sequence
MSLPFTPATE KQKMLLGALE SPDVDLVGVF GPSGTGKSFV VLLYGLSAVR SGKYKRMVVV 
KPLVSLSRSK VLDSSEMGNL FFEIASSYVE DVAGDYVDLK ELREMFDQRK VVFVDPDFLA
GRTFDNSLVF LDDVQYASPD LVTECIIRTG KNSKLVIAGD PILQALEGKT RNTAAIAREL
LLGEERSLVI NMGINDIVRP GSKRGFKLAL ESRLRRRALS EEEEKVKAIL QSHAPDADVV
TVVWLKDLKE KFGAHTAPDV LAVAKENTLS RLIGKKGERI NKAQEEAGVQ IRAVELTQDL
GEIVKAIHPV GWIRKHITSV EIEGSELAVY VNPDEYGAFV GKQGSYIRFL DAAVRRLLGL
GVRGKHAEKT EQRGAKRK
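
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below is one hedged way to reproduce that by hand. It assumes the sequence is given 5' to 3' on the coding strand and that table 11 assigns the same amino acids as the standard code (it differs in the set of permitted start codons, which this sketch does not model; this gene starts with ATG). `translate` and the constant names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, third fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCGCTTC CGTTTACACC TGCTACTGAG"))  # MSLPFTPATE
```

The ten residues printed match the start of the listed protein sequence; passing the full gene sequence to `translate` should allow the same comparison over the whole record.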