Gene Tpen_0454 details


Gene Information

Locus tag: Tpen_0454
Symbol:
ID: 4601857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 413357
End bp: 414406
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 52%
IMG OID: 639773221
Product: hypothetical protein
Protein accession: YP_919866
Protein GI: 119719371
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1499] NMD protein affecting ribosome stability and mRNA decay
TIGRFAM ID:
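
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Tpen_0454.
length = gene_length_bp(413357, 414406)
print(length)                      # 1050
print(protein_length_aa(length))   # 349
```

Under these assumptions the computed values agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields.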


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGATTC TCTGTCCTGT CTGCGGTAAG CCTACTGATA GGCTGATAGA TGGTTTATGC 
CCGGAGTGTT ACCGCAGGAG TAGGCGTATA ATGGAGCCTA AGAGAGACGC GCTCAGGGTT
AAGGTTTGCA GGAAGTGCGG GAGGCTTTAC TACAAGGACG AGTGGCTAAG CTCCAGCGAG
GAGCTTGCCA TGTCTGTTGA AAAGGATCTG CCAAGGCTAG TAAAGGTGAG GGGAGAGCTT
AGAAGCGCGA GGGTTAGTCT CTACATCGAT AAAGGGTTCG CGGAGGTTCA CGCGGTCGGC
AGGGCTGATA GCGGCATTGA CTTCTTCTAC GAAGAAACTC TGAGGATACC GCTTAAGACC
GAGTACTCTA TCTGCGACTG GTGTCTAGGC AAGGTTTCGA AGAAGAAGAG CGCCATAGTA
CAGGTTCGGG CGAGCGAGAG AGAGCTCAGC AACGACGAGA AGAAAGCCGT GTACAAGGTT
CTAGACGAGC TTTCCTCTAG CGGCAACGCG GAGGCCTTAC CGTGGGACGT AAAGGAAGAA
GCCGGCGGAC TTGACCTGTA CTTCTCCTCC CCGAGGGCCG CGAGGGAGGT TGTACGAAGG
CTTTCAGAAA GAGTTTACTT CGAAGTACTC GAGACTAGTA AAAAAGTGGG GGTGGACGGT
AGCGGCCACG AGAAGTATCA GTCAACGATA AGGCTGTTGT TGCCCCACAT AGCTAGGGGC
GACGTTGTGC TCTACAGAAA CGAGTACTTC CTCGTTAAAG ACGTCGACCC TAAGAGAGTC
ACGCTCTTAA ACCTAAGTAG CTACGAGAAA GAGGTGTACA TGCTCAACAA AAGCTTACTT
TCGCGGATAT CCGTGATTGC ACGGGCTAAC GAGTTGAAGA TGGGTGTAGT TGTCTACACC
GCTGGAGACA AAGTTCACGT AATGAGCAAC GACTACAAAG TATACGAGGT GGACATACCT
CCAAGCGCTA GGAAGCTTTT TCAGGAGGAA CAAACAGTCG GGCTACTCGT ACTCGAAAAC
AAGGTACTGC TTATACCGAG CCCGCCTTAG
 
Protein sequence
MKILCPVCGK PTDRLIDGLC PECYRRSRRI MEPKRDALRV KVCRKCGRLY YKDEWLSSSE 
ELAMSVEKDL PRLVKVRGEL RSARVSLYID KGFAEVHAVG RADSGIDFFY EETLRIPLKT
EYSICDWCLG KVSKKKSAIV QVRASERELS NDEKKAVYKV LDELSSSGNA EALPWDVKEE
AGGLDLYFSS PRAAREVVRR LSERVYFEVL ETSKKVGVDG SGHEKYQSTI RLLLPHIARG
DVVLYRNEYF LVKDVDPKRV TLLNLSSYEK EVYMLNKSLL SRISVIARAN ELKMGVVVYT
AGDKVHVMSN DYKVYEVDIP PSARKLFQEE QTVGLLVLEN KVLLIPSPP
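
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the database's own tooling. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares, and it assumes the first codon is a valid start codon and is therefore read as methionine (the gene here begins with GTG, an alternative start codon in table 11). The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the blocked layout used in the record can be
    pasted in directly. The first codon is assumed to be a start codon and
    is emitted as 'M' regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First row of the gene sequence from the record (60 nt, 20 codons).
first_row = "GTGAAGATTC TCTGTCCTGT CTGCGGTAAG CCTACTGATA GGCTGATAGA TGGTTTATGC"
print(translate_cds(first_row))  # MKILCPVCGKPTDRLIDGLC
```

The output matches the first 20 residues of the protein sequence listed above (MKILCPVCGK PTDRLIDGLC).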