Gene Tpen_0989 details

Gene Information

Locus tag: Tpen_0989
Symbol:
ID: 4601965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 940254
End bp: 941264
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 54%
IMG OID: 639773767
Product: hypothetical protein
Protein accession: YP_920392
Protein GI: 119719897
COG category: [S] Function unknown
COG ID: [COG5493] Uncharacterized conserved protein containing a coiled-coil domain
TIGRFAM ID:
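
The reported lengths follow from the coordinates above. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed 1011 bp) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(940254, 941264)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```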


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.132758
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGGCGC TGGGTAGGGG GGAGTGGGAG CGGTTGGTTA AGGCTTTGGA GGAGGATAGG 
GAGCTTAGGT ACGCTTTAGC CGGTTTGCTG GGATTCAGGG ATTTGCTCGA GAAGATGGAC
GCAACGCTGA ACGAGATAAG GGCGCTGAGG GAGGGACAGG AGCAACTGTG GAGAAACCAG
GAAAAACTGT GGGAAGAAGT GAAGTCCCTC AGAGAGGGGC AGGGAAAGCT ATGGGAAGAA
GTTAAGGCGC TGAGAGAGGA TCAGAGGAGG CTGTGGGAGG AAGTTAAGGC TCTCAGAGAA
AACCAGGAAA AGCTATGGGA GGAGGTTAGA GCGCTGAGAG AGGACCAGGG GAAGTTGTGG
GAAGGCCAGC AGAGGCTCTG GGAGGAGGTC AAGGCACTGA GAGAGGGACA GGAGAAGCTC
TGGGAAGAGG TAAGGAAGCT GTGGGAGGAG GTAAAAGCTC TGAGGGAGAA CCAGGAAAAG
CTGTGGGAGG AAGTGAAAGC TCTTAGAGAG GGACAGGAGA AACTATGGGA AGAGGTTAAG
GCGCTGAGAG AAGAGCAAGG AAAGCTGTGG AAAGAAGTGA AGTCTCTCAG AGAGGAGCAA
GGGATCCTCG CGAGAAAGAT GGACTCCTTC GAGAGACGCC TCATAGCGCT GGGCGCCAGG
TGGGGCATCG AGTCGGAAGC CGCTTTCAGA GAAGCCATGA GGGGAGTCGT CGAGGAAATA
CTAGGCGCAG GCGAAGTCCT CAGGTGGGTC TACTACGACG AAGACGGCGA AGTCCTCGGA
TACCCCTCCA GGGTCGAAGC AGACATACTG ATAAAAGACA AGGTACACGT ACTCATCGAA
GTAAAACCCA GCGCCTCCAG CGGAGACATA GCAAAGCTCT GGAGGCTCGG ACGCCTATAC
GAGAAGAAAA CCGGCACAAA GCCAAGACTA GTCCTCGTAA CACCCTTCAT AGAAGAAGAA
GCACTAAAAG CCGCAAAACA ACTCGGAATA GAAGTATACA CGAACACCTA G
 
Protein sequence
MAALGRGEWE RLVKALEEDR ELRYALAGLL GFRDLLEKMD ATLNEIRALR EGQEQLWRNQ 
EKLWEEVKSL REGQGKLWEE VKALREDQRR LWEEVKALRE NQEKLWEEVR ALREDQGKLW
EGQQRLWEEV KALREGQEKL WEEVRKLWEE VKALRENQEK LWEEVKALRE GQEKLWEEVK
ALREEQGKLW KEVKSLREEQ GILARKMDSF ERRLIALGAR WGIESEAAFR EAMRGVVEEI
LGAGEVLRWV YYDEDGEVLG YPSRVEADIL IKDKVHVLIE VKPSASSGDI AKLWRLGRLY
EKKTGTKPRL VLVTPFIEEE ALKAAKQLGI EVYTNT
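
The sequences above are printed in space-separated blocks of ten. As a minimal sketch, the code below strips that layout, computes the GC fraction, and translates a coding sequence. It assumes the codon assignments of NCBI translation table 11, which match the standard code, and it simply reads the first codon as methionine, since the gene starts with GTG while the protein starts with M. That shortcut does not check that the first codon is a valid initiator. The helper names are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Residues for the 64 codons in TCAG order (standard code, shared by table 11).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if residues:
        residues[0] = "M"  # simplification: initiator codon read as Met
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "GTGGCGGCGC TGGGTAGGGG GGAGTGGGAG"
print(translate(prefix))  # MAALGRGEWE
```

Passing the full gene sequence to gc_fraction and translate gives values that can be compared with the GC content and protein sequence listed in this record.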