Gene Tpen_0961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0961 
Symbol 
ID4600765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp911567 
End bp913123 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content45% 
IMG OID639773739 
Producthypothetical protein 
Protein accessionYP_920364 
Protein GI119719869 
COG category[R] General function prediction only 
COG ID[COG1373] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.416068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACATAG AAGATGTCAA AACCCTTCTA GACTCCTACA ACCCTTGGTG GCGCGACAAA 
GAGTGGAGCC AGAACCACCC GTTGCTTAGA GCTGTCAGAG AGTCTATCCT ACAAAATCCT
CCACGCCTCT TTTACCACAT CGTAAACTCC CTCCCAAAGC AAGGCTACTA CGGAATAGTA
ACGATTAGAG GACCCAGGAG AGTTGGCAAA ACAACATTGA TAGTAAGAAT AATTGACCAC
TTAATCTCAA AGTCTGGAAT AAAGCCGGAG AATGTATTCT ACATCCCACT CGACTACAAA
AAGTTAGAAA GCTTAAACCT ATTCGACTTG TTTTACGTTA CTGCACAGCT ACCCGAGGAG
AAGTATATCT TTCTAGACGA AGCCTCTATG CGGAGAGACT GGGCACTTGT CCTCAAAAAC
CTCGTCGATG CCGGTTTAGT TGAGAAAGGC AAACTGAAAA TCATAGTTAC CGGGAGTCAC
TCCATGGATC TAGCGGAGGC AGTGAGCAAG TTGAGCGATA GACAAGGTCG TCTAGCCTCG
TTATTTAACC TCGGAGGCAA CCTTTTTCAC GTTCCTCTAC GTTTCGTGGA AATTCTTGAG
GCTATCAGAC CAGACATAGA TGACTATCTG AGAAGATACC GTTTAAGGAA ACCCCGCGAG
AGGTTTAACA TACTACTTCA GCTATGGCAC GGTACCATAC CTAAAGCTTT GGAGGACTTT
TACAACGAGT TCTCCGAACT TCTCAACGAG ATCTTCGAAG ACTACCTCCT CCACGGAGGA
TTCCCCAAAA CAGTAGACCA GTACCATAGA GAAGGAACAA TAGAGCCGAG CTTCTATCAC
GACCTTGCCG AGCTCGTTAT TTCCGACAGC GAAAACGCCG GGTTAAAACC CGAAAATACT
AAACGCGTCC TTGAGTTTTT AACAGAGCAT GAGAGGCTAT CCTCTTTACT TGGACTGGAG
AAAAGGGAGA ACATAGGTAA ACATGTTACG GGAGTAGACG AAGAAGGCTT TCCCTCCGCA
AGATTTGGGT TCGGAAAATA CCTAGAGTAC CTAGAAACGA CCAAGCTCTT CCTGTTTCCG
TACCGTGAAG ATTCTTCACA AACCTGCACG CCGAATTATA GGGCAGATAG GAAGGTTTAT
GTGATGGATC CATTCCCCTA CTATGCTTTC AAAGCCCATA TACATAACGA GGTGGATCCA
CTAGGCTTTT CAAAGAAATT GCTCAGCGAA CCTGGGTTCA AAGGAAGACT CGTCGAAAGC
GTAGTAGCAG CACACCTAGT TATGGCTCAG CAGTTCTTCG AGCACGTATC AACGGTAGAC
TATCACAAGG TTCTTTTATA CTCCAAAAAC GAAAAAGAGA CAGACTACGT GCTCTGCCTG
TCACGTAAAG GTGAAAAGCA CAGGATTCTC ATAGAGTCTA AATACCGAGA AAACGCGCGA
AGAGAGGTTC CAGAAGGCAA GAAGATAATC TTGACAAAAA ACAAGCTCGA AGTGGTCGAA
GAAGAAGAAA ATATGCACAT CTACGTCCCA GTCACAGCTT TCCTAGCACT ATTCTAG
 
Protein sequence
MNIEDVKTLL DSYNPWWRDK EWSQNHPLLR AVRESILQNP PRLFYHIVNS LPKQGYYGIV 
TIRGPRRVGK TTLIVRIIDH LISKSGIKPE NVFYIPLDYK KLESLNLFDL FYVTAQLPEE
KYIFLDEASM RRDWALVLKN LVDAGLVEKG KLKIIVTGSH SMDLAEAVSK LSDRQGRLAS
LFNLGGNLFH VPLRFVEILE AIRPDIDDYL RRYRLRKPRE RFNILLQLWH GTIPKALEDF
YNEFSELLNE IFEDYLLHGG FPKTVDQYHR EGTIEPSFYH DLAELVISDS ENAGLKPENT
KRVLEFLTEH ERLSSLLGLE KRENIGKHVT GVDEEGFPSA RFGFGKYLEY LETTKLFLFP
YREDSSQTCT PNYRADRKVY VMDPFPYYAF KAHIHNEVDP LGFSKKLLSE PGFKGRLVES
VVAAHLVMAQ QFFEHVSTVD YHKVLLYSKN EKETDYVLCL SRKGEKHRIL IESKYRENAR
REVPEGKKII LTKNKLEVVE EEENMHIYVP VTAFLALF