Gene Tpen_0958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0958 
Symbol 
ID4600762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp908835 
End bp910724 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content61% 
IMG OID639773736 
ProductAAA ATPase 
Protein accessionYP_920361 
Protein GI119719866 
COG category[L] Replication, recombination and repair 
COG ID[COG1112] Superfamily I DNA and RNA helicases and helicase subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTTCTC CAGTTAAGGA AGACTTGTGG AGCATGCTCG AAAAGCTTCT CGAGAAGGAG 
CGCGGGTCCT CGGCGACAGA GGCGGAGAGG TTCCCGGCGA GGTTCCTCGG AGGCGAGAAG
GTCCTGCTCT TCGAGAAGCG CGGAAAGCCC TTCAGCTTCG AGCCGGGAGA CGTCGTGGGC
GTCGCCGAGG GGGGATTGGT CGAGCCCCTA GGAGTCGTGC TCGACGCGAC CTCGGAGACG
CTCATAGTCG AGAACGTCTA CCGGAAGAGG CTCCAGCAAC TCCGAGAGGT AGAGCTGGCG
GACGCGGAGT TAACGCTGGG GTACGACCTC CAGCTGGACC TCTTGAAGCA GGTGCGGGGC
GGCAAGGCGG AGATCGTCGC GGTGTTCAAC GAGAAGCCCG TGGAGCTCTT CGAGGGAGCC
CCGCAGAAAC CGCTCGGCAG GGTCGACGCG CGGACCGTGA AGGTATCCCT GGTGAAGCGC
TCGGCGGGCG GCGGGGAGGC TCGGGAAGAG GTACGCCTAG ACGAGTCGCA GTCGAGAGTG
GTGAACGCCG CCCTCGAGCT AGAGGAAGGG GAGGTGCTGC TCGTCGTCGG GCCGCCGGGC
ACGGGGAAGA CAACCACCAT AGCGGGGATC GCGGAGAAGC TGGCGGAGCG CGGCGAGCGG
GTACTGATCT CCTCCCACAC GAACAGGGCT GTGGACAACG CCGTGGGGAA GCTTCCCATC
GACTTCACCG TGAGGGTCGG CAGGCCGGAG AAGGTACTAA GGGAGATAGA GCCCTACCTG
CTAAGCTACA AGGTAAAGAG CGCGCTGGGC GAGAGGTACG CGGACCTCCA AGAAAGGATA
AACGAGTTGC TGGAAACAAT AAGGATCCAC CGAGGGTACC TCCGCGGAAA GGATAGGTCG
ACGTTTTCCC TTCTGGAGCG GAGCCTCCGG GAGCACGAGA GGGAGCTGAG GGAGCTCTTG
GAGGAGCGGA GAAAGCTCAT AGAGGAGGTC CAGAGCGAGG TCCTCGGAGG AGCCAGGGTA
GTCGGCTCGA CGCTAATAAA GTCCCAGCTC TACCCCCTCC GGGACTACCC GTTCGACACC
GTGATAATAG ACGAGGCAAG CCAGGTCTCG GTAACCCTAG CACTACTCGC AATGGTCAAG
GGGAGAAAGT GGATAGTAGT AGGCGACCAC AAGCAACTAC TCCCCATATT CAGGTCCGAG
GTAGCACGGG AAGAGCTGGA GGACCTGGGA GCGTTCACGA GGCTCATGCG GTTCTACGGG
GAGAAGGGAG GCTACCCTAG AACCCTCTGG CTGAGGAAAA GCTACAGGAG CCACCCCGAC
ATAGTGGGCT TCGCCGCCCG CTACGTCTAC GAGGGCAAGA TAAAGCCCGC CGCCAAGCCC
AAGGAGAAAA CGCTCGCGCT CAGCCCGGGG TACCCCGACT TCCTCGAGCC CCGCAAGCCC
TTCACGCTAA TACACGTAGA CTCGCAGGAA GAGCGCCGGG GCGGCTCAAG GATCAACGAG
GCGGAAGCCA GGGTATGCTA CGAGCTCGTA GACGCGCTGA CCAAGCACGG CGTACCACAG
GAAGAGATAG GAGTAATCAC GCCGTACAGA GCCCAGCGGA GCAGAATAAA GGAGTACCTC
CAAGGCTTCA ACGTCGAAGT AAACACCGTG GACGCATTTC AGGGGAGAGA GAAAGACGTG
GTAATATTCT CCCTCACGGC TACACGGGAA GACTCGCTCG CATTCGCCGC AGACGCAAAC
AGGCTAAACG TAGCCATCAC GAGAGCCAGG AAGAAGCTAC TCTTCGTGGC AAACGCGAAC
GTAATGGAGC ACGGCATACT GAAGGAGATC CGCGAGTGGG CCAAGAAGAA GAAAGCCATC
TACGACTGGC GCCTAAAGAA GTGGACATAG
 
Protein sequence
MASPVKEDLW SMLEKLLEKE RGSSATEAER FPARFLGGEK VLLFEKRGKP FSFEPGDVVG 
VAEGGLVEPL GVVLDATSET LIVENVYRKR LQQLREVELA DAELTLGYDL QLDLLKQVRG
GKAEIVAVFN EKPVELFEGA PQKPLGRVDA RTVKVSLVKR SAGGGEAREE VRLDESQSRV
VNAALELEEG EVLLVVGPPG TGKTTTIAGI AEKLAERGER VLISSHTNRA VDNAVGKLPI
DFTVRVGRPE KVLREIEPYL LSYKVKSALG ERYADLQERI NELLETIRIH RGYLRGKDRS
TFSLLERSLR EHERELRELL EERRKLIEEV QSEVLGGARV VGSTLIKSQL YPLRDYPFDT
VIIDEASQVS VTLALLAMVK GRKWIVVGDH KQLLPIFRSE VAREELEDLG AFTRLMRFYG
EKGGYPRTLW LRKSYRSHPD IVGFAARYVY EGKIKPAAKP KEKTLALSPG YPDFLEPRKP
FTLIHVDSQE ERRGGSRINE AEARVCYELV DALTKHGVPQ EEIGVITPYR AQRSRIKEYL
QGFNVEVNTV DAFQGREKDV VIFSLTATRE DSLAFAADAN RLNVAITRAR KKLLFVANAN
VMEHGILKEI REWAKKKKAI YDWRLKKWT