Gene Tpen_0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0304 
Symbol 
ID4601127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp270252 
End bp272450 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content51% 
IMG OID639773065 
ProductAAA family ATPase, CDC48 subfamily protein 
Protein accessionYP_919717 
Protein GI119719222 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID[TIGR01242] 26S proteasome subunit P45 family
[TIGR01243] AAA family ATPase, CDC48 subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAACG AAGGAAAAGT CCCCGAGATA GAGCTCAAAG TGCTTGAGGT AAGACAGCAC 
GAGGCTGGGA GGGGCAGGGT GAGGATAGAC GAGGACGCCA TGGAGGCTCT GGGAATAAGT
GCTGGTGATG TTGTGGAGAT AGAGGGTAAG AGGAAGACTG TCGCCATCGC TTGGCCTGGA
TACGCCGAGG ACAAGGGGAA AGGCATAATC CGAATGGATG GCTGGACGAG GAAGAACGCC
GGCGTCAGTA TAGGCGACAA AGTCAAGGTT AGAAAGGCTG AGGTGAAGCC GGCCCAGTTC
ATAAGACTTG CCCCGGTGTC TATGACACTC GCCGTCGACG AGAACTTCGT GGCTTACGTG
AAGAAGAGGC TTGTTGACAG GCCGATAATT GAGGGCGACG TTATACAGAT CCCCGTGCTA
GGCCAGGTAA TTCACTTCAA CGTTGTGAAC ATCAAGCCAA AAGGAGTAGT CGTAGTCACG
GATAAAACTC AGCTCAAAAT ACTCGAGAGA CCTGTTGATA CCGGAAAGAT ACCAAGGGTA
ACCTACGACG ATATAGGGGA CTTGGAGGAG GCTAAGCAGA AGATTAGGGA GATGGTTGAG
TTGCCGTTAA GGCACCCAGA GCTCTTCAAG CGCCTCGGTA TTGATCCTCC GAAGGGTATA
CTGCTCTACG GTCCTCCGGG TACCGGTAAG ACCCTCCTAG CAAAAGCCGT AGCAAACGAG
ACTGATGCTT ACTTCATAGC TATTAATGGC CCTGAGATTA TGAGTAAGTT CTACGGTGAG
AGCGAGCAGA GACTCAGAGA GATATTCGAA GAGGCTAAGG AGCACGCTCC TGCGATTATC
TTTATCGACG AGATAGACGC CATAGCCCCC AAGAGAGAAG AAGTCACAGG AGAAGTAGAA
AAAAGAGTAG TAGCACAGCT ACTAGCACTA ATGGACGGGC TTGAAGCTCG CGGAGACGTT
ATCGTTATCG GAGCTACCAA TAGGCCTAAC GCCTTGGACC CTGCGCTTAG AAGGCCCGGC
CGCTTTGACA GGGAGATCGA GATAGGCATC CCGGACAAGC GGGGAAGGCT CGAAATATTC
AAGGTTCACA CTAGGAGCAT GCCTCTAGCA AAGGACGTGG ACCTTGAAAA GCTCGCCGAG
ATAACCCACG GCTTCGTCGG GGCAGATATA GCAGCGCTGT GTAGAGAAGC CGCGATGAAA
GCCCTCCGGA GAGTACTGCC GAAGATAGAC TTGGAGAAGG ACGAGATCCC AGTAGAGGTG
CTTGAAACGA TAGAAGTCAC CATGGACGAC TTCATGAACG CATTCAGGGA AATAACTCCG
AGCGCCCTCA GGGAGATCGA GGTAGAAGTA CCGGCGGTGC ACTGGGACGA CATCGGAGGG
CTGGAAGACG TCAAACAACA ACTAAGAGAG GCTGTCGAGT GGCCCTTAAA GTACCCAGAG
TCGTTCAGCC GCCTCGGTAT TGATCCTCCG AAGGGTATAC TGCTCTACGG TCCTCCGGGT
ACCGGTAAGA CCCTCCTAGC AAAAGCCGTA GCGACAGAGA GTGAGGCGAA CTTTGTTAGC
ATAAAGGGTC CAGAGGTCTA CAGTAAGTGG GTTGGTGAAA GCGAGAGAGC CATACGAGAG
CTGTTCAGAA AGGCGAGGCA GGTTGCTCCG AGCATAATAT TCATAGACGA GATAGACGCC
CTCGCACCAA TGAGAGGGCT TGTAACTTCC GATTCGGGGG TCACAGAGCG CGTTGTCAGC
CAGCTTTTGA CTGAGATGGA TGGTTTGGAG AGGCTTGAGG GTGTTGTTGT TATTGCTGCT
ACTAATAGGC CTGACATTAT TGATCCTGCT TTGCTGAGGC CTGGCAGGTT TGATAGGTTG
ATCTACGTGC CTCCACCAGA CGAGAAGGCT AGACTCGAGA TACTCAAGGT TCACACTAGG
AGAATGCCTC TAGCCGAAGA CGTAGACCTA GCAGAAATAG CGAGAAAAAC AGAAGGATAC
ACAGGTGCAG ATATAGAAGT GCTGGTGCGC GAGGCAGGGC TTCTGGCATT AAGGGAGAAT
ATTTCGATAG ACAAGGTTTA CAGGAGACAC TTCGAGGAAG CGTTAAAGAA GGTAAGACCC
TCTCTTACCC CGGAGATAAT AAAGTTTTAC GAGTCGTGGA ACGAGAGAGC TAGAAAGGTT
TCAAAGCAGC AGCTAACAGT AACGGGTTTC TATGTATGA
 
Protein sequence
MNNEGKVPEI ELKVLEVRQH EAGRGRVRID EDAMEALGIS AGDVVEIEGK RKTVAIAWPG 
YAEDKGKGII RMDGWTRKNA GVSIGDKVKV RKAEVKPAQF IRLAPVSMTL AVDENFVAYV
KKRLVDRPII EGDVIQIPVL GQVIHFNVVN IKPKGVVVVT DKTQLKILER PVDTGKIPRV
TYDDIGDLEE AKQKIREMVE LPLRHPELFK RLGIDPPKGI LLYGPPGTGK TLLAKAVANE
TDAYFIAING PEIMSKFYGE SEQRLREIFE EAKEHAPAII FIDEIDAIAP KREEVTGEVE
KRVVAQLLAL MDGLEARGDV IVIGATNRPN ALDPALRRPG RFDREIEIGI PDKRGRLEIF
KVHTRSMPLA KDVDLEKLAE ITHGFVGADI AALCREAAMK ALRRVLPKID LEKDEIPVEV
LETIEVTMDD FMNAFREITP SALREIEVEV PAVHWDDIGG LEDVKQQLRE AVEWPLKYPE
SFSRLGIDPP KGILLYGPPG TGKTLLAKAV ATESEANFVS IKGPEVYSKW VGESERAIRE
LFRKARQVAP SIIFIDEIDA LAPMRGLVTS DSGVTERVVS QLLTEMDGLE RLEGVVVIAA
TNRPDIIDPA LLRPGRFDRL IYVPPPDEKA RLEILKVHTR RMPLAEDVDL AEIARKTEGY
TGADIEVLVR EAGLLALREN ISIDKVYRRH FEEALKKVRP SLTPEIIKFY ESWNERARKV
SKQQLTVTGF YV