Gene Tpen_1800 details

Gene Information

Locus tag: Tpen_1800
Symbol:
ID: 4601793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1740167
End bp: 1741576
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 58%
IMG OID: 639774573
Product: ATP-dependent protease La
Protein accession: YP_921198
Protein GI: 119720703
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4930] Predicted ATP-dependent Lon-type protease
TIGRFAM ID: [TIGR02653] conserved hypothetical protein
            [TIGR02688] conserved hypothetical protein TIGR02688
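
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1740167
end_bp = 1741576
reported_gene_length = 1410      # bp
reported_protein_length = 469    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: every codon encodes one residue except the final stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported gene and protein lengths.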


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.186498
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCCGGCA GCCTCGACGA GAAGGTTAAG AGTATATTCG GCATGGTGGC TATAAACAAG 
GCCGTTGCGA ACAACGTGGC TATTTCCCGC GTGCCTAGGT TCATCTCCGA GTACTTGATA
AACTTCAAGT GCGGGGAGAG CCAGGACGCC GAGTGCGTTT CGAGGTTGTC GAGGTACATA
AGCGAGCTCT ACCCTGATCC CTCCGACAGG GAGCTCTTCC TGAGCAGGCT GAAGGAGAGG
GGCTCCCTGA AGCTTCTCGA CGAGTTCAAG GTTAGGGTAG ACCTCAAGAG AAACACGTAC
CTACTGGAGA TCCCCTCTCT GCAGATAGCC GACGCCTTCG TAAACGAGGA TATAGTGAGG
GAGAATGCGA GGCTTTTCTC CGGGCTGTGG GGGGTCGGGC TCCTCTACTA CAAGCCTGAG
CTCTCGAGAA GCAAGAACAG AACCCCCGTC GTGCTACACG ACTACAGGCC CTTCCAGGCG
TCCTACGTGG ATCTCAAGCT GTTCGTGGAG AGCCGGTCTA AGTTCACGTT CGAGGAATGG
GTAGACCTGC TGGTCACGAG CATAGGGCTA AACCCCAAGG CGTACACGCT TGGGCAGAGG
CTACTCCTAC TGTCCAGGCT GATCCCCGTC GCGGAGCCAA ACGTGAACAT CCTGGAGCTA
GGGCCCAGGG CTACGGGTAA GACCTCGCTG TACCGCAACG TGGCCTACTA CTCGAGGATC
TACGCCGGGG GGACTATAAC CCCGGCGAGG CTGTTCTTCG ACGCCAGGCT ATCCGTACCA
GGGGACCTGG CGCTACACGA CGTGATAGTA TTCGACGAGA TAAGCCGGGT AAGGCTTTCG
AACCCCGACG AGCTCGTAGC CAAGCTGAAG GACTATATGG TCGACGGCTT CTTCGAGCGC
GGAGCCCTCA AGAGAGCCCA CAGCACCTGC TCGCTGGTAT TCCTCGGAAA CGTCGACGTG
GAGAGAATAG CGGACCCGAG CCACATCCTC GAGTACCTTC CAAGCTTCAT GAGGGACTCG
GCATTCCTGG ACAGAATCCA CGGGTTTATC CCCGGGTGGA GGTTGCCGAA GATTATGAGG
AGCGAGGAGT CCCTCGCGAG CGGCTACGGG CTAGCCTCAG ACTACCTAGC GGAGGTGTTG
CACAGGCTCC GAGACGTAGC CGCCGAGAGC GTGGTAGCCG ACCATGTGGA GTTCGTCGGG
AAGTACACCA TCAGGGACGA GAAAGCCGTG AAGCGCCTAC TCTCGGGAGC CGTGAAGATA
CTCTTCCCGA ACTTCGAGTT CGACAACGCG GAGCTTGCGA GGGTGGCTAG GGGCATAGTA
GGGCTTAGAA ACAACGTGTC TAGACTGCTG ACAGCCATCT CGCCCTCCGA GTTCCCCCCG
AAAAAGCTCG AAGTAAAGGT TAGAGGGTAA
 
Protein sequence
MAGSLDEKVK SIFGMVAINK AVANNVAISR VPRFISEYLI NFKCGESQDA ECVSRLSRYI 
SELYPDPSDR ELFLSRLKER GSLKLLDEFK VRVDLKRNTY LLEIPSLQIA DAFVNEDIVR
ENARLFSGLW GVGLLYYKPE LSRSKNRTPV VLHDYRPFQA SYVDLKLFVE SRSKFTFEEW
VDLLVTSIGL NPKAYTLGQR LLLLSRLIPV AEPNVNILEL GPRATGKTSL YRNVAYYSRI
YAGGTITPAR LFFDARLSVP GDLALHDVIV FDEISRVRLS NPDELVAKLK DYMVDGFFER
GALKRAHSTC SLVFLGNVDV ERIADPSHIL EYLPSFMRDS AFLDRIHGFI PGWRLPKIMR
SEESLASGYG LASDYLAEVL HRLRDVAAES VVADHVEFVG KYTIRDEKAV KRLLSGAVKI
LFPNFEFDNA ELARVARGIV GLRNNVSRLL TAISPSEFPP KKLEVKVRG
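
The gene sequence begins with TTG rather than ATG, while the protein begins with M. This is consistent with translation table 11, where TTG is an accepted initiation codon and the initiating residue is read as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record. The `translate` helper and the constant names are hypothetical; the codon-to-residue string is the standard NCBI layout (bases ordered T, C, A, G), which table 11 shares with the standard code, and the start codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Codon table in NCBI order (first, second, third base each cycle T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An alternative start codon still initiates with methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGGCCGGCA GCCTCGACGA GAAGGTTAAG"))  # MAGSLDEKVK
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence, spaces and line breaks included, to the same function should reproduce the complete protein.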