Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1800 |
Symbol | |
ID | 4601793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1740167 |
End bp | 1741576 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774573 |
Product | ATP-dependent protease La |
Protein accession | YP_921198 |
Protein GI | 119720703 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4930] Predicted ATP-dependent Lon-type protease |
TIGRFAM ID | [TIGR02653] conserved hypothetical protein [TIGR02688] conserved hypothetical protein TIGR02688 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.186498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCCGGCA GCCTCGACGA GAAGGTTAAG AGTATATTCG GCATGGTGGC TATAAACAAG GCCGTTGCGA ACAACGTGGC TATTTCCCGC GTGCCTAGGT TCATCTCCGA GTACTTGATA AACTTCAAGT GCGGGGAGAG CCAGGACGCC GAGTGCGTTT CGAGGTTGTC GAGGTACATA AGCGAGCTCT ACCCTGATCC CTCCGACAGG GAGCTCTTCC TGAGCAGGCT GAAGGAGAGG GGCTCCCTGA AGCTTCTCGA CGAGTTCAAG GTTAGGGTAG ACCTCAAGAG AAACACGTAC CTACTGGAGA TCCCCTCTCT GCAGATAGCC GACGCCTTCG TAAACGAGGA TATAGTGAGG GAGAATGCGA GGCTTTTCTC CGGGCTGTGG GGGGTCGGGC TCCTCTACTA CAAGCCTGAG CTCTCGAGAA GCAAGAACAG AACCCCCGTC GTGCTACACG ACTACAGGCC CTTCCAGGCG TCCTACGTGG ATCTCAAGCT GTTCGTGGAG AGCCGGTCTA AGTTCACGTT CGAGGAATGG GTAGACCTGC TGGTCACGAG CATAGGGCTA AACCCCAAGG CGTACACGCT TGGGCAGAGG CTACTCCTAC TGTCCAGGCT GATCCCCGTC GCGGAGCCAA ACGTGAACAT CCTGGAGCTA GGGCCCAGGG CTACGGGTAA GACCTCGCTG TACCGCAACG TGGCCTACTA CTCGAGGATC TACGCCGGGG GGACTATAAC CCCGGCGAGG CTGTTCTTCG ACGCCAGGCT ATCCGTACCA GGGGACCTGG CGCTACACGA CGTGATAGTA TTCGACGAGA TAAGCCGGGT AAGGCTTTCG AACCCCGACG AGCTCGTAGC CAAGCTGAAG GACTATATGG TCGACGGCTT CTTCGAGCGC GGAGCCCTCA AGAGAGCCCA CAGCACCTGC TCGCTGGTAT TCCTCGGAAA CGTCGACGTG GAGAGAATAG CGGACCCGAG CCACATCCTC GAGTACCTTC CAAGCTTCAT GAGGGACTCG GCATTCCTGG ACAGAATCCA CGGGTTTATC CCCGGGTGGA GGTTGCCGAA GATTATGAGG AGCGAGGAGT CCCTCGCGAG CGGCTACGGG CTAGCCTCAG ACTACCTAGC GGAGGTGTTG CACAGGCTCC GAGACGTAGC CGCCGAGAGC GTGGTAGCCG ACCATGTGGA GTTCGTCGGG AAGTACACCA TCAGGGACGA GAAAGCCGTG AAGCGCCTAC TCTCGGGAGC CGTGAAGATA CTCTTCCCGA ACTTCGAGTT CGACAACGCG GAGCTTGCGA GGGTGGCTAG GGGCATAGTA GGGCTTAGAA ACAACGTGTC TAGACTGCTG ACAGCCATCT CGCCCTCCGA GTTCCCCCCG AAAAAGCTCG AAGTAAAGGT TAGAGGGTAA
|
Protein sequence | MAGSLDEKVK SIFGMVAINK AVANNVAISR VPRFISEYLI NFKCGESQDA ECVSRLSRYI SELYPDPSDR ELFLSRLKER GSLKLLDEFK VRVDLKRNTY LLEIPSLQIA DAFVNEDIVR ENARLFSGLW GVGLLYYKPE LSRSKNRTPV VLHDYRPFQA SYVDLKLFVE SRSKFTFEEW VDLLVTSIGL NPKAYTLGQR LLLLSRLIPV AEPNVNILEL GPRATGKTSL YRNVAYYSRI YAGGTITPAR LFFDARLSVP GDLALHDVIV FDEISRVRLS NPDELVAKLK DYMVDGFFER GALKRAHSTC SLVFLGNVDV ERIADPSHIL EYLPSFMRDS AFLDRIHGFI PGWRLPKIMR SEESLASGYG LASDYLAEVL HRLRDVAAES VVADHVEFVG KYTIRDEKAV KRLLSGAVKI LFPNFEFDNA ELARVARGIV GLRNNVSRLL TAISPSEFPP KKLEVKVRG
|
| |