Gene Information |
Locus tag | Tpen_0742 |
Symbol | |
ID | 4601149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 689824 |
End bp | 694959 |
Gene Length | 5136 bp |
Protein Length | 1711 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773518 |
Product | hypothetical protein |
Protein accession | YP_920147 |
Protein GI | 119719652 |
COG category | |
COG ID | |
TIGRFAM ID | |
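
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function name `expected_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a single terminal stop codon that is not counted in the protein length.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated by the record itself): coordinates are
    1-based and inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Start and end values copied from the record for Tpen_0742.
gene_len, protein_len = expected_lengths(689824, 694959)
print(gene_len, protein_len)  # 5136 1711
```

Under these assumptions the result agrees with the Gene Length (5136 bp) and Protein Length (1711 aa) fields.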
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATATGA GTAGGAAAAC CCTATTAGGA ATACTGGCAG TACTCGCAAT ACTAGCACTA GCCGCAAACA CATATGCCTA CCCGGTTAAG AGCGCATACA ACATACTCCA GCTTCCCAAC GGACAGCCCT TCGCGAACCA GCAGGTAATA ATTGTCTACT TCAACGAGAC CGGGAACTGT ATACTTGCCT ATGCTATAGG CACGACGGAC TCCACCGGGA AGATAACACT CACAATCGCG CAGCCCGGCG GAGTGATAAA CCAGCCGACT AGTGGTACAT ACAACATGTC CGTCTTCTGG CAGGCCTACG GTAGAACATT CCTGCTTTAC ACCACAGGAA GTGGTCAAAG TGGAAACACA GTCCTGGGGC TTTTGAATAG CACAATTACA CTGAACTACA TTTGGAACTT CACATTCCAA GCTGTAACCA TGATTAACGG GCAAACGGTG CCTCTCTACT TCCAGGATCC TGAGGATGCT ACGAGGGCGG ACGTAGCATA CTTCGAGGTA TACCTCTACG GTAAGACGGG CGCGCCCATA TTCACCTCTG TCGGCGACAG TCAAGGAAAG TCTCAGGGCT GGGTTCTAGT TCCTATCTAC GAGGTTAACA TCAAGCCGAC GCTTTCTCCT CAGTGCTACC ACATTGAGAA GCAGTGGAAC GTCGCGCTGT ACAAGGAGGT GTACTGGCTA CTCGACCAGG GACAAGGACA GAAGTTAAAG GTGCTCGTTG GGAAGGAGAA CGTCACCCTC AGCTACGACT CTGCTACTGA CACCTTAACC GCGACTGTAA AGGACCTGGC AACAGGCACC ACGTCCACCC AGACGTTCAC GGATGCCGGC CATATAGGCA CGCTTGTTGA TAACGCGCAC ACGTTCCTTC TCCAGGTCGC TGTCAAGGAT ATATGCGGTA ATTCGCTGAA CAACTGGCAG AACCCACCGA TAAGCATAGA GCTGACCTCG CCGAGCCTAG GCAAGCTCAG GGCAGGCAAA GCCAACCCGA GCGGAGTAGC GCCAGAGGAA GGCTATTCGT TTGGCGGTAG CGTCTACTTC TGGTTGCCGG ACCTCACGCT GATCTACAAG GAGAAGATGG GCATCGCCGC CGAAATAGGC GGAATCCAGG TATTCTCCGA AGCATTTAAC ACGACAACGA CGGGTAGCAT TACCTACACT AAACCCGTAT CCTTCACCCC GTTCACTCTA ACCGGAAGCA ACGGAGTATA CACTGCTACG ATAAAGGTAA GCATCGTACC AACAAAGATA ACGATCAAGA CGAGCGGCCT CATAGCCGTT CAGCCGCTCC AGGGCGCTAT CGTGCAGATA AGCCCACTTG TCGGCGACCC CAACACCTAC ACCGACGCTA ACGGAAACGT GGCCCTAATG CCGTTCCAGA TAGTTGGAGG AGGAACTGTC GGCGACGGAT CCGGGAAGCC GATAATCGTA CGCTCGGGAG CGACGAACCC AGGCTACCTG CCCGTACCCT ACGGGCTGTG GAGCACCGGT AAAGTCTTCA CGTACACTGT CCGCGTGCTC TGGGCGCTAC CCGGCACCGA GTACTACGTC GACGTAACGC CGGACGCTAA CACAATAGAG CTCAACCTGA CGAAGGCCAT TGAAACCGGA TGCACGGTGC AGAGCTTCGT GCTGTTCGCT AAGGTGTACG TGGTCAACAT GAAGGCCGTT GACCTCTGCA ACAGGCCGAT AACGAGTGCA GACGACCCCA ACGCGACGCT TATACTCACG TACACAACGC CGGACGGGCA GACAGTCCAG TTCCCCGCAG GGCTCGGTAG CAACGGTACA 
ACGTTCGCTG CTTTCGTACC TGGAGGAAGC TTCGCCGTCA AGCTGTTCTT CAAGGGAGTC CTCCTGGACC CCGTCTCCGG TCCTAACCCG CTCGTAGTAA GCGGGAACAT CGGTAACATC AGTAAGGCTA CCTACACGTT CCCGATAGGC GACTTAACGC TGAGGATCAC GCCGTGGGAC GTCAAAGAGC CGCTAGTCAA CGTATCCGTG CTCCTAGAGT ACTTCAAGGC AGGCGCTAGG ACCTATTATG AGAGCCATGT AACAGGCTGC GACGGCACGG TCACGTTCAC CAAGGTGCCC TTCAAGGTCG CCGCGAACAG CAAGATAACC GTGACTATAA CCACTAGGCC CGATACCCCG TACATACGCC ACCCCAAGGA CGACGGCCTC GTCATCGGTA AGTGGGACCT GACGAGCCTC CTCAACACTA CAACGCCCGC GTGCAACGTA GGACCTATCG ACGTGCCGAC CTGGGTGTTC AGCTTCACTC TTGAGGCCGT AGACCACAAC GGCAACGTGC TGAAAGAGCT CCCGACCAGT AACGGCACAG CCCCCGTTAT AGTCGCCATA AACGACACCT ACACGAAGAA CGAGTACAAC GTCACCCAGG TATGCCAGGC AGGGCCCGGC TGTCTCTGCT GGCCGCTCAT AAACGTCAAC TACCAGATAT TCAACTCGAC CAACTACCTC GGCACGCCGT GGGGCAACGG CATCTCCGAG GCAAGGTTCA AGATAACCGG TAGCCAGTGG ATGAACAGCA ACTACCCGCA CCTCTTCATA GCCGGCGCGT ACTACAACTT CATAGTCTGG TACGGCGGAG TAATGGTCTA CAACTACAAC TTCACGCTAC CGGCGCCCAG CGAGCCGCTC GACTACAGCA AGGTCGTAAC GACGGTGGTA CTCTTCAACG AGACGACGGG CGCGACGAGG ACCGTTGAGA CGGACAAGCT CGACTACACC TGGGTACTCG CGAATGGCGG CGCCGTCGAG CACCCGATAG CCAGGCTCTA CGGAGCACCC GCGTGGAGCG GTAGGTACAG CGTCAAGCTC CAGCTGGTAA CCTGGGTAGT CAACCTCGAC GTCTACGCGC TCAGCAAGAT AGGCGCGGGC CTCATACCTG GCCTCAACGT AACCCTCGCT AGGAACGACA CCGTCAACTG GAAGAAGCTC GTAGGGAACT ATACCGCTAT AACGAAGCCC ACGATAACTC TGGGCGCTGT AGCGTGGAGC GCCGTCACCG GTAGCGACGG CAAGGCATCG CTACCCGTCG CGATCTGGCA GCCCAAGCTC AAGCTCGCTA ATGTGAAGTT CGGAGCCAGC ATAACCAACG TCACGGTGCT GGCCGGCAAG AAGTATGGGA CGCCCAACGT TCCAGACACG CCGACCGCGG GCTCGCTCAC TGTAGGCAAT GTCAACTTAG GCACCCTCTA CGGCTACATA GTAGGACCCT ACAACGTTAC CACGAACGCT ATATTCAACG ACGTGAGCCA GAGCCCCGAC TGGTACAAGT TCTACGGCGG GACCTGGAAC AAGTTCCGCG GACCGCAGGC GTGGATCGGC GCCAACTGGA ACATGACGCT GTGGAGCGGC GCGGCTAAGG TCGTCTACAC CGCCGCCATG GAGGGCTTCT GTGTCAGCGT GACAGGCCCA GACTTCAGGG ACAACCTCGT GCCGCTCGCC AACCAGCCCG TAACGGTGAC CGTCCTCGGT GCCTCGGGTG CCTCCGCGGC TTTCGCCTCC GCGGCTACCG GTAGCGACGG AACGGTGACT ATTAGCCCCG ACAAGGGCGC CGCCGTAGCG ACGCCCGTTG 
GTAGCCTGCC GGTGTTCACA GGCAAGGTAG CCTTCCTCGG CGTTACAGGC CTCAAGTACA CTCTGTCCAC GAAGCTCAAC CTCGACGACG CGCTCGGCCT CAAGAACTAC GGTATAACCA CGGGCCAGGC GTTCGACCCA GACACGTTGT CGACGACTGT TAGCTTCAAC GTTACCAACA ACATGCCCGG CGGTACGTGC GTGGCGCTGA AGTGGGACGC CATAAAGGTG ACGGTCTTCG ACTGGAGCGG TAAGCCTCTG AAGAACATGA TGGTAGCCGC CATACTCAGG GAGCCCAGGG CCAAGGCTAT TCCGAGCACT GCCGGCTTCA CCGCGGAGAA CGGTAGCGTG ATACTCTACG TGCCGCCAGG CACGCAGAAG TACCAGCTCA TCGTCTACTG GCGCGACAGC TACCTGCTGA GGGTTGCCGG CAAGATTCCG AGGGAGATAG CGATATTCGA CACCGTCACC GACTACGATA CTCCGAGAAC CTACGCGCCC GGTAGCGGCA CGACGCTCGA AACCTTCGTC TACGTCGGCA TAATAATGCT CAGGAACGCG CAGGGCCAGG CCCTGTCGCC CGACATACTC AGCAAGATAA CGGTCGAGAT ACAGTGGCCA GACCTCGTAG TGACCACTCA CAAGCCCGAG AACGACGGTA GAGTACCCAT AATACTCAAC AAGGACACCG CTAAGAGCTG GCCGCTCGAC GCCAGCGCCG CGAGGAGCCC AGACACGCCT GCCTCCACGA TCAGCCAGGC GCCTCTCGGA GCCTACAAGG TAACCGTTAA CCTCGCCGGC GTAGGGACAC TCGCCGTGCA GACCATCAAG ATCGAGAAAG GCAGGTTCGA GACCAGCACG CAGATATTCG AGGTTAGGCT CGACATCTTC GACGTCAAGC TGACGTTCAC CTCGCCGTTC GGCACGCCGC TCGCCGGAGC AACTGTCACC ATAACCAAGC CCGACGGTAC AAGCATCACC GACAGCCTCG ACAGCTCTGG TAGCATAACC GTCAAGGAGG TGCCTCCGGG CAACCTCCAG TACACAGTGA AGGACTGGAA GGGCATAGCG ATAGGCTACT CCGGTAGCGT TGCGCGCGCC CCAGCCGTGG GCATTACCGT GCCGAAGATC GGCAAGCTGA CGGTCAAGGT GCTCGGAGCC AGGGGCCAGG GCATCGAAGG TGCCACCGTC GCCATCGAGA ATGTCGGCAC CTTCACGACC GACGCTAGCG GCATAGTAAG CCTCGAGCTA CCCAGCGGCA CCTACGCGGT AACGGCGAGC AAGGGCGGGA GGACTGCGAG CGCCACCGCC ACCGTGAGTG ACGGCAAGGA GACCGTCACC GAGCTGAAGC TCGACATCTT CCTCACCATC GCCGGCTGGG AGATGAGCAG CAGCGAGTTC CTCGGACTCA TACTGCTCGT AGTGCTCCTC GTCCTAGTGC TCTTCATAAT CGCCCACGAG TACGCTGTCT ACAGGCGCCG CCGCCTCGCC AAGGTAATCG CTCCGGCAGA AACGCAGGCA AAGTAA
|
Protein sequence | MHMSRKTLLG ILAVLAILAL AANTYAYPVK SAYNILQLPN GQPFANQQVI IVYFNETGNC ILAYAIGTTD STGKITLTIA QPGGVINQPT SGTYNMSVFW QAYGRTFLLY TTGSGQSGNT VLGLLNSTIT LNYIWNFTFQ AVTMINGQTV PLYFQDPEDA TRADVAYFEV YLYGKTGAPI FTSVGDSQGK SQGWVLVPIY EVNIKPTLSP QCYHIEKQWN VALYKEVYWL LDQGQGQKLK VLVGKENVTL SYDSATDTLT ATVKDLATGT TSTQTFTDAG HIGTLVDNAH TFLLQVAVKD ICGNSLNNWQ NPPISIELTS PSLGKLRAGK ANPSGVAPEE GYSFGGSVYF WLPDLTLIYK EKMGIAAEIG GIQVFSEAFN TTTTGSITYT KPVSFTPFTL TGSNGVYTAT IKVSIVPTKI TIKTSGLIAV QPLQGAIVQI SPLVGDPNTY TDANGNVALM PFQIVGGGTV GDGSGKPIIV RSGATNPGYL PVPYGLWSTG KVFTYTVRVL WALPGTEYYV DVTPDANTIE LNLTKAIETG CTVQSFVLFA KVYVVNMKAV DLCNRPITSA DDPNATLILT YTTPDGQTVQ FPAGLGSNGT TFAAFVPGGS FAVKLFFKGV LLDPVSGPNP LVVSGNIGNI SKATYTFPIG DLTLRITPWD VKEPLVNVSV LLEYFKAGAR TYYESHVTGC DGTVTFTKVP FKVAANSKIT VTITTRPDTP YIRHPKDDGL VIGKWDLTSL LNTTTPACNV GPIDVPTWVF SFTLEAVDHN GNVLKELPTS NGTAPVIVAI NDTYTKNEYN VTQVCQAGPG CLCWPLINVN YQIFNSTNYL GTPWGNGISE ARFKITGSQW MNSNYPHLFI AGAYYNFIVW YGGVMVYNYN FTLPAPSEPL DYSKVVTTVV LFNETTGATR TVETDKLDYT WVLANGGAVE HPIARLYGAP AWSGRYSVKL QLVTWVVNLD VYALSKIGAG LIPGLNVTLA RNDTVNWKKL VGNYTAITKP TITLGAVAWS AVTGSDGKAS LPVAIWQPKL KLANVKFGAS ITNVTVLAGK KYGTPNVPDT PTAGSLTVGN VNLGTLYGYI VGPYNVTTNA IFNDVSQSPD WYKFYGGTWN KFRGPQAWIG ANWNMTLWSG AAKVVYTAAM EGFCVSVTGP DFRDNLVPLA NQPVTVTVLG ASGASAAFAS AATGSDGTVT ISPDKGAAVA TPVGSLPVFT GKVAFLGVTG LKYTLSTKLN LDDALGLKNY GITTGQAFDP DTLSTTVSFN VTNNMPGGTC VALKWDAIKV TVFDWSGKPL KNMMVAAILR EPRAKAIPST AGFTAENGSV ILYVPPGTQK YQLIVYWRDS YLLRVAGKIP REIAIFDTVT DYDTPRTYAP GSGTTLETFV YVGIIMLRNA QGQALSPDIL SKITVEIQWP DLVVTTHKPE NDGRVPIILN KDTAKSWPLD ASAARSPDTP ASTISQAPLG AYKVTVNLAG VGTLAVQTIK IEKGRFETST QIFEVRLDIF DVKLTFTSPF GTPLAGATVT ITKPDGTSIT DSLDSSGSIT VKEVPPGNLQ YTVKDWKGIA IGYSGSVARA PAVGITVPKI GKLTVKVLGA RGQGIEGATV AIENVGTFTT DASGIVSLEL PSGTYAVTAS KGGRTASATA TVSDGKETVT ELKLDIFLTI AGWEMSSSEF LGLILLVVLL VLVLFIIAHE YAVYRRRRLA KVIAPAETQA K
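
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before use. The following is a minimal sketch of that cleanup plus a basic sanity check; the helper names `join_blocks` and `looks_like_complete_cds` are hypothetical, and the check assumes the nucleotide sequence includes its terminal stop codon.

```python
# Stop codons of NCBI translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text):
    """Remove the spaces and line breaks between the displayed blocks."""
    return "".join(text.split()).upper()


def looks_like_complete_cds(dna, protein):
    """Check frame, terminal stop codon, and DNA/protein length agreement."""
    return (
        len(dna) % 3 == 0
        and dna[-3:] in STOP_CODONS
        and len(dna) // 3 - 1 == len(protein)
    )


# First two blocks of the gene sequence shown above.
head = join_blocks("ATGCATATGA GTAGGAAAAC")
print(head, len(head))  # ATGCATATGAGTAGGAAAAC 20

# Toy example (not from the record): Met-Lys followed by a stop codon.
print(looks_like_complete_cds("ATGAAATAA", "MK"))  # True
```

Applied to the full record, `join_blocks` would be run on the complete Gene sequence and Protein sequence fields before calling `looks_like_complete_cds`.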