Gene Tpen_0742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0742 
Symbol 
ID4601149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp689824 
End bp694959 
Gene Length5136 bp 
Protein Length1711 aa 
Translation table11 
GC content58% 
IMG OID639773518 
Producthypothetical protein 
Protein accessionYP_920147 
Protein GI119719652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATATGA GTAGGAAAAC CCTATTAGGA ATACTGGCAG TACTCGCAAT ACTAGCACTA 
GCCGCAAACA CATATGCCTA CCCGGTTAAG AGCGCATACA ACATACTCCA GCTTCCCAAC
GGACAGCCCT TCGCGAACCA GCAGGTAATA ATTGTCTACT TCAACGAGAC CGGGAACTGT
ATACTTGCCT ATGCTATAGG CACGACGGAC TCCACCGGGA AGATAACACT CACAATCGCG
CAGCCCGGCG GAGTGATAAA CCAGCCGACT AGTGGTACAT ACAACATGTC CGTCTTCTGG
CAGGCCTACG GTAGAACATT CCTGCTTTAC ACCACAGGAA GTGGTCAAAG TGGAAACACA
GTCCTGGGGC TTTTGAATAG CACAATTACA CTGAACTACA TTTGGAACTT CACATTCCAA
GCTGTAACCA TGATTAACGG GCAAACGGTG CCTCTCTACT TCCAGGATCC TGAGGATGCT
ACGAGGGCGG ACGTAGCATA CTTCGAGGTA TACCTCTACG GTAAGACGGG CGCGCCCATA
TTCACCTCTG TCGGCGACAG TCAAGGAAAG TCTCAGGGCT GGGTTCTAGT TCCTATCTAC
GAGGTTAACA TCAAGCCGAC GCTTTCTCCT CAGTGCTACC ACATTGAGAA GCAGTGGAAC
GTCGCGCTGT ACAAGGAGGT GTACTGGCTA CTCGACCAGG GACAAGGACA GAAGTTAAAG
GTGCTCGTTG GGAAGGAGAA CGTCACCCTC AGCTACGACT CTGCTACTGA CACCTTAACC
GCGACTGTAA AGGACCTGGC AACAGGCACC ACGTCCACCC AGACGTTCAC GGATGCCGGC
CATATAGGCA CGCTTGTTGA TAACGCGCAC ACGTTCCTTC TCCAGGTCGC TGTCAAGGAT
ATATGCGGTA ATTCGCTGAA CAACTGGCAG AACCCACCGA TAAGCATAGA GCTGACCTCG
CCGAGCCTAG GCAAGCTCAG GGCAGGCAAA GCCAACCCGA GCGGAGTAGC GCCAGAGGAA
GGCTATTCGT TTGGCGGTAG CGTCTACTTC TGGTTGCCGG ACCTCACGCT GATCTACAAG
GAGAAGATGG GCATCGCCGC CGAAATAGGC GGAATCCAGG TATTCTCCGA AGCATTTAAC
ACGACAACGA CGGGTAGCAT TACCTACACT AAACCCGTAT CCTTCACCCC GTTCACTCTA
ACCGGAAGCA ACGGAGTATA CACTGCTACG ATAAAGGTAA GCATCGTACC AACAAAGATA
ACGATCAAGA CGAGCGGCCT CATAGCCGTT CAGCCGCTCC AGGGCGCTAT CGTGCAGATA
AGCCCACTTG TCGGCGACCC CAACACCTAC ACCGACGCTA ACGGAAACGT GGCCCTAATG
CCGTTCCAGA TAGTTGGAGG AGGAACTGTC GGCGACGGAT CCGGGAAGCC GATAATCGTA
CGCTCGGGAG CGACGAACCC AGGCTACCTG CCCGTACCCT ACGGGCTGTG GAGCACCGGT
AAAGTCTTCA CGTACACTGT CCGCGTGCTC TGGGCGCTAC CCGGCACCGA GTACTACGTC
GACGTAACGC CGGACGCTAA CACAATAGAG CTCAACCTGA CGAAGGCCAT TGAAACCGGA
TGCACGGTGC AGAGCTTCGT GCTGTTCGCT AAGGTGTACG TGGTCAACAT GAAGGCCGTT
GACCTCTGCA ACAGGCCGAT AACGAGTGCA GACGACCCCA ACGCGACGCT TATACTCACG
TACACAACGC CGGACGGGCA GACAGTCCAG TTCCCCGCAG GGCTCGGTAG CAACGGTACA
ACGTTCGCTG CTTTCGTACC TGGAGGAAGC TTCGCCGTCA AGCTGTTCTT CAAGGGAGTC
CTCCTGGACC CCGTCTCCGG TCCTAACCCG CTCGTAGTAA GCGGGAACAT CGGTAACATC
AGTAAGGCTA CCTACACGTT CCCGATAGGC GACTTAACGC TGAGGATCAC GCCGTGGGAC
GTCAAAGAGC CGCTAGTCAA CGTATCCGTG CTCCTAGAGT ACTTCAAGGC AGGCGCTAGG
ACCTATTATG AGAGCCATGT AACAGGCTGC GACGGCACGG TCACGTTCAC CAAGGTGCCC
TTCAAGGTCG CCGCGAACAG CAAGATAACC GTGACTATAA CCACTAGGCC CGATACCCCG
TACATACGCC ACCCCAAGGA CGACGGCCTC GTCATCGGTA AGTGGGACCT GACGAGCCTC
CTCAACACTA CAACGCCCGC GTGCAACGTA GGACCTATCG ACGTGCCGAC CTGGGTGTTC
AGCTTCACTC TTGAGGCCGT AGACCACAAC GGCAACGTGC TGAAAGAGCT CCCGACCAGT
AACGGCACAG CCCCCGTTAT AGTCGCCATA AACGACACCT ACACGAAGAA CGAGTACAAC
GTCACCCAGG TATGCCAGGC AGGGCCCGGC TGTCTCTGCT GGCCGCTCAT AAACGTCAAC
TACCAGATAT TCAACTCGAC CAACTACCTC GGCACGCCGT GGGGCAACGG CATCTCCGAG
GCAAGGTTCA AGATAACCGG TAGCCAGTGG ATGAACAGCA ACTACCCGCA CCTCTTCATA
GCCGGCGCGT ACTACAACTT CATAGTCTGG TACGGCGGAG TAATGGTCTA CAACTACAAC
TTCACGCTAC CGGCGCCCAG CGAGCCGCTC GACTACAGCA AGGTCGTAAC GACGGTGGTA
CTCTTCAACG AGACGACGGG CGCGACGAGG ACCGTTGAGA CGGACAAGCT CGACTACACC
TGGGTACTCG CGAATGGCGG CGCCGTCGAG CACCCGATAG CCAGGCTCTA CGGAGCACCC
GCGTGGAGCG GTAGGTACAG CGTCAAGCTC CAGCTGGTAA CCTGGGTAGT CAACCTCGAC
GTCTACGCGC TCAGCAAGAT AGGCGCGGGC CTCATACCTG GCCTCAACGT AACCCTCGCT
AGGAACGACA CCGTCAACTG GAAGAAGCTC GTAGGGAACT ATACCGCTAT AACGAAGCCC
ACGATAACTC TGGGCGCTGT AGCGTGGAGC GCCGTCACCG GTAGCGACGG CAAGGCATCG
CTACCCGTCG CGATCTGGCA GCCCAAGCTC AAGCTCGCTA ATGTGAAGTT CGGAGCCAGC
ATAACCAACG TCACGGTGCT GGCCGGCAAG AAGTATGGGA CGCCCAACGT TCCAGACACG
CCGACCGCGG GCTCGCTCAC TGTAGGCAAT GTCAACTTAG GCACCCTCTA CGGCTACATA
GTAGGACCCT ACAACGTTAC CACGAACGCT ATATTCAACG ACGTGAGCCA GAGCCCCGAC
TGGTACAAGT TCTACGGCGG GACCTGGAAC AAGTTCCGCG GACCGCAGGC GTGGATCGGC
GCCAACTGGA ACATGACGCT GTGGAGCGGC GCGGCTAAGG TCGTCTACAC CGCCGCCATG
GAGGGCTTCT GTGTCAGCGT GACAGGCCCA GACTTCAGGG ACAACCTCGT GCCGCTCGCC
AACCAGCCCG TAACGGTGAC CGTCCTCGGT GCCTCGGGTG CCTCCGCGGC TTTCGCCTCC
GCGGCTACCG GTAGCGACGG AACGGTGACT ATTAGCCCCG ACAAGGGCGC CGCCGTAGCG
ACGCCCGTTG GTAGCCTGCC GGTGTTCACA GGCAAGGTAG CCTTCCTCGG CGTTACAGGC
CTCAAGTACA CTCTGTCCAC GAAGCTCAAC CTCGACGACG CGCTCGGCCT CAAGAACTAC
GGTATAACCA CGGGCCAGGC GTTCGACCCA GACACGTTGT CGACGACTGT TAGCTTCAAC
GTTACCAACA ACATGCCCGG CGGTACGTGC GTGGCGCTGA AGTGGGACGC CATAAAGGTG
ACGGTCTTCG ACTGGAGCGG TAAGCCTCTG AAGAACATGA TGGTAGCCGC CATACTCAGG
GAGCCCAGGG CCAAGGCTAT TCCGAGCACT GCCGGCTTCA CCGCGGAGAA CGGTAGCGTG
ATACTCTACG TGCCGCCAGG CACGCAGAAG TACCAGCTCA TCGTCTACTG GCGCGACAGC
TACCTGCTGA GGGTTGCCGG CAAGATTCCG AGGGAGATAG CGATATTCGA CACCGTCACC
GACTACGATA CTCCGAGAAC CTACGCGCCC GGTAGCGGCA CGACGCTCGA AACCTTCGTC
TACGTCGGCA TAATAATGCT CAGGAACGCG CAGGGCCAGG CCCTGTCGCC CGACATACTC
AGCAAGATAA CGGTCGAGAT ACAGTGGCCA GACCTCGTAG TGACCACTCA CAAGCCCGAG
AACGACGGTA GAGTACCCAT AATACTCAAC AAGGACACCG CTAAGAGCTG GCCGCTCGAC
GCCAGCGCCG CGAGGAGCCC AGACACGCCT GCCTCCACGA TCAGCCAGGC GCCTCTCGGA
GCCTACAAGG TAACCGTTAA CCTCGCCGGC GTAGGGACAC TCGCCGTGCA GACCATCAAG
ATCGAGAAAG GCAGGTTCGA GACCAGCACG CAGATATTCG AGGTTAGGCT CGACATCTTC
GACGTCAAGC TGACGTTCAC CTCGCCGTTC GGCACGCCGC TCGCCGGAGC AACTGTCACC
ATAACCAAGC CCGACGGTAC AAGCATCACC GACAGCCTCG ACAGCTCTGG TAGCATAACC
GTCAAGGAGG TGCCTCCGGG CAACCTCCAG TACACAGTGA AGGACTGGAA GGGCATAGCG
ATAGGCTACT CCGGTAGCGT TGCGCGCGCC CCAGCCGTGG GCATTACCGT GCCGAAGATC
GGCAAGCTGA CGGTCAAGGT GCTCGGAGCC AGGGGCCAGG GCATCGAAGG TGCCACCGTC
GCCATCGAGA ATGTCGGCAC CTTCACGACC GACGCTAGCG GCATAGTAAG CCTCGAGCTA
CCCAGCGGCA CCTACGCGGT AACGGCGAGC AAGGGCGGGA GGACTGCGAG CGCCACCGCC
ACCGTGAGTG ACGGCAAGGA GACCGTCACC GAGCTGAAGC TCGACATCTT CCTCACCATC
GCCGGCTGGG AGATGAGCAG CAGCGAGTTC CTCGGACTCA TACTGCTCGT AGTGCTCCTC
GTCCTAGTGC TCTTCATAAT CGCCCACGAG TACGCTGTCT ACAGGCGCCG CCGCCTCGCC
AAGGTAATCG CTCCGGCAGA AACGCAGGCA AAGTAA
 
Protein sequence
MHMSRKTLLG ILAVLAILAL AANTYAYPVK SAYNILQLPN GQPFANQQVI IVYFNETGNC 
ILAYAIGTTD STGKITLTIA QPGGVINQPT SGTYNMSVFW QAYGRTFLLY TTGSGQSGNT
VLGLLNSTIT LNYIWNFTFQ AVTMINGQTV PLYFQDPEDA TRADVAYFEV YLYGKTGAPI
FTSVGDSQGK SQGWVLVPIY EVNIKPTLSP QCYHIEKQWN VALYKEVYWL LDQGQGQKLK
VLVGKENVTL SYDSATDTLT ATVKDLATGT TSTQTFTDAG HIGTLVDNAH TFLLQVAVKD
ICGNSLNNWQ NPPISIELTS PSLGKLRAGK ANPSGVAPEE GYSFGGSVYF WLPDLTLIYK
EKMGIAAEIG GIQVFSEAFN TTTTGSITYT KPVSFTPFTL TGSNGVYTAT IKVSIVPTKI
TIKTSGLIAV QPLQGAIVQI SPLVGDPNTY TDANGNVALM PFQIVGGGTV GDGSGKPIIV
RSGATNPGYL PVPYGLWSTG KVFTYTVRVL WALPGTEYYV DVTPDANTIE LNLTKAIETG
CTVQSFVLFA KVYVVNMKAV DLCNRPITSA DDPNATLILT YTTPDGQTVQ FPAGLGSNGT
TFAAFVPGGS FAVKLFFKGV LLDPVSGPNP LVVSGNIGNI SKATYTFPIG DLTLRITPWD
VKEPLVNVSV LLEYFKAGAR TYYESHVTGC DGTVTFTKVP FKVAANSKIT VTITTRPDTP
YIRHPKDDGL VIGKWDLTSL LNTTTPACNV GPIDVPTWVF SFTLEAVDHN GNVLKELPTS
NGTAPVIVAI NDTYTKNEYN VTQVCQAGPG CLCWPLINVN YQIFNSTNYL GTPWGNGISE
ARFKITGSQW MNSNYPHLFI AGAYYNFIVW YGGVMVYNYN FTLPAPSEPL DYSKVVTTVV
LFNETTGATR TVETDKLDYT WVLANGGAVE HPIARLYGAP AWSGRYSVKL QLVTWVVNLD
VYALSKIGAG LIPGLNVTLA RNDTVNWKKL VGNYTAITKP TITLGAVAWS AVTGSDGKAS
LPVAIWQPKL KLANVKFGAS ITNVTVLAGK KYGTPNVPDT PTAGSLTVGN VNLGTLYGYI
VGPYNVTTNA IFNDVSQSPD WYKFYGGTWN KFRGPQAWIG ANWNMTLWSG AAKVVYTAAM
EGFCVSVTGP DFRDNLVPLA NQPVTVTVLG ASGASAAFAS AATGSDGTVT ISPDKGAAVA
TPVGSLPVFT GKVAFLGVTG LKYTLSTKLN LDDALGLKNY GITTGQAFDP DTLSTTVSFN
VTNNMPGGTC VALKWDAIKV TVFDWSGKPL KNMMVAAILR EPRAKAIPST AGFTAENGSV
ILYVPPGTQK YQLIVYWRDS YLLRVAGKIP REIAIFDTVT DYDTPRTYAP GSGTTLETFV
YVGIIMLRNA QGQALSPDIL SKITVEIQWP DLVVTTHKPE NDGRVPIILN KDTAKSWPLD
ASAARSPDTP ASTISQAPLG AYKVTVNLAG VGTLAVQTIK IEKGRFETST QIFEVRLDIF
DVKLTFTSPF GTPLAGATVT ITKPDGTSIT DSLDSSGSIT VKEVPPGNLQ YTVKDWKGIA
IGYSGSVARA PAVGITVPKI GKLTVKVLGA RGQGIEGATV AIENVGTFTT DASGIVSLEL
PSGTYAVTAS KGGRTASATA TVSDGKETVT ELKLDIFLTI AGWEMSSSEF LGLILLVVLL
VLVLFIIAHE YAVYRRRRLA KVIAPAETQA K