Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1290 |
Symbol | |
ID | 4600598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1231493 |
End bp | 1234237 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774066 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_920691 |
Protein GI | 119720196 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR02578] CRISPR-associated protein, Csm1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.726249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTCTCT CGCTGGAGCT TAGGGAGCCG TACCCCGTGT ACAGGGCGTT CAGGGTCCCC GTCTGGGGGA GGTGCGGCCC GGTCAGCGTG AGGAGGGTCG AGCCGGGCGA CGTGAAGGCC CTGGAGTGGT TCATCGGGGC GGCCAGGAGG CTGGCGGAGA GGCTCGCCGC GGAGGCCCGG GACGACTACG AGAAGCTGGA AGTCGTCGCG AACGTCGCCG CCGTGCTCCT CAAGGCCCCT CTCTCGCGGG AGCTCTCGCC CGTGCACCCG AGCCCTATGA AAGCCCAGGT GGCGCTCCTA GCCCTCCCCG ACGCCCTCTT CAAGAGGTAC TTCGGGCTGG TGAAGGGCGA CGTCTACTCG ATGCTCCGCG ACGTCCTCGG GCTCGACCTG GGGGCCATGG GGCTCGGCGA GGTCTTCGGG CGCGAGGCCT ACGAGAACGT GTACAGGCTG TGGGTAGCGT TCCCGGCTGA CACGAGGCCC GGGTACAACA CGTCGAGCCT AGCCGCCCAC CTGCTCATGA CCTCCGCTCT GGCGTGGGCC CTGGCCTACG AGTCCGGGAA GACGGAGGAG GAGAGGCGCA GGGAGGCCGC GGTCGCCAGG GTTGCCGCCC TCCTCCACGA CATAGGCAAG GCGGTGGACC CGGAGAGGCA CGCCGAGGAG TCTGCCAGGA TAGCCGAGTA CCTCCTCAAG GGGATAGTGG GCGACGAGGT CCTCGCCAGG GTGGTGGGGG AGGTCAGGGA GCACCACGCC CGCGAGGGCT ACGTGAGCAG GGCCGACAGG CTGGCGTCGG CCGCGGACAG GCTCGCCAAG GTCGTCGACA GGGCTATAGG GGACAGGGTC TCCAGGATGG AGGGGCTGGT GGGCGGGAGG AGGGACGACT GGGGCTTCTG GAGGAGGCTC TACGAGAGGC TCGACGACCT CAAGAGGGAG GGGCTCGCCA GGGAGGACCC CGTCAAGGAG CTGACTGAGC TCTTCCTGGA GAGAGCGGAG GAAGCCGCCG AGAATGTGAG GAAGGAGGAG AAGGAGGAGG GTGTGAAGGG CTTAACGCTA CTCCTCTTCG ACGTTGGCTC CATACAGGAG TTCGTCTACA GGAGCATGGA GCTCAGGGTT GTAGCCGCGG CGAGCCTCCT AGTGGACTTC GTGACGTACT CCTACCTGCC GCTCTACCTG AGGGCTAACG GCGTGCGCGT ACCGCCGGAG GCATTCCTCT ACTCCGGGGG CGGTATCCTC CTCATGTTGC TACCGGAGTC GCTCGCCGAG AGAGTCTGGG AGCTCGCCCG GAAGGTCAAA GAGGAGTTGC CGGAGCCGCT GAGGCTGGTG GTAGCGGATG CGGAGTTCCA CGTGGACTTC AAGACAGCCT GGGAGAAGAT CGAGGAGGCC CTGCTGATCG CCAAGCTCGG CGTAGAGCTG GCGGACGAGC CGCAGTTCGA CCCGCAGAAC GGCAGGGAGC TCTGCAGGCT CTGCCTGAGG GAGTGGGCCT CAACGAGTGT TCGCACGCCC GAAGGCGAGG TACCCGCGTG CCAGACATGC AGCAGGCTCT ACGATCTGGG CTCCGAGGTC CACTTCCGCA AGAAGTGGGA GTCGAGGGTC GAGGTGGGAG GCCGCTCCTT CACCCCGAGC GAGGCCTTCG ATGCGGTGTG GAATGATGTG TCGAAGTTCG TGGTAGAGGT TATCGCGGGC CACGACCCGA GGGAGGCGGC TAAGCCCAAG CGCCTCAGGG ACTGCGCGGT GATCAAGTTC GACGGGAACG CGATGGGAGC GTTCATGTCG AAGGCCCTGT CCTTCACGGA CGCCATCGAG AGGAGCTTCA GGGTGGACAT GGCCTTGAAG AACGCCTACC TCAAGGCCCT CGAAGCCCTC TACGAGGGGG TCAGGAGGGT CGCGGGCGAC GGGGCGGCGG AAGCCGAGGT GGCGAGGGTG TTCCTGGGGA CGCTGTACAT GGGAGGGGAC GACGGCGTCC TGATAGCTCC TGCGTGGGCG GCGCCCCTCC TAGCCCACTT CATAGCGGAG GAGTTCTCGA GGCAGCTCGG GCTGGTCGCG ACGCTCACGG CCTCCGTCGC CGCGGGGCCC GCCAGGATGA GCGTGTGGTC TCTCGTCGAC TGCGCGTCGG CGGAGATGGA GGAGGCGAAG CTCGCCGCTA CGAGGCACAG GTGCGGCGCG TTAGTCTTCG ACGTGTTCGA CTCCGGCTCG CCGTCCGGGG CGACAGCCAG GGAGAGGCTG AAGAGGTACT CGCGGAAGCT TGGCGAGAAG TCGAGGGACC AGCTCATAGA CGGCTACCAG CCCTACCTCA TCGAGAGGAA GGCTCTCAGC GGGGAGGGAG TCCCGGAGGC GTGGGCCAGG CTGTTCAGCC ACGTCCTCGG GGTTCCGGCG CGGGGCGCCT GGTGCGAGGA CTCCTCGTTC AAAGCCCACG CGGAGGCTTT CGGGAAAGCT TACCTGGCTT CGAGGGCGGA GGGCGAGGAC GAGGAGGTCG AGAGGGCTAG GAAGAGGCTG GCGGCGCTGA GGAGGGTCGC CCTGGACACG TGGAGGGAGG TCTCGGGCTC CGCGTACTGG AGGGAGCAGG CGTACATATA CGTGCTCAGG CAGCTGGAGT CCGAGGCCCT CGGCGAGGAG ACAAGCGAGG CGTACGGCGC GCTCAAGCGC TTCATCGAGT CGAACCTGTT CGACGAGTCT GGGAACGCGT CGGGCTACGT GCCCCTGGTC GACCTCCTCA CGTTCATAAA GCTGGTTAAG GGTGGTGCGT GGTGA
|
Protein sequence | MSLSLELREP YPVYRAFRVP VWGRCGPVSV RRVEPGDVKA LEWFIGAARR LAERLAAEAR DDYEKLEVVA NVAAVLLKAP LSRELSPVHP SPMKAQVALL ALPDALFKRY FGLVKGDVYS MLRDVLGLDL GAMGLGEVFG REAYENVYRL WVAFPADTRP GYNTSSLAAH LLMTSALAWA LAYESGKTEE ERRREAAVAR VAALLHDIGK AVDPERHAEE SARIAEYLLK GIVGDEVLAR VVGEVREHHA REGYVSRADR LASAADRLAK VVDRAIGDRV SRMEGLVGGR RDDWGFWRRL YERLDDLKRE GLAREDPVKE LTELFLERAE EAAENVRKEE KEEGVKGLTL LLFDVGSIQE FVYRSMELRV VAAASLLVDF VTYSYLPLYL RANGVRVPPE AFLYSGGGIL LMLLPESLAE RVWELARKVK EELPEPLRLV VADAEFHVDF KTAWEKIEEA LLIAKLGVEL ADEPQFDPQN GRELCRLCLR EWASTSVRTP EGEVPACQTC SRLYDLGSEV HFRKKWESRV EVGGRSFTPS EAFDAVWNDV SKFVVEVIAG HDPREAAKPK RLRDCAVIKF DGNAMGAFMS KALSFTDAIE RSFRVDMALK NAYLKALEAL YEGVRRVAGD GAAEAEVARV FLGTLYMGGD DGVLIAPAWA APLLAHFIAE EFSRQLGLVA TLTASVAAGP ARMSVWSLVD CASAEMEEAK LAATRHRCGA LVFDVFDSGS PSGATARERL KRYSRKLGEK SRDQLIDGYQ PYLIERKALS GEGVPEAWAR LFSHVLGVPA RGAWCEDSSF KAHAEAFGKA YLASRAEGED EEVERARKRL AALRRVALDT WREVSGSAYW REQAYIYVLR QLESEALGEE TSEAYGALKR FIESNLFDES GNASGYVPLV DLLTFIKLVK GGAW
|
| |