Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0785 |
Symbol | |
ID | 4601133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 735267 |
End bp | 737153 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639773561 |
Product | hypothetical protein |
Protein accession | YP_920190 |
Protein GI | 119719695 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGGAGAG AGCACTGGTC GACGTTTCTG CTGTGTTTTC TCTTGCTGGT TCCAGCGCTC CGCGCTGCGC CCATCGCAGA GCACTACTAC GTGGAGGTCC TAGAGTACGT GGCTGGAGCT TGGAAGAGTA GGTACGTGCC CGTCAACGTC AATTCCCCCT CGCTGACGGT TCAAACAGTT GCAAGCGTGC TCGTCGTGCG CTCGGATCCC TCGTCCGGCC TGCAACCCGT ATCCGTAAGT GTCGACGGCT CCAGGTACAG CGTTGTGAAC GGGACGGGCG TTCTGTGGTT CGCGGCGTCC GTGGCGCTAG ACGGGAAAAT GCACGTGGTG GAGGTAAGGT TCCAGAAGGT CAGCCCGGGG CCCGTAGTTT CCGGAGTTAT AGGGGTGCAG TCAAGCCTTC CGTCGTCGCC TAGCTTCAAC GTGTCGGTAC CGCCGCTTCC CGGCTTCGTC GCCGCCGGCG TGAGGTTGGA GCTTTTACTC CTGTCGCCCG GCGACGTGTT CAAAGTCCTC GACAAGCCAT TCTTCGTGCT AAATTCCTCT TCGATAAGAG TCTTAGGGCA GGATATATTC GTCGCGGACG TAGTCGTGCC TTTCCTCAAC GTGTCGCTAC GACCCGGTGC TATCAGCGTG AAGGCGTGGT ACCTCTACTT TATACCTCCG AGCGACGACG AGGTGGTGTA CCCGCCTTAC AGCTTCAGGC TATTCTTCGC AAACCACCCC TCGCTCCCGG GAGTCGAGGA AAACGCTCTA GCCGGGCGGC CACCCCACGT ATTGCTACGT TTCGCGAGGG ACCTCTCGGA GAAGGCTGGG CTACGGAGCT ACAACGTTAG CGTGACCGTA GCTAAGCCTG AGAGCCTCTG TGGGGTCAAG GACTACTCGT ACAGGGTTAT TCCCCCCGAG GGAGGCGTGT TGGAAGGCTC TAGAGTAATC CTGGGCGCCA ACACTACCAT CACTGTGAGG TTTTTCTCCG CCGGCATCTC CCTGGGAGAC GTCGTCGTCT ACACGCCGCC CCCCGAGCTA CTCGTACAGC CGCCTATCTA CAGCCTCTCG TTGAAATTCA CTGATATCGC GGGCTACCCG TTAAACAACA CATACTTCGT AGTGTACAGG GCCGGCGTCC CCGTGTACTC GGGCATCGCC AGGGGAGGGG AAGCCGTGGT CTGCCCCCTA GCCGCGGGTA CCTACGACGT CGTAGCATAC GTAGCGTCTA GGGTCGTGGG GAGGGGGCGC GTGACGCTCC TAGGCGACTC GGCCGCAGAA ATACTCACGA ACACCACGAC TGTGAGCTTC CAGTTCGTGC GGCAAGGAGC CGGCGAAGTG CTCACCTCTT ACAAGGCTGT CCTCAAAGGA GCTGTTGAGC TCGTCGCGAA CAGTTCCGCG GAGGGCCTAG CCGTGTTCCA CGGAGTCCCC CCGGGTACCT ACTCGCTCGA GGTGTACTGG AACAACACTA GGCTTGCGAG GTACTCAGTC GAGGTAGACT TGAAGGGCGG CAGGAGCGTT CTCTCGATAC AGGCATACAG GCTTCAAGTG TTGGTGAGAA ACCTCCTGGA CCAACCGGTG AAGGGTGCCG TGGTTTTCCT CGAGGGAGGA GGCTTCTCGT CTACCAGGCT CACAGACGAG GCTGGAAGGG CGGACTTCGG GCTAGTACCG GCGGGGAACT ACACGTTGAT AGTAGAGGGG GCCCAGCCCC AAACCGTCCG GCTGATCTCC GACACGTTCA GAGTCGTACA GGTAGACGAG ATCGTGAAAA TAGGCGGTTT CACGGTGACC GGTAAAGTCG CGCTTTACGC ATTGGTGGCA ACAGTCTTCC TCGCAGCAAT CGTTGCGGTT AGGCGAGCTT TGAAACGGAG GGAAAAGGGT ATAGAGGAGG TAGACTTTGC GCGATGA
|
Protein sequence | MRREHWSTFL LCFLLLVPAL RAAPIAEHYY VEVLEYVAGA WKSRYVPVNV NSPSLTVQTV ASVLVVRSDP SSGLQPVSVS VDGSRYSVVN GTGVLWFAAS VALDGKMHVV EVRFQKVSPG PVVSGVIGVQ SSLPSSPSFN VSVPPLPGFV AAGVRLELLL LSPGDVFKVL DKPFFVLNSS SIRVLGQDIF VADVVVPFLN VSLRPGAISV KAWYLYFIPP SDDEVVYPPY SFRLFFANHP SLPGVEENAL AGRPPHVLLR FARDLSEKAG LRSYNVSVTV AKPESLCGVK DYSYRVIPPE GGVLEGSRVI LGANTTITVR FFSAGISLGD VVVYTPPPEL LVQPPIYSLS LKFTDIAGYP LNNTYFVVYR AGVPVYSGIA RGGEAVVCPL AAGTYDVVAY VASRVVGRGR VTLLGDSAAE ILTNTTTVSF QFVRQGAGEV LTSYKAVLKG AVELVANSSA EGLAVFHGVP PGTYSLEVYW NNTRLARYSV EVDLKGGRSV LSIQAYRLQV LVRNLLDQPV KGAVVFLEGG GFSSTRLTDE AGRADFGLVP AGNYTLIVEG AQPQTVRLIS DTFRVVQVDE IVKIGGFTVT GKVALYALVA TVFLAAIVAV RRALKRREKG IEEVDFAR
|
| |