Gene Information |
Locus tag | Tpen_0481 |
Symbol | |
ID | 4601875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 437003 |
End bp | 438427 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639773249 |
Product | hypothetical protein |
Protein accession | YP_919893 |
Protein GI | 119719398 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.264378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGAGG AGTACATGAA GGGAGATGGA GCCGTGACGG GGGAGGGCGT CGAAGAGTTC GACGAGGGGT TCATTCGCGA GAAGCTTAAA GAGCTTCTGA GGAAAGCTAT CTTCCACCGA TTCAACTACA GTGGTCTCGA GGCTACGTGC ACGCGCGTAA TGTGGGAGTT GATTAACCTC AAGGCTGAGT ACCAGCGCTA CGGCGAGGTC TACGTAGAGC TTACCAACAA AGCGGAGGTA GGCGTCGAGA AGGCTTGCAG AATAGCGAAG AGAGTGTTTA GAGAAGCCTT GGAGCACTTC GAAGAGCTCA AGGTACGGAG GACTGGGAAG GCAACGTGGA AGGTTAAGAT TCCGGGCGAG AAGTGTAGAA TCTACGTGTA TAGGAAGCCA GCCGGGCACT GGGCTGTTGA GATTCGCTTG TATATCAGAG TCACCAAGTT CATTGTTCCC GACACTCTAA GGCTTCCACC GGAGCTACTA GTAGACGCAC AGACGGGCTG GTTGTACGGA GATGCATCGT ACATAGCGAG CCGCAAAGAT GTCAGAATGG GCACCTCACA AGCATGGCAA GTAACCTCCT TCCCCGGTTT CTGGCCGGGG AAGGAGGTCG AAGTATACAT CAGAAGTGTA GTAATCCACG AGACTCATGT CAGCGTCGTG TGGACTGTTA GAGTTAAGGG TGTCCGCAAC GCTCCTAAGG AATGGAGGCT AAGGAAGGAA GAAAAGCAGA GTGTTATCCT GTCAGAGATT AAGGCAGCGA ACGAAGGAGA AATAGACATT TTCAGGGCTG TTCGAATTGC AACATACTAC GCCGCAGACG GAAAGTATCC AGGGCCAAAC ACGGCTAAAC GTTTGCTGGA ATTCGGGGTT GGCAACGAGC CGTACTGGAT TAGAGTCGAG GGTGCGGTGA GGATTGCGAA GCTTTTACAC GAGGAGGTAC CACAGCTATT AGCATTCATG TGCAGTGCTG GTTGCAAGAA GGCACGGTAT CTTGCCCGCC TCGCGCTCGT TGAGCCGAAA CGCAACCTCT CGCCGCGCTA CTTAGAGGTC GCCGGCGTAC GGATGAACCT GCAACTTGTA GGTACTAAGA ACTACCGTAC TCTTATAGCA AGAGTTTTCA TCACAAATAA TAATGAGGAG TTACTCAGGG GTTTTCCTGA GCGGGCAAGG GAAGAGGGTC TCATAGTTAA AAAGATGAAG ATAGACAAGA AGTACTACGG CTACTACGCT GGCTTGCGCG AGTTGATGAG CTATGCCGAT AAACACTTAG AGGCGTATGA CATATTGATC GACTTTGTCA AGGGGGAGCT CGAAGAAATG CCTCTAGACC ACCCCGCCCG CCAAAGCGTC GAAAGGCTTC TTGAACGCCT GAAAAAAGCT AGAGAACGCG CACTCAGAAA GCACGCTGGG GGCGAAAACA ACTAA
|
Protein sequence | MVEEYMKGDG AVTGEGVEEF DEGFIREKLK ELLRKAIFHR FNYSGLEATC TRVMWELINL KAEYQRYGEV YVELTNKAEV GVEKACRIAK RVFREALEHF EELKVRRTGK ATWKVKIPGE KCRIYVYRKP AGHWAVEIRL YIRVTKFIVP DTLRLPPELL VDAQTGWLYG DASYIASRKD VRMGTSQAWQ VTSFPGFWPG KEVEVYIRSV VIHETHVSVV WTVRVKGVRN APKEWRLRKE EKQSVILSEI KAANEGEIDI FRAVRIATYY AADGKYPGPN TAKRLLEFGV GNEPYWIRVE GAVRIAKLLH EEVPQLLAFM CSAGCKKARY LARLALVEPK RNLSPRYLEV AGVRMNLQLV GTKNYRTLIA RVFITNNNEE LLRGFPERAR EEGLIVKKMK IDKKYYGYYA GLRELMSYAD KHLEAYDILI DFVKGELEEM PLDHPARQSV ERLLERLKKA RERALRKHAG GENN
|
| |
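
The derived fields in this record can be cross-checked against the coordinates and sequence it lists. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates (which is consistent with the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(437003, 438427)
    print(length)                  # 1425, matching Gene Length
    print(protein_length(length))  # 474, matching Protein Length
    # First two 10-base blocks of the gene sequence, as formatted in the record.
    print(gc_fraction("ATGGTTGAGG AGTACATGAA"))
```

Passing the full Gene sequence value to `gc_fraction` gives a figure that can be compared with the listed GC content, which the record reports rounded to a whole percent.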