Gene Information |
Locus tag | Tpen_0789 |
Symbol | |
ID | 4601137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 741183 |
End bp | 742595 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773565 |
Product | hypothetical protein |
Protein accession | YP_920194 |
Protein GI | 119719699 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
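The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption about this record's convention, though it agrees with the listed values) and that the CDS ends in a single stop codon. The helper names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tpen_0789
length_bp = gene_length(741183, 742595)
print(length_bp)                  # 1413
print(protein_length(length_bp))  # 470
```

Both results match the Gene Length (1413 bp) and Protein Length (470 aa) rows.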
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.926627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGATA TGATGGGGTA TGAGGAAATC GAGAAGACTG TTAGGGAGGA GACGTGGAGG CGTGCGGCTC TAAGCTTGTA TGCCACCAGA ACAGAGGATA AGAGGAGGGG CAGGACGCAC TACCGCGGCC TCTTCGACAC TGTGACCGAG GTCGACTGGG ACTTCACCAG GTTCCTCGTA AACGGCTACA CGGTTGTGCC GGACGAGGCT TACCCCAGGT TCAGCAAGCT GTTCGACTAC GACGCCCGCC AGTATCTCTT GTTGAACGAC GACGAGAGGC CGAGGGAGGG AGGCGCAGTC ATAGAGCTCA GAGGAAGGTT GCAAGCAATC GTCGACGCCG GCGCCGATGG TCTTAGAGCC GAGAGGCATG GAAGGCACTG GCGCGTGTAC GTGCCACACG AGAACTGGCA CGTCGTCGTG AGCAAGCCTA CTAACGGCTG GTCCGTACAC GTACCGCTGG AGGGCTACTG GACTGAGACT AGTTTCCCGG AGGTTCTCGT GAGGACATCG CCAGACGTGC TTAGAAGCCT GCAGAGGGGG TGGATCCTGA CGGATGTGAC ACCCCCTCAT GGGCGCCACA GCGATGTACG TTTCAGCACA ACGCAACCGT GGCAGTTGCC AGCCACGCTC GCGGCCTTTC CTAGTGGCAA TATCCGCTTG GGCGTCACGG CGGGCATTCT TGGCAGTACC AGGCTAAGCA TCGAGTGGCA TGTACTCGTC TACGGCTACG AGGAGGAGCT GGGCTGGGCT TCAAGGCTTG TCGGCGAAGT TAAACGTGTA GAGTATCGCA GGCTGGTCGA GGAGTGCAAG GCGCTTAACG GCGATTCCGT GGCGCTACTG ACTGCGTATG AAGGAGACGG CATACTCGCG TATTTCCTGA GAATGAGAGA GCTTTACTTC AGAATTAGAC ATGAGATAGT TTACCTGCCA GCTGAGAGCG CCATTGTCAA TGCCCGCTTA GCTGTGGAGA GGGCTAGCGA GTACACAAAG TTCGTCTCAT TGGTGACGAA ATGCGCCAAG ATTAAACACT TCTTGTTCGT CGGCTACGGG ATACCGCAGA AGAGGGGTAG GAAGAACGGG CAGAAAAACA ACCCGTTCTA CGCCGAGATA GCGGGGGCTA AGCTACACGT AGTCTACTAC ACGGCTGATA ACCGTATTTA CGCGAGGATC GTGGTTGATG CTGTGCCTCT AGGCTGGGTG GAGGAGGCGC GTGCTCAGGG CTGGGACGTC CGGGTGGTTC GCATGGGGAG CAAGGAATAC TACCAGGTGA CTCACAATTC TCTCTTTGAA CACGCGCGTA GCGACGCAAA GCTACGCGCA ACGCTCCTCG CCTTCACAAG GTACAAGGCC ACACAGTACC CCAAGGCGCA GAGCCTTGTA AAGCTCTTAG AAGAGCTGGG GACAGAAGAC TAA
|
Protein sequence | MVDMMGYEEI EKTVREETWR RAALSLYATR TEDKRRGRTH YRGLFDTVTE VDWDFTRFLV NGYTVVPDEA YPRFSKLFDY DARQYLLLND DERPREGGAV IELRGRLQAI VDAGADGLRA ERHGRHWRVY VPHENWHVVV SKPTNGWSVH VPLEGYWTET SFPEVLVRTS PDVLRSLQRG WILTDVTPPH GRHSDVRFST TQPWQLPATL AAFPSGNIRL GVTAGILGST RLSIEWHVLV YGYEEELGWA SRLVGEVKRV EYRRLVEECK ALNGDSVALL TAYEGDGILA YFLRMRELYF RIRHEIVYLP AESAIVNARL AVERASEYTK FVSLVTKCAK IKHFLFVGYG IPQKRGRKNG QKNNPFYAEI AGAKLHVVYY TADNRIYARI VVDAVPLGWV EEARAQGWDV RVVRMGSKEY YQVTHNSLFE HARSDAKLRA TLLAFTRYKA TQYPKAQSLV KLLEELGTED
|
| |
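The gene sequence above is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly to check it against the protein sequence. The following is a minimal sketch, not the database's own method: it builds the codon table by hand, assuming the amino-acid assignments of translation table 11 match the standard code (the two differ in permitted start codons, which does not matter here because this CDS starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# assumed to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record
print(translate("ATGGTTGATA TGATGGGGTA TGAGGAAATC"))  # MVDMMGYEEI
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` allows the same comparison over the whole record.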