Gene Information |
Locus tag | Tpen_1772 |
Symbol | |
ID | 4601949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1714395 |
End bp | 1715504 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774545 |
Product | THUMP domain-containing protein |
Protein accession | YP_921170 |
Protein GI | 119720675 |
COG category | [R] General function prediction only |
COG ID | [COG1818] Predicted RNA-binding protein, contains THUMP domain |
TIGRFAM ID | |
|
|
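The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1714395, 1715504)
print(length)                           # 1110, matching the Gene Length field
print(expected_protein_length(length))  # 369, matching the Protein Length field
```

Because the gene lies on the minus strand, Start bp and End bp describe the position on the replicon and not the direction of transcription. The length calculation is the same for either strand.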
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.375193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGGTA GCGCTTTGAT CATAACGTGT AAGCTCGGCT TCGAGAGGGT AGTAGCGTCC TACGTAGAGG AGCTCGACCC GGCGGCGGAG GTTGAGGCTA CGCCGCAAGG CTTCGCCGGG CTTGTACTGG TTCGGCCGGG CAACCTTAAA GCCGAGGAGC TCGCACGCGC GGTTAAAGAG AGGGTTCCGG AAGCCGAGAA GGTGTTCGTC GCGGACGCTG AGTGCAACGC GAGCATAGAG GAGATAGTCA GGTGCGCGGT GGGGATAAGC GCCGGGATAA GCCGGGAGGA GAGCTTCGCC GTTAGGACTG TGAGGAGGGG TAGCCACGGC TTTACAAGCC TAGACGTGAA CGTAGCCGTG GGTTCAGCAG TGAAGGAAGC AACGGGGGCC AGGGTCGACC TGGAGAACCC CGACAAGGTA ATCTTCGTGC AGATACTGCA GGACAAAGCT TACCTGTCCC TGGTCCCGGG CTCCGAGTTC TACAAGAAGA TGCCCTCCTC GAAGTACCCC GTGTACAAGG TCTTCAGAAA GCTCGTCGTG GCACACGAGC CGTACCTCGG TCCTCCGGAC GCGTCGTACG TCATGGGTAC CCGTATCGGC AGGGAGGTCC AGGTGTTCGA GGTCGGAAAG CTGTACGTAA CGCCCGTGGG GGCGGTTGAC GCGTACTCCC TCTACAGCTT CCTGAGGGGC GCCTTCGAGG GGCAGAGGTC CCGGTTCGAG TTGCAGAAGA GGAGCTACGG GAGGGAGGTC GTCAAGACGG AGGTGTACGT GCAGGACATG TACCAGTTCG CGAGGTCCAG GCTCGGGAAG CCGCTCATAA TATTCGAGCC CGAGGGGGAG CCCGTGTCCA GGGTGGCCGG GGAGGTCGCG GACTTCATAA TAAGGAAGGT CTTCAAGGAG AAAGAGGAGG TAGCGATAAT GGTTGGGGCT AGAGAAGGCG TGCCGACGGG GCTTTTCCGG TTTGCGGACT TCGTTCTCGA CGTGGCCCCC GGGGTGGTTA TCTCGACGGA GTACGCCCTT TCCTCCGGGC TGATAGCGCT CGCCACGATA CTTCACGAGA AGCTCGTAGA GGCGGCGTCC AGCGGGGAGC TAGGCGCGGG CGAGCCTTAG
|
Protein sequence | MGGSALIITC KLGFERVVAS YVEELDPAAE VEATPQGFAG LVLVRPGNLK AEELARAVKE RVPEAEKVFV ADAECNASIE EIVRCAVGIS AGISREESFA VRTVRRGSHG FTSLDVNVAV GSAVKEATGA RVDLENPDKV IFVQILQDKA YLSLVPGSEF YKKMPSSKYP VYKVFRKLVV AHEPYLGPPD ASYVMGTRIG REVQVFEVGK LYVTPVGAVD AYSLYSFLRG AFEGQRSRFE LQKRSYGREV VKTEVYVQDM YQFARSRLGK PLIIFEPEGE PVSRVAGEVA DFIIRKVFKE KEEVAIMVGA REGVPTGLFR FADFVLDVAP GVVISTEYAL SSGLIALATI LHEKLVEAAS SGELGAGEP
|
| |