Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1658 |
Symbol | |
ID | 4601735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1604310 |
End bp | 1605644 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774431 |
Product | rhomboid family protein |
Protein accession | YP_921056 |
Protein GI | 119720561 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.598011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTGC CGTTTCAGCG AGACTACGAG GACATAGAGA GGCGTGGAGC CCCGGTCACG CTTGGAATAA TACTCGTAAA CGTGGCGGTC TACCTCGTGT CGAGCTACGA GAACGGCTTC CTCGCGGTCT CCGACGCCTG GGTTAACGCG TTCGCGTTTG TCCCAGCGTA CTTCGCGAGG CCCGAGCACC TCTACAGGCT CTTCACTTCG ATGTTCCTGC ACGCCAACCT GGCGCACATA TTCTTCAACA TGCTTTACCT CTACACGTTC GGGAAGAGCG TCGAGGCAGT CCTCGGTAGC GAGAGGTACT TCCTGCTTTA CTTCGCGAGC GGTATCCTTG CGAGCGTCTT CCACACGGCG TTCCTACCCA TAGAGGGCGC GAGCTCGGCT TTCGTGCCGG CCCTCGGTGC CAGCGGTGCT ATAAGCGGGG TTCTAGGCGC GTACCTCCTG CTGTTCCCGG GGACCAGGCT CACGATGTGC TTCTTTTACG TATTCATCCC GCTATGCTTC ACCATGAAAG CGGCGGCTTA CCTCGTGTTC TGGTTCGCGC TACAGATACT GCAGGGGTTC CTGGGGGCTA GCCTCGGCGT CGCGGTATTC GCGCACGCAG GGGGGTTTAT AGGCGGGCTC GCGCTACTTC CGGTGCTCGT CAGCGAGGAG AGGATAGGGC TGCTCAGGCT GTACTCTTCG ATGCACTACT TCTTCAGGAA CATATTCTTC ACTGAGAGGG GCTTCACGAG GCTCAGCAAG GCAGTCGTAG CGGTGCTGAT AGGGCTCGTC GCGGCCGCGG CAGTCTACTC GGCTGTAGAG GCGGAGACCA CGGGGGGTGT GAACAAGGTG TTAACGGTGA GCGTCACGGG TAGAGGCGTC AACGATTCTG AAAGCGTGAT AATCCAGCTC CAGCCGGGCG GAGTCCTGGA CGTCACCCCC ATATCGAGTA GCGGCGTGAG GGTCGTTGTA AACCGCCTCA GGGCTGCCAA CCTCCTCTAC AACAGCCAGG CCGCCGGAAA GAGTATCTCC GTGGATAAAA GCCTGAGGGG AGTCGTCAAC GGCTTACCCG TCGAGATAAC GATAAAGGCT AACCTCTCGT TCGACTCCTA CGGGTTGCTA GACAGGGGGC AGGGCCACGT GACCACCGAC GTCCTCGCCT GCGATGCCTA CGGCAGGTGC ACCGTTTCGG GTAAGGGTAG CTACGACTTC TCCGTGAGCA GCGAGGCCAC GATGGCGGGC TTCAAGGGGA TACCCATAGC GGAGCTAGCC CTTCTCTCGC TGGCTTCGAG CCTGGCCGCC GTGGTCAACG TTCTTAGGGC GGAGAGGTAC GCGATAGTAG AGTAG
|
Protein sequence | MGLPFQRDYE DIERRGAPVT LGIILVNVAV YLVSSYENGF LAVSDAWVNA FAFVPAYFAR PEHLYRLFTS MFLHANLAHI FFNMLYLYTF GKSVEAVLGS ERYFLLYFAS GILASVFHTA FLPIEGASSA FVPALGASGA ISGVLGAYLL LFPGTRLTMC FFYVFIPLCF TMKAAAYLVF WFALQILQGF LGASLGVAVF AHAGGFIGGL ALLPVLVSEE RIGLLRLYSS MHYFFRNIFF TERGFTRLSK AVVAVLIGLV AAAAVYSAVE AETTGGVNKV LTVSVTGRGV NDSESVIIQL QPGGVLDVTP ISSSGVRVVV NRLRAANLLY NSQAAGKSIS VDKSLRGVVN GLPVEITIKA NLSFDSYGLL DRGQGHVTTD VLACDAYGRC TVSGKGSYDF SVSSEATMAG FKGIPIAELA LLSLASSLAA VVNVLRAERY AIVE
|
| |