Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1218 |
Symbol | |
ID | 4601630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1155105 |
End bp | 1156223 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639773994 |
Product | peptidase M48, Ste24p |
Protein accession | YP_920619 |
Protein GI | 119720124 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.416464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGAGA AGTTGTTGCT GGTCTTGAGC CTCGCCAGCG TTGCTCTCAT CTTCTCGTTG CTACTGCAGG CACTGGTCCC CGTTTCCGGC TTCTACGGCC CCGTGGGTGC TTGGTGGGCG CTGGACTTCA TCGGCTACGC GCTTGCGATG GCTGTGCTCG TAACGGCTCC GTCCCTCTTC GGCAAGTACC TCACCCCGAC ACCCTCAAAC CTAGGGGTGT TGAGGTCCTC GATGATCGCC ACGATGGCCG GCGTGCTGGG CGGGTTTGGG CTCGTAGCTT TGGGGGTTTC GAGCCTGCTT GGCGCGGAGC TCACAGGCCA GCTTCTCTCC CTGGCTATAG TCTTCGCTCT CGTGCCCTCG CTCTTCTCCT GGCTCTTCTC CCCCCTACTC ATAAACGTCA TGTACGGGTG CAAGCCGGAC CCAGTGCTCC AGGACATAGT CAACAGGGTC GCGCAGAGGG CCGGGATGAA GCCGCCGAAA GCTGTTATCG CCACCCGTAT GCGCGAGCCT AACGCCTTCG CCTATTCCTC CCCGCTGTTC GGGAGCTACG TCGCCGTCAC GGAGGGCATG ATGCGGCTCG CCAAGGGCGA GGAGCTGGAG GCGGTCATAG GGCACGAGCT GGGCCACCAC AAGCACAAGG ACAACACCGT GATGCTGATC TTCGGGCTAA TACCCTCCGT CGTGTACTTC CTAGGGCGCT TCCTGGCGTA CATGGGCTTC TTCTCGTCCG GGGCCAGGTA CGACGGCGAC GGGGAGAGGA GAGGGGGAGG CGGGGGCTTC CTCCTGGTGC TCGTCGGGAT AGCCCTCATG GCTGTAAGCG TGATCATACA GCTGGCCGTG CTTGCCCTCT CGAGACTGAG GGAGCACTAC GCCGACGTCC ACGGCGCCAT GGTGACATCC CCCGACGCCA TGATATCGGC GCTAGCGTCC CTCGACTCAT ACTACGGTAG CCGAGAGGTG GCGAAGAGGA GGGTCGAGGA CAGCAAGCTC AAGATGTTCT TCATCTACGC GCTCGCAGAG CCGCTGGTAA GCCTCGAAGA GCTGCTCGCA ACTCATCCGC CGATAGAGAA GAGGATAGCC TTCCTCGAGG CGCTCAAGCG CACCTCTCTC CGCGCTTAA
|
Protein sequence | MKEKLLLVLS LASVALIFSL LLQALVPVSG FYGPVGAWWA LDFIGYALAM AVLVTAPSLF GKYLTPTPSN LGVLRSSMIA TMAGVLGGFG LVALGVSSLL GAELTGQLLS LAIVFALVPS LFSWLFSPLL INVMYGCKPD PVLQDIVNRV AQRAGMKPPK AVIATRMREP NAFAYSSPLF GSYVAVTEGM MRLAKGEELE AVIGHELGHH KHKDNTVMLI FGLIPSVVYF LGRFLAYMGF FSSGARYDGD GERRGGGGGF LLVLVGIALM AVSVIIQLAV LALSRLREHY ADVHGAMVTS PDAMISALAS LDSYYGSREV AKRRVEDSKL KMFFIYALAE PLVSLEELLA THPPIEKRIA FLEALKRTSL RA
|
| |