Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1388 |
Symbol | |
ID | 4600671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1342970 |
End bp | 1344304 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639774163 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_920788 |
Protein GI | 119720293 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.196735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACTTTC TAGATGTACG GTTAGGCCGG CTCTTTGAGC TTAGTCGACT TTACGTTTAT ACATATATAA AGCTTTTATA CTCGAATTGG AAAGCTCCTC CAATGCGGAA AGCAGTACTA GTACTACTTG TCTTAGTGCT ACTAATACCG GTACTACAGG TAACCACGGC GCCGGCAACG AATAAGGTAA GGGTTGTCGT TGGATATGAG AACGAGGGTG CCCTGGCTGC GGTCGAGGGG CTACCGGGAG CCGAGAAGGT AAAGGTTCTA CGCGAGATAA AAGCCGCTGT CTTCTACCTG CCACCCGAGG CTATCGAGAA GGCTAAGGGG ATTAAGGGCG TAAGGTACGT CGAGGAGGAC AAGGTCGCGG TAGCCTTGGA GCTCTCGAGC TACCCGGACG TCCTCTGGGA CGTGAAGATG ATTAACGCCA GCAAGGTCTG GGACAAGTAC TACCCGGTGT ACGGCTGGAA GGCGCTCGGA AGGGGAGTGG TAGTCGCGGT TCTGGACACG GGGATAGACT ACACCCACCC CGAGCTTAAA GGCAAAGTTG TGTGGTGCGC GAACACCGTC GGGGTTAAGA CGTACACGGG TACGAAGCTG AGTAACTGCG CCGACAGGAA CGGGCACGGG ACCCACGTCG CCGGCACCAT AGCCTCCGCG ATAAACGGGG TTGGAAACGC GGGCGTCGCG CCGAACGTAA CGCTCTACGC GGTTAAAGTA CTCAACGACG CCGGCTCCGG GACGTACTCC GACATAGCGG AGGGTATAAT CATCGCCGTG AAGGGGCCGG ACGGGGTCGC GGGTACTAGC GACGACGCCA AGATACTCAG CATGTCTCTC GGCGGAAGTA GCGACAGCCA GGTCCTATAC GATGCGGTTA AGTGGGCGTA CAGCAACGGC GCTGTCCTCG TAGCGGCCGC GGGGAACTCG GGCGACGGAG ACCCCACGAC GGACAACGTC GCCTACCCGG CGAGGTACAG CGAGGTCATA GCGGTGGCAG CCGTGGATAG CAACGCCAAC GTGCCCACTT GGAGTAGCGA CGGACCCGAG GTAGACGTAG CGGCGCCCGG TGTAAACGTC TACTCCACGT ACAAGAACGG CGGCTACGCT ACTCTCTCCG GGACCAGCAT GGCGACCCCG CACGTCTCCG CCACGGTTGC CCTCATACAG GCCCTCAGGC TCGCAGCCGG GAAGCAGCCC CTGACGCCGT CCCAGGTCTA CGACGTCCTC ACCAAGACTG CTAAGGACAT AAACTCGCCC GGCTTCGACG TCTTCACGGG CTACGGGCTC GTAGACGCGC TGGCGGCGGT AGACTACGCG CTGAGCCTAC CCTAA
|
Protein sequence | MHFLDVRLGR LFELSRLYVY TYIKLLYSNW KAPPMRKAVL VLLVLVLLIP VLQVTTAPAT NKVRVVVGYE NEGALAAVEG LPGAEKVKVL REIKAAVFYL PPEAIEKAKG IKGVRYVEED KVAVALELSS YPDVLWDVKM INASKVWDKY YPVYGWKALG RGVVVAVLDT GIDYTHPELK GKVVWCANTV GVKTYTGTKL SNCADRNGHG THVAGTIASA INGVGNAGVA PNVTLYAVKV LNDAGSGTYS DIAEGIIIAV KGPDGVAGTS DDAKILSMSL GGSSDSQVLY DAVKWAYSNG AVLVAAAGNS GDGDPTTDNV AYPARYSEVI AVAAVDSNAN VPTWSSDGPE VDVAAPGVNV YSTYKNGGYA TLSGTSMATP HVSATVALIQ ALRLAAGKQP LTPSQVYDVL TKTAKDINSP GFDVFTGYGL VDALAAVDYA LSLP
|
| |