Gene Information |
Locus tag | Tpen_1714 |
Symbol | |
ID | 4601739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1650039 |
End bp | 1653920 |
Gene Length | 3882 bp |
Protein Length | 1293 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639774487 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_921112 |
Protein GI | 119720617 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
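The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of the source database, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a single terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes nothing.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1650039, 1653920)
print(length_bp)                           # 3882
print(expected_protein_length(length_bp))  # 1293
```

Both results agree with the Gene Length (3882 bp) and Protein Length (1293 aa) fields. The strand does not affect this check, since it only determines which strand is read, not the span length.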
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.256155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAATA AGAAAACAAT CGCAGGACAA AAGCTAGTAG CAGTTGTACT AGCGGCAGTG TTGCTGTTCT CTTCTCTTCC AGTAGTAGCA TTTCCACGAG AAACACAGCC TTTAGACAAC GCGGGTATTC CATCCGGAGT CTTCGTGGGC AACTTTACTG CTGGTGGTGA GCAAGTAAGG GCATCCACGG TAGTGTTTAA GAAGGGTGTC TCGGAGAACA TAGTTTCCAA GGCTTACGGC GAAGTGTTGA AAATGTATAC CTTTGAGCGT ACACTTTCTG TTGCTAACGG GAAACAAGTG AAGGTGAGAG AGGTTAAGGT TTCTAGGCTT GTGATGGACA AGCAGGGCCG CTATATGTTC AGGGTTGCTG AGAAGCCGGA AGTTGTTAAG CAGGTTCTGC GTGCGTATGG TGACTACGTG GAGGCTGTTG TTGCGAAACC TGTTCCTCGA ACCTTTGAAA GGTTACCAGA AATCGACAAT CTGCCTAAGC TGCCAAGTCC TGCACCTACA AACGTAATTA TTGGGCAATT AATAGGTGCA GCGCAGGTAC GTTCAGTCTA CGGTGTCAAC GGCTCTGGTG TGAACATTGC GATAGTAGAT ACGGGTGTTG ACTTTGGTCA TCCAGATTTG ACTTCGGCAC TCGCGTACTG GAGTGGCACT TACAAAGGTG AGTGGGTAGT AGAGCCGCTG GTCTTCGACG CTGATGAGTC TCAAGTGCTT CTTTTCCAGG ATGTTCAGCT TGTTAACTCT ACTCATGTCT ATGTCGGTGG CAAGAGTTAC ACCACGCTTA TACCCTGGCC TGTAGATATC TATCCACCCT ACGACTACTA CAGAATCCCA TTATCTGTGT ATAACATGGT GTCTCAGCCT GGTGGCGGGC TTAGGTTTGG TGTCACGTAT CTATGGAGGT ATGACGGCAT ACGTATCGTC GGCGTGTTGC TAGTAAAGCC GTACACCTTT GGTTACTACG CCTACGCACT TATAGACGTT AATAATAACG GCAGGTTTGA CGACGAGGTG GCACCACCAA TTGATCCCTA TGGGAGGGGT AGTGTTCTGA GGTACTCAGC GAACAGGGTT ATCGCGCCTG ATTACGACAG AAATGGTTTT CCTGACGACT CTTTCGGCGT TGCCGGCGGC TTCTTCTATG ATTGGTGGTG GTACTTCAAC TATCCTGCGG AGATTTTCCC TGGCTGGGAT AGACAGGGCC GTTGGCTTAG CATTTTCTAC GACTTTTACG GTCATGGTAC TTCGTGCGCA TCGGCAGCGG CGGGACGTGG CGTCGTAGCG TATAACGTAA CTGGGCTGGG CGTGGTCAAG CTTACCGGCA TAGCGCCCGG AGCCAAGATA GTAGGCGTTA AGGCTCTGTG GATCGGGAAC GTCGAGGTCG GCATGCTGTG GGCGGCGGGC TTTGATGTTA ACCCATACGA TGGGAAATTC TACTACACTG GTAGCAGGAG GGCGCATATC ATAAGTAACA GCTGGGGTAC CTCGTACTTC ACCTACGACG TAGGAGCTTT CGGCTACGAC CTGGAGTCCG TGTTCGTTGC GGGTCTCTCT ATGCCAGGCT TCCTCGACCC GAGGTACCCC GGTATTTTGA TAGTGCAGGC CGGCGGCAAC GGTGGCCCTG GCTACGGTAC CATAACGTCC CCCGGCGCGT CTCCCGGAGT ATTAACAGTA GGTGCCTCCA CCTCGATGCA CTTCGCCTAC GTACTATCGA AGCAGGGTTA CGGCTCTGTC TTCATGTCTG GAGGGGGCTG GGCTTTCGAC GAGGTTGTGA GCTGGTCGTT GCGCGGGCCC 
ACTGTTGCTG GCTACGTTAA GCCGGACGTG GTAAACGTGG GCGCCTTCGG CTTCACAGCG GCTCCTGTAA CCGTGAACTA CACCATATTC GGCGGTACTA GCTACGCTAC CCCGCTCACA GCCGGTGTTG CGGCGCTGGT GTACCAGGTT CTGCCCACGG CGGATCCCGA CCTCGTGAAG AGCATAATAG CGTCTACTGC CCAGGGAGTA GGGTACGATG GTCCGTCCCA AGGCTTCGGC CGCGTAAACG CATTCTACGC TGTGTCTTTA GCGAGATTGC TTGCTGGTAA AACAGCGGCT AAGTACGAAA TTCAGTTTAT GAGTAACTCT CTATGGTCTG CCTACTCCTC TAAAGCGGCT AGTACCTGGT ACTGGCAGTG GTGCGACAAT ATCGCTGCGT ACATGCTTTG GTGGGCTGGT ACCGAGCTAT CTCTCCCAAG CTGTAGTATG CCGTCTGCGG TAGCATCGAG AATTGGTGGT TCGCTGTTCT TCGGTGACGT ACCGGTCGGT GGCAACAAAA GTATCACACT GACTGTTAAG AACCCAACAA ACAAGACTGT TTCCGTCTCC CTGTCTCCAC GGATATTTAA GCTGACGAAT ACCACGGTGC TGAGCAGGTC TCTTTCGCTC TCGCCTGGGA CAAGCTACAA TAGAACTTAC TGGGTCTTCA CCGCTGCTAA CCTGACGTCC ACGTTGCGCT TCATGGAGGT CGTGGCTACT ATCCCGTTCT CCAAGTTCGA CTCGAACTAC AACTATCGGC CGGACGTGAG GGTGAGGGTG TGGATCCACA TCTGGAGGAG CGATACCAAC GCTAACGGCG TCCCGGATCC AGACGAAATG GTACTCGTGA ACTACGGCGC CGCCTGGTCT AACTGGAACC TAGCCACGAT GAGCAACCCG GCGGGTAGGC TAAGCGCGGG AGGCTTCAAG GGGATAGTAG TCACGGTAGA CCTGGTAAGG GGTCCCGACG CGCCGAGCTA CGTACCACCG ATACCCGTCA CAGTCACAGT TAACTACGTG GACACGGCTG GCGATAGCTG GGTCTCTGTC TCGCCGAGCA CGGCTACGAT CTCACCTGGT GGCTCGCAGA CCTTTACGCT AACTCTCAAG GTTCCAAGCA ATGCTGTGCC CACGACGTAT ATTGGCGAGG TTGTTGCAGT GAACAACGTT ACTGGTCACG CAACGGCGAT ACCGTACTCG TTCAACGTGT ATACTACGGT TGGAGCTTCC TCTGTAAACC TGGTTACAGG TGTTAACGGC AGGTGGCCTA GTGCGTTCAG CATTAGGGGT GCTAACGACT GGGGTTGGAG GTACGAGTCG GGCGACTGGC GCTTGTTCCA CGTTAAGCCC GGCGTGCCGA ACGGTCTAGC TTTCGAGTTC GAGGCTGGCT GGTCTCTGCC TGATACCTCG CTAATAGCCT ACGCCGTTGG ACCCGACGGC CAGTTCGCGG GTGCCTACTT CGGTCAGGGT GCTTCGTGGC ATCGGTACTT GGGAGGAGGG CTTTTCATGT GGTTTGATAC GGGCGCGGGC TCTGTGCAGA ACGCTAAGCG CGTAGTGATA TTCCCGGCTA TCGACTACCG TACATGGCTG TACCCGCACG GCAAGCCTGA GAGCGGTGTG TTCACATTTG TAGTGAGGAG CGCGCTGTTT GACGCGTCGG CTGGGGCTTC CGAGATTATC TCGGCGAAAG CAAGGGTGCT ACAGGCTCAG CAGAAGCTTC CAGCGTCCGT TACTGGTGGT GGCACGTTCG TGATAAGGTA CTCACTTCCG TACATTGTGA AATATATCTC TGCTGGTGCG TATAGACCGA 
TGACTCCGTG GCTCGACTAC GATCAGAGAT ACACGCCGGG GATCACTTCT ATCTCGCCGT CCTCGGTCTC TGGACCCTAT CCGTCTGGAA CTGTGTTTAC ATTCTCCTTT ATGGTGCAGA ACTACGGTGC TGAGGGTCAG AAGTTCGATG CCGTTGCTGG CTTCCTTGTA AGTCTCCCGT CGCTACCGGT GTACTACAGG GACTATGGAG GTAGCTACGT AAAATGGACG GACTGGTATC TATTAGAGGA CTGGATTAGA GTATCTAAGT AA
|
Protein sequence | MGNKKTIAGQ KLVAVVLAAV LLFSSLPVVA FPRETQPLDN AGIPSGVFVG NFTAGGEQVR ASTVVFKKGV SENIVSKAYG EVLKMYTFER TLSVANGKQV KVREVKVSRL VMDKQGRYMF RVAEKPEVVK QVLRAYGDYV EAVVAKPVPR TFERLPEIDN LPKLPSPAPT NVIIGQLIGA AQVRSVYGVN GSGVNIAIVD TGVDFGHPDL TSALAYWSGT YKGEWVVEPL VFDADESQVL LFQDVQLVNS THVYVGGKSY TTLIPWPVDI YPPYDYYRIP LSVYNMVSQP GGGLRFGVTY LWRYDGIRIV GVLLVKPYTF GYYAYALIDV NNNGRFDDEV APPIDPYGRG SVLRYSANRV IAPDYDRNGF PDDSFGVAGG FFYDWWWYFN YPAEIFPGWD RQGRWLSIFY DFYGHGTSCA SAAAGRGVVA YNVTGLGVVK LTGIAPGAKI VGVKALWIGN VEVGMLWAAG FDVNPYDGKF YYTGSRRAHI ISNSWGTSYF TYDVGAFGYD LESVFVAGLS MPGFLDPRYP GILIVQAGGN GGPGYGTITS PGASPGVLTV GASTSMHFAY VLSKQGYGSV FMSGGGWAFD EVVSWSLRGP TVAGYVKPDV VNVGAFGFTA APVTVNYTIF GGTSYATPLT AGVAALVYQV LPTADPDLVK SIIASTAQGV GYDGPSQGFG RVNAFYAVSL ARLLAGKTAA KYEIQFMSNS LWSAYSSKAA STWYWQWCDN IAAYMLWWAG TELSLPSCSM PSAVASRIGG SLFFGDVPVG GNKSITLTVK NPTNKTVSVS LSPRIFKLTN TTVLSRSLSL SPGTSYNRTY WVFTAANLTS TLRFMEVVAT IPFSKFDSNY NYRPDVRVRV WIHIWRSDTN ANGVPDPDEM VLVNYGAAWS NWNLATMSNP AGRLSAGGFK GIVVTVDLVR GPDAPSYVPP IPVTVTVNYV DTAGDSWVSV SPSTATISPG GSQTFTLTLK VPSNAVPTTY IGEVVAVNNV TGHATAIPYS FNVYTTVGAS SVNLVTGVNG RWPSAFSIRG ANDWGWRYES GDWRLFHVKP GVPNGLAFEF EAGWSLPDTS LIAYAVGPDG QFAGAYFGQG ASWHRYLGGG LFMWFDTGAG SVQNAKRVVI FPAIDYRTWL YPHGKPESGV FTFVVRSALF DASAGASEII SAKARVLQAQ QKLPASVTGG GTFVIRYSLP YIVKYISAGA YRPMTPWLDY DQRYTPGITS ISPSSVSGPY PSGTVFTFSF MVQNYGAEGQ KFDAVAGFLV SLPSLPVYYR DYGGSYVKWT DWYLLEDWIR VSK
|
| |
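The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of that step, plus a GC-percentage calculation that could be compared against the GC content field. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First two ten-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGGGGAATA AGAAAACAAT"
dna = clean_sequence(first_blocks)
print(dna)       # ATGGGGAATAAGAAAACAAT
print(len(dna))  # 20
```

Applied to the complete Gene sequence and Protein sequence fields, the cleaned strings would be expected to contain 3882 bases and 1293 residues respectively, if the record is internally consistent.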