Gene Tpen_1714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1714 
Symbol 
ID4601739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1650039 
End bp1653920 
Gene Length3882 bp 
Protein Length1293 aa 
Translation table11 
GC content54% 
IMG OID639774487 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_921112 
Protein GI119720617 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.256155 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAATA AGAAAACAAT CGCAGGACAA AAGCTAGTAG CAGTTGTACT AGCGGCAGTG 
TTGCTGTTCT CTTCTCTTCC AGTAGTAGCA TTTCCACGAG AAACACAGCC TTTAGACAAC
GCGGGTATTC CATCCGGAGT CTTCGTGGGC AACTTTACTG CTGGTGGTGA GCAAGTAAGG
GCATCCACGG TAGTGTTTAA GAAGGGTGTC TCGGAGAACA TAGTTTCCAA GGCTTACGGC
GAAGTGTTGA AAATGTATAC CTTTGAGCGT ACACTTTCTG TTGCTAACGG GAAACAAGTG
AAGGTGAGAG AGGTTAAGGT TTCTAGGCTT GTGATGGACA AGCAGGGCCG CTATATGTTC
AGGGTTGCTG AGAAGCCGGA AGTTGTTAAG CAGGTTCTGC GTGCGTATGG TGACTACGTG
GAGGCTGTTG TTGCGAAACC TGTTCCTCGA ACCTTTGAAA GGTTACCAGA AATCGACAAT
CTGCCTAAGC TGCCAAGTCC TGCACCTACA AACGTAATTA TTGGGCAATT AATAGGTGCA
GCGCAGGTAC GTTCAGTCTA CGGTGTCAAC GGCTCTGGTG TGAACATTGC GATAGTAGAT
ACGGGTGTTG ACTTTGGTCA TCCAGATTTG ACTTCGGCAC TCGCGTACTG GAGTGGCACT
TACAAAGGTG AGTGGGTAGT AGAGCCGCTG GTCTTCGACG CTGATGAGTC TCAAGTGCTT
CTTTTCCAGG ATGTTCAGCT TGTTAACTCT ACTCATGTCT ATGTCGGTGG CAAGAGTTAC
ACCACGCTTA TACCCTGGCC TGTAGATATC TATCCACCCT ACGACTACTA CAGAATCCCA
TTATCTGTGT ATAACATGGT GTCTCAGCCT GGTGGCGGGC TTAGGTTTGG TGTCACGTAT
CTATGGAGGT ATGACGGCAT ACGTATCGTC GGCGTGTTGC TAGTAAAGCC GTACACCTTT
GGTTACTACG CCTACGCACT TATAGACGTT AATAATAACG GCAGGTTTGA CGACGAGGTG
GCACCACCAA TTGATCCCTA TGGGAGGGGT AGTGTTCTGA GGTACTCAGC GAACAGGGTT
ATCGCGCCTG ATTACGACAG AAATGGTTTT CCTGACGACT CTTTCGGCGT TGCCGGCGGC
TTCTTCTATG ATTGGTGGTG GTACTTCAAC TATCCTGCGG AGATTTTCCC TGGCTGGGAT
AGACAGGGCC GTTGGCTTAG CATTTTCTAC GACTTTTACG GTCATGGTAC TTCGTGCGCA
TCGGCAGCGG CGGGACGTGG CGTCGTAGCG TATAACGTAA CTGGGCTGGG CGTGGTCAAG
CTTACCGGCA TAGCGCCCGG AGCCAAGATA GTAGGCGTTA AGGCTCTGTG GATCGGGAAC
GTCGAGGTCG GCATGCTGTG GGCGGCGGGC TTTGATGTTA ACCCATACGA TGGGAAATTC
TACTACACTG GTAGCAGGAG GGCGCATATC ATAAGTAACA GCTGGGGTAC CTCGTACTTC
ACCTACGACG TAGGAGCTTT CGGCTACGAC CTGGAGTCCG TGTTCGTTGC GGGTCTCTCT
ATGCCAGGCT TCCTCGACCC GAGGTACCCC GGTATTTTGA TAGTGCAGGC CGGCGGCAAC
GGTGGCCCTG GCTACGGTAC CATAACGTCC CCCGGCGCGT CTCCCGGAGT ATTAACAGTA
GGTGCCTCCA CCTCGATGCA CTTCGCCTAC GTACTATCGA AGCAGGGTTA CGGCTCTGTC
TTCATGTCTG GAGGGGGCTG GGCTTTCGAC GAGGTTGTGA GCTGGTCGTT GCGCGGGCCC
ACTGTTGCTG GCTACGTTAA GCCGGACGTG GTAAACGTGG GCGCCTTCGG CTTCACAGCG
GCTCCTGTAA CCGTGAACTA CACCATATTC GGCGGTACTA GCTACGCTAC CCCGCTCACA
GCCGGTGTTG CGGCGCTGGT GTACCAGGTT CTGCCCACGG CGGATCCCGA CCTCGTGAAG
AGCATAATAG CGTCTACTGC CCAGGGAGTA GGGTACGATG GTCCGTCCCA AGGCTTCGGC
CGCGTAAACG CATTCTACGC TGTGTCTTTA GCGAGATTGC TTGCTGGTAA AACAGCGGCT
AAGTACGAAA TTCAGTTTAT GAGTAACTCT CTATGGTCTG CCTACTCCTC TAAAGCGGCT
AGTACCTGGT ACTGGCAGTG GTGCGACAAT ATCGCTGCGT ACATGCTTTG GTGGGCTGGT
ACCGAGCTAT CTCTCCCAAG CTGTAGTATG CCGTCTGCGG TAGCATCGAG AATTGGTGGT
TCGCTGTTCT TCGGTGACGT ACCGGTCGGT GGCAACAAAA GTATCACACT GACTGTTAAG
AACCCAACAA ACAAGACTGT TTCCGTCTCC CTGTCTCCAC GGATATTTAA GCTGACGAAT
ACCACGGTGC TGAGCAGGTC TCTTTCGCTC TCGCCTGGGA CAAGCTACAA TAGAACTTAC
TGGGTCTTCA CCGCTGCTAA CCTGACGTCC ACGTTGCGCT TCATGGAGGT CGTGGCTACT
ATCCCGTTCT CCAAGTTCGA CTCGAACTAC AACTATCGGC CGGACGTGAG GGTGAGGGTG
TGGATCCACA TCTGGAGGAG CGATACCAAC GCTAACGGCG TCCCGGATCC AGACGAAATG
GTACTCGTGA ACTACGGCGC CGCCTGGTCT AACTGGAACC TAGCCACGAT GAGCAACCCG
GCGGGTAGGC TAAGCGCGGG AGGCTTCAAG GGGATAGTAG TCACGGTAGA CCTGGTAAGG
GGTCCCGACG CGCCGAGCTA CGTACCACCG ATACCCGTCA CAGTCACAGT TAACTACGTG
GACACGGCTG GCGATAGCTG GGTCTCTGTC TCGCCGAGCA CGGCTACGAT CTCACCTGGT
GGCTCGCAGA CCTTTACGCT AACTCTCAAG GTTCCAAGCA ATGCTGTGCC CACGACGTAT
ATTGGCGAGG TTGTTGCAGT GAACAACGTT ACTGGTCACG CAACGGCGAT ACCGTACTCG
TTCAACGTGT ATACTACGGT TGGAGCTTCC TCTGTAAACC TGGTTACAGG TGTTAACGGC
AGGTGGCCTA GTGCGTTCAG CATTAGGGGT GCTAACGACT GGGGTTGGAG GTACGAGTCG
GGCGACTGGC GCTTGTTCCA CGTTAAGCCC GGCGTGCCGA ACGGTCTAGC TTTCGAGTTC
GAGGCTGGCT GGTCTCTGCC TGATACCTCG CTAATAGCCT ACGCCGTTGG ACCCGACGGC
CAGTTCGCGG GTGCCTACTT CGGTCAGGGT GCTTCGTGGC ATCGGTACTT GGGAGGAGGG
CTTTTCATGT GGTTTGATAC GGGCGCGGGC TCTGTGCAGA ACGCTAAGCG CGTAGTGATA
TTCCCGGCTA TCGACTACCG TACATGGCTG TACCCGCACG GCAAGCCTGA GAGCGGTGTG
TTCACATTTG TAGTGAGGAG CGCGCTGTTT GACGCGTCGG CTGGGGCTTC CGAGATTATC
TCGGCGAAAG CAAGGGTGCT ACAGGCTCAG CAGAAGCTTC CAGCGTCCGT TACTGGTGGT
GGCACGTTCG TGATAAGGTA CTCACTTCCG TACATTGTGA AATATATCTC TGCTGGTGCG
TATAGACCGA TGACTCCGTG GCTCGACTAC GATCAGAGAT ACACGCCGGG GATCACTTCT
ATCTCGCCGT CCTCGGTCTC TGGACCCTAT CCGTCTGGAA CTGTGTTTAC ATTCTCCTTT
ATGGTGCAGA ACTACGGTGC TGAGGGTCAG AAGTTCGATG CCGTTGCTGG CTTCCTTGTA
AGTCTCCCGT CGCTACCGGT GTACTACAGG GACTATGGAG GTAGCTACGT AAAATGGACG
GACTGGTATC TATTAGAGGA CTGGATTAGA GTATCTAAGT AA
 
Protein sequence
MGNKKTIAGQ KLVAVVLAAV LLFSSLPVVA FPRETQPLDN AGIPSGVFVG NFTAGGEQVR 
ASTVVFKKGV SENIVSKAYG EVLKMYTFER TLSVANGKQV KVREVKVSRL VMDKQGRYMF
RVAEKPEVVK QVLRAYGDYV EAVVAKPVPR TFERLPEIDN LPKLPSPAPT NVIIGQLIGA
AQVRSVYGVN GSGVNIAIVD TGVDFGHPDL TSALAYWSGT YKGEWVVEPL VFDADESQVL
LFQDVQLVNS THVYVGGKSY TTLIPWPVDI YPPYDYYRIP LSVYNMVSQP GGGLRFGVTY
LWRYDGIRIV GVLLVKPYTF GYYAYALIDV NNNGRFDDEV APPIDPYGRG SVLRYSANRV
IAPDYDRNGF PDDSFGVAGG FFYDWWWYFN YPAEIFPGWD RQGRWLSIFY DFYGHGTSCA
SAAAGRGVVA YNVTGLGVVK LTGIAPGAKI VGVKALWIGN VEVGMLWAAG FDVNPYDGKF
YYTGSRRAHI ISNSWGTSYF TYDVGAFGYD LESVFVAGLS MPGFLDPRYP GILIVQAGGN
GGPGYGTITS PGASPGVLTV GASTSMHFAY VLSKQGYGSV FMSGGGWAFD EVVSWSLRGP
TVAGYVKPDV VNVGAFGFTA APVTVNYTIF GGTSYATPLT AGVAALVYQV LPTADPDLVK
SIIASTAQGV GYDGPSQGFG RVNAFYAVSL ARLLAGKTAA KYEIQFMSNS LWSAYSSKAA
STWYWQWCDN IAAYMLWWAG TELSLPSCSM PSAVASRIGG SLFFGDVPVG GNKSITLTVK
NPTNKTVSVS LSPRIFKLTN TTVLSRSLSL SPGTSYNRTY WVFTAANLTS TLRFMEVVAT
IPFSKFDSNY NYRPDVRVRV WIHIWRSDTN ANGVPDPDEM VLVNYGAAWS NWNLATMSNP
AGRLSAGGFK GIVVTVDLVR GPDAPSYVPP IPVTVTVNYV DTAGDSWVSV SPSTATISPG
GSQTFTLTLK VPSNAVPTTY IGEVVAVNNV TGHATAIPYS FNVYTTVGAS SVNLVTGVNG
RWPSAFSIRG ANDWGWRYES GDWRLFHVKP GVPNGLAFEF EAGWSLPDTS LIAYAVGPDG
QFAGAYFGQG ASWHRYLGGG LFMWFDTGAG SVQNAKRVVI FPAIDYRTWL YPHGKPESGV
FTFVVRSALF DASAGASEII SAKARVLQAQ QKLPASVTGG GTFVIRYSLP YIVKYISAGA
YRPMTPWLDY DQRYTPGITS ISPSSVSGPY PSGTVFTFSF MVQNYGAEGQ KFDAVAGFLV
SLPSLPVYYR DYGGSYVKWT DWYLLEDWIR VSK