Gene Tpen_1388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1388 
Symbol 
ID4600671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1342970 
End bp1344304 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content59% 
IMG OID639774163 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_920788 
Protein GI119720293 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.196735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTTTC TAGATGTACG GTTAGGCCGG CTCTTTGAGC TTAGTCGACT TTACGTTTAT 
ACATATATAA AGCTTTTATA CTCGAATTGG AAAGCTCCTC CAATGCGGAA AGCAGTACTA
GTACTACTTG TCTTAGTGCT ACTAATACCG GTACTACAGG TAACCACGGC GCCGGCAACG
AATAAGGTAA GGGTTGTCGT TGGATATGAG AACGAGGGTG CCCTGGCTGC GGTCGAGGGG
CTACCGGGAG CCGAGAAGGT AAAGGTTCTA CGCGAGATAA AAGCCGCTGT CTTCTACCTG
CCACCCGAGG CTATCGAGAA GGCTAAGGGG ATTAAGGGCG TAAGGTACGT CGAGGAGGAC
AAGGTCGCGG TAGCCTTGGA GCTCTCGAGC TACCCGGACG TCCTCTGGGA CGTGAAGATG
ATTAACGCCA GCAAGGTCTG GGACAAGTAC TACCCGGTGT ACGGCTGGAA GGCGCTCGGA
AGGGGAGTGG TAGTCGCGGT TCTGGACACG GGGATAGACT ACACCCACCC CGAGCTTAAA
GGCAAAGTTG TGTGGTGCGC GAACACCGTC GGGGTTAAGA CGTACACGGG TACGAAGCTG
AGTAACTGCG CCGACAGGAA CGGGCACGGG ACCCACGTCG CCGGCACCAT AGCCTCCGCG
ATAAACGGGG TTGGAAACGC GGGCGTCGCG CCGAACGTAA CGCTCTACGC GGTTAAAGTA
CTCAACGACG CCGGCTCCGG GACGTACTCC GACATAGCGG AGGGTATAAT CATCGCCGTG
AAGGGGCCGG ACGGGGTCGC GGGTACTAGC GACGACGCCA AGATACTCAG CATGTCTCTC
GGCGGAAGTA GCGACAGCCA GGTCCTATAC GATGCGGTTA AGTGGGCGTA CAGCAACGGC
GCTGTCCTCG TAGCGGCCGC GGGGAACTCG GGCGACGGAG ACCCCACGAC GGACAACGTC
GCCTACCCGG CGAGGTACAG CGAGGTCATA GCGGTGGCAG CCGTGGATAG CAACGCCAAC
GTGCCCACTT GGAGTAGCGA CGGACCCGAG GTAGACGTAG CGGCGCCCGG TGTAAACGTC
TACTCCACGT ACAAGAACGG CGGCTACGCT ACTCTCTCCG GGACCAGCAT GGCGACCCCG
CACGTCTCCG CCACGGTTGC CCTCATACAG GCCCTCAGGC TCGCAGCCGG GAAGCAGCCC
CTGACGCCGT CCCAGGTCTA CGACGTCCTC ACCAAGACTG CTAAGGACAT AAACTCGCCC
GGCTTCGACG TCTTCACGGG CTACGGGCTC GTAGACGCGC TGGCGGCGGT AGACTACGCG
CTGAGCCTAC CCTAA
 
Protein sequence
MHFLDVRLGR LFELSRLYVY TYIKLLYSNW KAPPMRKAVL VLLVLVLLIP VLQVTTAPAT 
NKVRVVVGYE NEGALAAVEG LPGAEKVKVL REIKAAVFYL PPEAIEKAKG IKGVRYVEED
KVAVALELSS YPDVLWDVKM INASKVWDKY YPVYGWKALG RGVVVAVLDT GIDYTHPELK
GKVVWCANTV GVKTYTGTKL SNCADRNGHG THVAGTIASA INGVGNAGVA PNVTLYAVKV
LNDAGSGTYS DIAEGIIIAV KGPDGVAGTS DDAKILSMSL GGSSDSQVLY DAVKWAYSNG
AVLVAAAGNS GDGDPTTDNV AYPARYSEVI AVAAVDSNAN VPTWSSDGPE VDVAAPGVNV
YSTYKNGGYA TLSGTSMATP HVSATVALIQ ALRLAAGKQP LTPSQVYDVL TKTAKDINSP
GFDVFTGYGL VDALAAVDYA LSLP