Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3342 |
Symbol | |
ID | 8743962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3448385 |
End bp | 3451285 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646513925 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_003404879 |
Protein GI | 284166600 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGGAT TTCCGATACG CTGCCGTCGG CTCATCGTAG TCTCTCTGGC GGTGGTTCTT TGTACCTCAC TGTTATTTCC AATCGGTTTG GCGGCCGGAA GCGATCCGCT CGCGGACGCG GCCAAAACGG GTGACACAGT CTCCCAGCAA TCGGCCGACG GTGTGACTGC AGTCGAACGC TCGAAGCCCG CACAGATCGA CCCGACACTC GAGGATGCCG ACGGTATCGT GGAGGTAATT GTCCGACTCG AGAGCGACCG GATGACCGCG ATATCGACCG GTGAAACGAA ACCGGCGGCG TTGCGGGCCG CCGCAGACGA CTCACAGACA TCGCTCGAGC GCGCGGCCGA GACGACTGCG GGACTCGACG TCGAACGACA GTTCTGGCTC GCGAACGCCG CATTGGTGGC AGTCGACACC GATCGCGTCG CTCTAGAGAC GGTCGGTTCG ATCGACGGCG TCGTCGAAAT TCACGCCGAC GCCGCCGTGG AACTCGCAGC GGGGGCGACA GCGTCCAACC CGAACGCCTC GACAGTCGGT CCGCGATCCG GACCGACTGC TAACAGATCG ACGACGAACG GGTCGTCAAT GACCGCAGCG ACGGGGTTTG GTAGTGCGTA TACTTACGGC CTCGAGCAAC TCTCCGTGCC TGCAGCCCAA GAGAAATACG GCGCCCGCGG CAACGGAGCG ACGGTCGCAG TCCTCGATAC GGGCGCCGAC GACTCACATC CCGACGTGAC GGTCGATGCG TGGCGGGATT TCTCTGGCAA ATCGTCGACG CCGATGGATT ACAACGGCCA CGGGACCCAC GTCGCCGGGA CAGTCGTCGG CGGCGACGCG AGCGGGACGC AGATCGGCGT CGCACCCGAG GCGAACCTGC TCGCCGGCGC CGTTCTGACT GACTGTACCG ACGGGAGTTG CGTCGGCCGG ACGTCGGACG TGATCTCCGG CATGCAGTGG GCTGTCGATA ACGGCGCCGA CGTTATCAGC TTGAGTCTCG GTTCCGAGGG GTACACCACC TCTTATATCA GCGCCGTTCG GAACGCGGAA GCCAGCGGTA CCGCCGTCGT CGCTAGCGCC GGAAACGGCG GCGACGGCGT CTCCTCGTCG CCGGGAAACG TCTACGACGC TATCAGTGCC GGGGCCACCG ACGAGAGGAA GCGCGTCGCT GACTTCTCGA GCGGTGAGGT CATCGATACT CGCGACGCGT GGGGGTGGCG CGCGCCCGAC GAGTGGCCCA GCAGCTACGT CGTTCCGACG GTGACGGCGC CCGGCGAGCG CGTCCTCAGC GCGTCCTCGA ACGGCGGCTA CGTTCGCAAG AGCGGAACTA GTATGGCCAC ACCCCACGTC GCGGGCGTGG TCGCGCTGTT GCAGGGGGCG ACCGACCGAC ACCTCGAGCC CGACGAGATC GAGGCGGCGC TGACGGAAAC GGCCGCGAAG CCGGCCGGAG AACCCGAGGA GCAGGACACC AGATACGGTC ACGGGATCAT CGACGCGGTC GCCGCCCTCG AGGCGGCCGG GTCGTTCGCG ACCGTCGAAG GGACGGTGAC CGATACGGTG ACGGACAAGC CGATCGCGGA TGCGACCGTC ACTCTCGAGG GCGACGACGG AACCGTTTCC GAGACGACGA CCGACCTTTA CGGGCGGTAC GAACTCAAAG GCGTCACCGG CGACCGCGAG TACGCCCTCA CTATCGCTGC CGACGGGTAC GAGACGAGCA GCGAGACACG GTTCGTGCCG GCCGACGAGA CGACGACGGT CGACGTCTCG CTCGCCGGTG ACGGGGAACT CGAGGTGATC CTCACGGACG ACCAGTTCGG CGACGGGATC GCGAACGGAA CCGTAACGGC GACGACCTGG TACGGCACGT ATCCGGCGAG CCACGAGGGC GACGGGAGCT ACGTCGTCAG GGACGTCCCC ACTCGAGGCG AGTATACGCT GACCGCGGCC GCACCGGGAT ACGACGACCG AGAGCGCGAC GTAACGGTGA CGAAGTCAGG GACGCACGTC ACTGAGCGGT TCGAGCTCAC GGGCGACGCG ACGCTCGAGA TCGCCGCCGA AGACGCGGTG ACGGGGACAC CGATCTCGAA CGCGACCGTC GTTATCGAAC GCTCGGACGG CGCTTCGTTC GAAGCCGCCG ACCCGACGGA CGGTGCCGGG ACGGTCGCGG TCACGATACC GGGAACCGAC GAGGAGTACA CCGTCTGCGT CGACGCAGCG GGATACGAGA CGGGAACTGA GTCACGAATC GTCTCGAGCG GGGATGACAC GGACGTCGGT CTCGCACTCG AGGGAGACGG CGTTCTCGAG GTGATCCTCG AGGACGCGCA GTTCGGTGAC GGCATCGCGG ACGCGACCGT CGACGCGATC GGCCGACAGG GGACGTATTC AGGCGTTCAC ACGAAGCACG GAACGTACCG CATCGAATCC GTTCCCGGCG GTGACGAGTA CGCGGTGAAC GTGTCCGCGG CGGGCTACGT CGACGAGACG CTCTCGATGG AAATCGATTC GAACCGAACG GCGCGTGAAC GGGCGGTTCT CGAGGGCGAC GCGACGCTGT CGGTGACCGT CACCGACGAG GACGGCGATC CGATCGACGG CGCGACCGTT ACGATCGAAC GCCCGGGCGG AACCTCGTTC GCGGTCGCCA ACGAGACGGA TTCGGACGGC ACACTCGAGA CGACCGTGTC CGGAACCGGT GTAGGATATG CAGTCGAAGT CGGTGCGGAG GGATACGAGT CGGAGCGCGT GACGACGGAG GAGATCTCGA GCGGAGCGAC CGAGTCCGTC ACCGTCACGA TGACGGCGGC CGACAACGGT GTCCCTGGAT TTGGAATCGC AGTCGGCGTG ATTGCCTTAG TGACGGCGCT CGTCGTCGGT ATCTCGCGTC GGACACCATA G
|
Protein sequence | MSGFPIRCRR LIVVSLAVVL CTSLLFPIGL AAGSDPLADA AKTGDTVSQQ SADGVTAVER SKPAQIDPTL EDADGIVEVI VRLESDRMTA ISTGETKPAA LRAAADDSQT SLERAAETTA GLDVERQFWL ANAALVAVDT DRVALETVGS IDGVVEIHAD AAVELAAGAT ASNPNASTVG PRSGPTANRS TTNGSSMTAA TGFGSAYTYG LEQLSVPAAQ EKYGARGNGA TVAVLDTGAD DSHPDVTVDA WRDFSGKSST PMDYNGHGTH VAGTVVGGDA SGTQIGVAPE ANLLAGAVLT DCTDGSCVGR TSDVISGMQW AVDNGADVIS LSLGSEGYTT SYISAVRNAE ASGTAVVASA GNGGDGVSSS PGNVYDAISA GATDERKRVA DFSSGEVIDT RDAWGWRAPD EWPSSYVVPT VTAPGERVLS ASSNGGYVRK SGTSMATPHV AGVVALLQGA TDRHLEPDEI EAALTETAAK PAGEPEEQDT RYGHGIIDAV AALEAAGSFA TVEGTVTDTV TDKPIADATV TLEGDDGTVS ETTTDLYGRY ELKGVTGDRE YALTIAADGY ETSSETRFVP ADETTTVDVS LAGDGELEVI LTDDQFGDGI ANGTVTATTW YGTYPASHEG DGSYVVRDVP TRGEYTLTAA APGYDDRERD VTVTKSGTHV TERFELTGDA TLEIAAEDAV TGTPISNATV VIERSDGASF EAADPTDGAG TVAVTIPGTD EEYTVCVDAA GYETGTESRI VSSGDDTDVG LALEGDGVLE VILEDAQFGD GIADATVDAI GRQGTYSGVH TKHGTYRIES VPGGDEYAVN VSAAGYVDET LSMEIDSNRT ARERAVLEGD ATLSVTVTDE DGDPIDGATV TIERPGGTSF AVANETDSDG TLETTVSGTG VGYAVEVGAE GYESERVTTE EISSGATESV TVTMTAADNG VPGFGIAVGV IALVTALVVG ISRRTP
|
| |