Gene Htur_3342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3342 
Symbol 
ID8743962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3448385 
End bp3451285 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content66% 
IMG OID646513925 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003404879 
Protein GI284166600 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGGAT TTCCGATACG CTGCCGTCGG CTCATCGTAG TCTCTCTGGC GGTGGTTCTT 
TGTACCTCAC TGTTATTTCC AATCGGTTTG GCGGCCGGAA GCGATCCGCT CGCGGACGCG
GCCAAAACGG GTGACACAGT CTCCCAGCAA TCGGCCGACG GTGTGACTGC AGTCGAACGC
TCGAAGCCCG CACAGATCGA CCCGACACTC GAGGATGCCG ACGGTATCGT GGAGGTAATT
GTCCGACTCG AGAGCGACCG GATGACCGCG ATATCGACCG GTGAAACGAA ACCGGCGGCG
TTGCGGGCCG CCGCAGACGA CTCACAGACA TCGCTCGAGC GCGCGGCCGA GACGACTGCG
GGACTCGACG TCGAACGACA GTTCTGGCTC GCGAACGCCG CATTGGTGGC AGTCGACACC
GATCGCGTCG CTCTAGAGAC GGTCGGTTCG ATCGACGGCG TCGTCGAAAT TCACGCCGAC
GCCGCCGTGG AACTCGCAGC GGGGGCGACA GCGTCCAACC CGAACGCCTC GACAGTCGGT
CCGCGATCCG GACCGACTGC TAACAGATCG ACGACGAACG GGTCGTCAAT GACCGCAGCG
ACGGGGTTTG GTAGTGCGTA TACTTACGGC CTCGAGCAAC TCTCCGTGCC TGCAGCCCAA
GAGAAATACG GCGCCCGCGG CAACGGAGCG ACGGTCGCAG TCCTCGATAC GGGCGCCGAC
GACTCACATC CCGACGTGAC GGTCGATGCG TGGCGGGATT TCTCTGGCAA ATCGTCGACG
CCGATGGATT ACAACGGCCA CGGGACCCAC GTCGCCGGGA CAGTCGTCGG CGGCGACGCG
AGCGGGACGC AGATCGGCGT CGCACCCGAG GCGAACCTGC TCGCCGGCGC CGTTCTGACT
GACTGTACCG ACGGGAGTTG CGTCGGCCGG ACGTCGGACG TGATCTCCGG CATGCAGTGG
GCTGTCGATA ACGGCGCCGA CGTTATCAGC TTGAGTCTCG GTTCCGAGGG GTACACCACC
TCTTATATCA GCGCCGTTCG GAACGCGGAA GCCAGCGGTA CCGCCGTCGT CGCTAGCGCC
GGAAACGGCG GCGACGGCGT CTCCTCGTCG CCGGGAAACG TCTACGACGC TATCAGTGCC
GGGGCCACCG ACGAGAGGAA GCGCGTCGCT GACTTCTCGA GCGGTGAGGT CATCGATACT
CGCGACGCGT GGGGGTGGCG CGCGCCCGAC GAGTGGCCCA GCAGCTACGT CGTTCCGACG
GTGACGGCGC CCGGCGAGCG CGTCCTCAGC GCGTCCTCGA ACGGCGGCTA CGTTCGCAAG
AGCGGAACTA GTATGGCCAC ACCCCACGTC GCGGGCGTGG TCGCGCTGTT GCAGGGGGCG
ACCGACCGAC ACCTCGAGCC CGACGAGATC GAGGCGGCGC TGACGGAAAC GGCCGCGAAG
CCGGCCGGAG AACCCGAGGA GCAGGACACC AGATACGGTC ACGGGATCAT CGACGCGGTC
GCCGCCCTCG AGGCGGCCGG GTCGTTCGCG ACCGTCGAAG GGACGGTGAC CGATACGGTG
ACGGACAAGC CGATCGCGGA TGCGACCGTC ACTCTCGAGG GCGACGACGG AACCGTTTCC
GAGACGACGA CCGACCTTTA CGGGCGGTAC GAACTCAAAG GCGTCACCGG CGACCGCGAG
TACGCCCTCA CTATCGCTGC CGACGGGTAC GAGACGAGCA GCGAGACACG GTTCGTGCCG
GCCGACGAGA CGACGACGGT CGACGTCTCG CTCGCCGGTG ACGGGGAACT CGAGGTGATC
CTCACGGACG ACCAGTTCGG CGACGGGATC GCGAACGGAA CCGTAACGGC GACGACCTGG
TACGGCACGT ATCCGGCGAG CCACGAGGGC GACGGGAGCT ACGTCGTCAG GGACGTCCCC
ACTCGAGGCG AGTATACGCT GACCGCGGCC GCACCGGGAT ACGACGACCG AGAGCGCGAC
GTAACGGTGA CGAAGTCAGG GACGCACGTC ACTGAGCGGT TCGAGCTCAC GGGCGACGCG
ACGCTCGAGA TCGCCGCCGA AGACGCGGTG ACGGGGACAC CGATCTCGAA CGCGACCGTC
GTTATCGAAC GCTCGGACGG CGCTTCGTTC GAAGCCGCCG ACCCGACGGA CGGTGCCGGG
ACGGTCGCGG TCACGATACC GGGAACCGAC GAGGAGTACA CCGTCTGCGT CGACGCAGCG
GGATACGAGA CGGGAACTGA GTCACGAATC GTCTCGAGCG GGGATGACAC GGACGTCGGT
CTCGCACTCG AGGGAGACGG CGTTCTCGAG GTGATCCTCG AGGACGCGCA GTTCGGTGAC
GGCATCGCGG ACGCGACCGT CGACGCGATC GGCCGACAGG GGACGTATTC AGGCGTTCAC
ACGAAGCACG GAACGTACCG CATCGAATCC GTTCCCGGCG GTGACGAGTA CGCGGTGAAC
GTGTCCGCGG CGGGCTACGT CGACGAGACG CTCTCGATGG AAATCGATTC GAACCGAACG
GCGCGTGAAC GGGCGGTTCT CGAGGGCGAC GCGACGCTGT CGGTGACCGT CACCGACGAG
GACGGCGATC CGATCGACGG CGCGACCGTT ACGATCGAAC GCCCGGGCGG AACCTCGTTC
GCGGTCGCCA ACGAGACGGA TTCGGACGGC ACACTCGAGA CGACCGTGTC CGGAACCGGT
GTAGGATATG CAGTCGAAGT CGGTGCGGAG GGATACGAGT CGGAGCGCGT GACGACGGAG
GAGATCTCGA GCGGAGCGAC CGAGTCCGTC ACCGTCACGA TGACGGCGGC CGACAACGGT
GTCCCTGGAT TTGGAATCGC AGTCGGCGTG ATTGCCTTAG TGACGGCGCT CGTCGTCGGT
ATCTCGCGTC GGACACCATA G
 
Protein sequence
MSGFPIRCRR LIVVSLAVVL CTSLLFPIGL AAGSDPLADA AKTGDTVSQQ SADGVTAVER 
SKPAQIDPTL EDADGIVEVI VRLESDRMTA ISTGETKPAA LRAAADDSQT SLERAAETTA
GLDVERQFWL ANAALVAVDT DRVALETVGS IDGVVEIHAD AAVELAAGAT ASNPNASTVG
PRSGPTANRS TTNGSSMTAA TGFGSAYTYG LEQLSVPAAQ EKYGARGNGA TVAVLDTGAD
DSHPDVTVDA WRDFSGKSST PMDYNGHGTH VAGTVVGGDA SGTQIGVAPE ANLLAGAVLT
DCTDGSCVGR TSDVISGMQW AVDNGADVIS LSLGSEGYTT SYISAVRNAE ASGTAVVASA
GNGGDGVSSS PGNVYDAISA GATDERKRVA DFSSGEVIDT RDAWGWRAPD EWPSSYVVPT
VTAPGERVLS ASSNGGYVRK SGTSMATPHV AGVVALLQGA TDRHLEPDEI EAALTETAAK
PAGEPEEQDT RYGHGIIDAV AALEAAGSFA TVEGTVTDTV TDKPIADATV TLEGDDGTVS
ETTTDLYGRY ELKGVTGDRE YALTIAADGY ETSSETRFVP ADETTTVDVS LAGDGELEVI
LTDDQFGDGI ANGTVTATTW YGTYPASHEG DGSYVVRDVP TRGEYTLTAA APGYDDRERD
VTVTKSGTHV TERFELTGDA TLEIAAEDAV TGTPISNATV VIERSDGASF EAADPTDGAG
TVAVTIPGTD EEYTVCVDAA GYETGTESRI VSSGDDTDVG LALEGDGVLE VILEDAQFGD
GIADATVDAI GRQGTYSGVH TKHGTYRIES VPGGDEYAVN VSAAGYVDET LSMEIDSNRT
ARERAVLEGD ATLSVTVTDE DGDPIDGATV TIERPGGTSF AVANETDSDG TLETTVSGTG
VGYAVEVGAE GYESERVTTE EISSGATESV TVTMTAADNG VPGFGIAVGV IALVTALVVG
ISRRTP