Gene Htur_2247 details


Gene Information

Locus tag: Htur_2247
Symbol:
ID: 8742851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 2314214
End bp: 2315824
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 69%
IMG OID: 646512830
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003403800
Protein GI: 284165521
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
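
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2314214
END_BP = 2315824
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))  # 1611
print(coding_codons(GENE_LENGTH_BP))  # 536
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.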


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTCT CGGCACCCGT TCTCGCCAGT GGTGGCGCTC TCGCCGGCGG TGTCGGCGCG 
TCGTCGGTCG ACGGACACGC CGGCAACGAG ACGACGGCCG ACGAGTCGAT CAGAATCGAC
GACTCCCTCG AGTCGTCCGA CGGGACGGTC GAGATCGTGG TTCGACTCGA GGAGCCGGCA
GTTCCCGACG CGGTCCCGAC CGACGATGCC GACGCGCACC TCGCGGACCA CGCCGAAGAG
AGCCAGGAAC CGCTGCTCGA CTACGCCGAC CGAACCGCGG GGATCAGCGT CGAGACCGAG
TTCTGGGTGG CCAACGCCGT GTTGCTCACC GTCGACACCG AGCGGGTCGA CCTCGAGACG
TTCGCTCGGT TCCCCGCGGT CGAGGCGGTC CACGAGAACT TCGAACTCTC CATCCCCGAG
CGGCCGCCGT CGAATGCGAC CGCACTGGGG GCGACGAACG GAGGGGAATC CACGACAACA
GACGTCACCA CGACCGCTAC CGATCCGCAA CCGACCGCTG GACTCGAGTT ACTGAACGCG
CCCGCCGTCT GGGAGGAGTA CGGAACGCGG GGCGAGGGGG TCCGCGTCGC CGTCCTCGAT
ACCGGAATCG ACGCGACGCA CCCGGACCTC GACCTCTACA CCGACGATCC GTCGGATCCG
ACGTACCCGG GCGGCTGGGC CGAGTTCGAC GGCAACGGGA ACCGCATCGA AGGATCGACG
CCCTACGATT CCGGAACGCA CGGCACGCAC GTCAGCGGCA CCATCGCCGG CGGAACCGCG
AGCGGCGCTC GAATCGGCGT CGCTCCGGAG GCGGAGCTGC TCCACGGGCT CGTCCTGCGC
GAGACCAGCG GCTCGTTCGC ACAGATCGTC GCCGGCATGG AGTGGGCGCT CGCGTCCGAG
GCCGACGTAA TCAGTATGAG CCTCGGATCG AACGGCAGAC ACGACGCGTT GATCGATCCG
GTTCGAAACG CCAGGGACAG CGGCGCCGTC GTCGTCGCGG CGGTCGGAAA CGAGGGCGTC
GAGACGTCAA ACTCGCCCGG GAACGTCTAC GACGCCGTCA GCGTCGGCGC CGTCGACGAG
AGCGGTGTCG TCCCCGCGTT TTCCGACGGC GAACGGATCA ACCGATCCGA ATGGCAAACG
TCGCTGCAAT CGTGGCCGTC GTCGTACACC GCTCCCGACG TCGTGGCCCC AGGCGTCCGG
GTTACGAGCA CCGTTCCCGG CGGCTATCAG TCGCTGCCGG GGACGTCGAT GGCGACCCCG
CACGTCTCCG GAGCGGTCGC CTTGCTCCGC TCGATCGATC CGACTGCAAC GCCCGACGAC
CTCAAGGACG CGCTGTACGG GACGGCCTGG ATACCCGAGA CGGCACAGGC ACGGTCGGAG
ACGGAGATCC GCTACGGCCA CGGGATCGTC GACGCTGAGA CGGCGGCGGA CGCGCTCGTC
GCGAGCGATC GGCGCCCCGT CAGAACGACC GCCGGCGAAT CCGCGGAGAC GCCCACCGAT
GAGACGTCGG CGGGGCTCGT TACACACTTC GGCGGTGTGG TGATCGTCGT CGTCACGGTC
GGCCTCTGGA CCCTCCGTTC CGGGTTCTCG TTCCCTCGCG ATGACCCGTG A
 
Protein sequence
MTLSAPVLAS GGALAGGVGA SSVDGHAGNE TTADESIRID DSLESSDGTV EIVVRLEEPA 
VPDAVPTDDA DAHLADHAEE SQEPLLDYAD RTAGISVETE FWVANAVLLT VDTERVDLET
FARFPAVEAV HENFELSIPE RPPSNATALG ATNGGESTTT DVTTTATDPQ PTAGLELLNA
PAVWEEYGTR GEGVRVAVLD TGIDATHPDL DLYTDDPSDP TYPGGWAEFD GNGNRIEGST
PYDSGTHGTH VSGTIAGGTA SGARIGVAPE AELLHGLVLR ETSGSFAQIV AGMEWALASE
ADVISMSLGS NGRHDALIDP VRNARDSGAV VVAAVGNEGV ETSNSPGNVY DAVSVGAVDE
SGVVPAFSDG ERINRSEWQT SLQSWPSSYT APDVVAPGVR VTSTVPGGYQ SLPGTSMATP
HVSGAVALLR SIDPTATPDD LKDALYGTAW IPETAQARSE TEIRYGHGIV DAETAADALV
ASDRRPVRTT AGESAETPTD ETSAGLVTHF GGVVIVVVTV GLWTLRSGFS FPRDDP
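
The sequences above are printed in space-separated blocks of ten residues, so they need to be normalised before any length or composition check. The following is a minimal sketch of one way to do that for the nucleotide sequence; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGACGCTCT CGGCACCCGT TCTCGCCAGT GGTGGCGCTC TCGCCGGCGG TGTCGGCGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the full gene sequence would allow the Gene Length and GC content fields to be compared against the sequence itself.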