Gene Htur_5018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5018 
Symbol 
ID8745824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013748 
Strand
Start bp8968 
End bp10521 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content66% 
IMG OID646515632 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003406579 
Protein GI284176303 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value0.317126 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAGGA GAGACTTTCT GACGACGGCA GGAGCAACGG CAGGCACACT CTCGACCCCA 
GCGTTCGTCG GCCCGTTCGA CGGCCGGATC GGCTCGAGTG ACACACTCAC ACTGGCTGCG
CTCGACGCGG AGAACATCAA GACGTTCGGC GATTTGAACC CGTCGTTCGT GTTCTTCTAC
AGCGACGACA GCGCCCGATC ATCGTTGCAG TCGTGGGTCG ACTCGAGCGA CGACCGCGTA
CTCAAACGCG ACCATCCGAC GGTCGGTATG ATGACGATCT CGATGCCCTG GTCGGAGGTC
GGCCTCAAGC AGTACTCCGC TGGCGTCGGA GACTACGATG TCGGCCTCGA GCGCATCGAC
GGCGGTGCGC AAGCCCTCGA GTACGTCGAT ACGATCGACG CGAACATGGT CATGTCCCGA
CCGGAACCGC TCGAGGAACT CGAGGGGACC GGCGTGGCCT CTCTTGACCT CGGATTCCGC
GAGAAGGTCT CGATGCTCGC CGAAACGGGG GCGTGGAATC CCACGCCCGA GACGGCCGGA
CTCGCGTTCG ACGAGGACGC GCCGGAAGCG ACACTGTCCG AGTCACGCGC TCACGTGCGC
GCGGACGATA CCGTTCTCTC GGGTGTCGAC ACCTCGAGCC TGACCGTCGC CGTCATCGAC
ACCGGTGTCA ACACGGGGCT CAACCTCGAC GAAAGCCGCC TCCACGGCGA GTCGACAGGC
TACGCGAAGG ATGGCGACCC GACATACAGC GAGGAGGGCA CCGACGCGAT CACCGACGGC
GACGGACACG GAACGTGGGT CGCCCACTGT ATCGCGGGTT CGAACGGCTT CGCTCCCGAT
GCGACCGTAC TGGGGCTCAA GGTGCTGGGC GACGACGGGA GCGGCGACAC GAAAGACATC
ATCGCCGGCA TCGAGAAGGC GATCGACGTC GGCGCCGACG TCGCCTGCCT CTCCCTTGGC
AGCCCGCAGT GGTCCGAGTC GCTTGCGGCG GCGCTCAACG ACGCTCGAGA GGCCGGTGTG
TTCTGTGCGG TGGCAGTGGG CAATGATCGC TATGGAACCG TGTGGGTCGC CAGCCCGGCA
GACGCTGACG GCGGCTTCGG CGTCCAGGCG TGTAACGTCC CCGAGTCGGG CGACCGCGAC
GATACGGAAC TCGCGTACTT CGGGAACACC GGACCGGACC CCGGCAGCAC CGACCTCTCC
GGTGGCGACT CGGAGGGTGC GGTTCCACTG CTGGCCGCGC CGGGGATGTC GATTACAATC
GAACTCCCCA GTGGCCTGAG CACGCTCTCG GGAACCTCGA TGGCCGCACC CCACGTGGCT
GGCGGCGCGG CCGTTTCCCG AGCAGCGGGC TACGGCGTCG ACGAGACGTG GTCGCGACTC
ATCAAGTACG CTTACCCGCT CCCGAACGCG GGCGCAACCG AGGCGAAACA CGGTCTGCTC
GACGTTCAGG CGCTACTCGA GGGAACCGAA CCAGCCGACG ATCCGGCCGA CGTGCGGACG
GTTGAAGCCG CCGCACGCGA CGACTTCAAC GAATCGCTGT CGACGGTCCT ATAG
 
Protein sequence
MFRRDFLTTA GATAGTLSTP AFVGPFDGRI GSSDTLTLAA LDAENIKTFG DLNPSFVFFY 
SDDSARSSLQ SWVDSSDDRV LKRDHPTVGM MTISMPWSEV GLKQYSAGVG DYDVGLERID
GGAQALEYVD TIDANMVMSR PEPLEELEGT GVASLDLGFR EKVSMLAETG AWNPTPETAG
LAFDEDAPEA TLSESRAHVR ADDTVLSGVD TSSLTVAVID TGVNTGLNLD ESRLHGESTG
YAKDGDPTYS EEGTDAITDG DGHGTWVAHC IAGSNGFAPD ATVLGLKVLG DDGSGDTKDI
IAGIEKAIDV GADVACLSLG SPQWSESLAA ALNDAREAGV FCAVAVGNDR YGTVWVASPA
DADGGFGVQA CNVPESGDRD DTELAYFGNT GPDPGSTDLS GGDSEGAVPL LAAPGMSITI
ELPSGLSTLS GTSMAAPHVA GGAAVSRAAG YGVDETWSRL IKYAYPLPNA GATEAKHGLL
DVQALLEGTE PADDPADVRT VEAAARDDFN ESLSTVL