Gene Htur_2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2098 
Symbol 
ID8742698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2167253 
End bp2168533 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content63% 
IMG OID646512680 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003403654 
Protein GI284165375 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTAA ACAGGCGATC GGTACTCAAG GGTATTGGTG CAACGGGTGT ATCACTAACG 
TTCGCGGGGT TCGCAAGTGC TGAGGGCGAT GCTCGGTATA TCGTTACCGT CGAAAACGAT
CGTGCCCGTG ATCGTCTTGA GGCCGCGGAA TTCGCAATCA AAAACGTGCT CGCTGGCGGT
GCTGTGGTAG TTGCCGTCGG CAGAGAAGAT GCAGTTGATG ATCTCGAAGG AATCCGGGGT
GTCAGAACGG CCGCACGAGA CGTCTTATTC GCCCTGGAGG AGCCGGTGGC GACCGAACCG
GCTGACGAAC ACTTCGACGA GCCCATTTTT TGGGATCGCC AGTGGGACAA GCACGTCACC
GACGTAAGGC GGGCCCATCA AACGGCGACT GGGGACGGTT CCACGATCGC GGTCATCGAT
ACGGGGATTG ACGCGAGCCA CCCGGACCTC CAGAACGTGG ACACCAATAA TAGCGCCGCG
ATCATCGACG GTGACGTGAC CGCCGGCGAC GGTGGGCAGG TACACTGGCA CGGAACTCAC
GTCGCGGGGA TCGCCGCCGC TCAGGGCGGG AGCGTCACTG GAATGGCACC GGACGCCACC
ATCCTAAACT TGCGGGTGTT CCCGGAAGAG GGTGACTTGT TCGCGTCTGC GAGCGATATT
CTGCTGGCCC TAGAGTATGC CGCCGACCAA GGTGCCGACG TAGCAAACAT CAGCCTCGGG
GCGGGGCCCT ACCCTCCGCA GGCCAACGCC GGGGGACTCC GCGCCGCCCG CGAGAAGACC
GTTAATAACG TGGTCCGCCG AGGGCTGCTT GTCTCCGCGA GTGCTGGTAA CGAGGACGCG
AACCTCAAGC AAGGCGGGTT CTTCCACCTC ACGAGCAGCG TCGCCGGCGC GATGAGCGTC
AGCGCCACCG GTCCGGACGA CCTCCGAGCA TTCTACTCGA ACTACGGCTC TAATGACATC
GCCGTCGGGG CGCCGGGCGG CGGGTACGAG ACAAGTGAGA AGACCGAGAG CACGGACACT
CCATGGCCGT ACCCGCACAA CCTCGTGTTT TCGACACTCC CCGGCCCGAG TTACGGGTGG
GCAGCCGGGA CGTCGATGGC GGCCCCGCAG GTCACCGGCG CCGCTGCGCT CGTCCACGAG
GTCGCCCCCG ACGCTAACGC GCGTCAGGTT GAACAGGCCA TCAAGAATGG CGCCGATCTC
GTCAATGGGC AAAACGACGA CGACCTCGGT GCGGGTCGCT TGAACGCTGC CGACGCACTG
GATGCGCTAC GAGTACGCTA A
 
Protein sequence
MELNRRSVLK GIGATGVSLT FAGFASAEGD ARYIVTVEND RARDRLEAAE FAIKNVLAGG 
AVVVAVGRED AVDDLEGIRG VRTAARDVLF ALEEPVATEP ADEHFDEPIF WDRQWDKHVT
DVRRAHQTAT GDGSTIAVID TGIDASHPDL QNVDTNNSAA IIDGDVTAGD GGQVHWHGTH
VAGIAAAQGG SVTGMAPDAT ILNLRVFPEE GDLFASASDI LLALEYAADQ GADVANISLG
AGPYPPQANA GGLRAAREKT VNNVVRRGLL VSASAGNEDA NLKQGGFFHL TSSVAGAMSV
SATGPDDLRA FYSNYGSNDI AVGAPGGGYE TSEKTESTDT PWPYPHNLVF STLPGPSYGW
AAGTSMAAPQ VTGAAALVHE VAPDANARQV EQAIKNGADL VNGQNDDDLG AGRLNAADAL
DALRVR