Gene Htur_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2023 
Symbol 
ID8742622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2094152 
End bp2095765 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content61% 
IMG OID646512605 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003403580 
Protein GI284165301 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTCTG AGGGCGATAT TACTCGACGG CGGCTGCTCC GAACCGTTCC AGTGGCCGGC 
AGCGTCGGCC TCGCCGGCTG TGCCGAGCAG ACCGGCGACG AGACTGGCAA CGGAACAGAT
ACCACTGGGA ACGAAAGCGA CGAAGAACCG ACGCAAACCG AATCCGAGAC CGAAACCGAC
GACGAACCCG AGGAAACGCC GGATCCACCG AACGTTGAGA GTCAGATCAT CCAGCGGGAC
AAGCCAGCGA TAACGAACGT CCGCCACGTC GTCGAAGGGA CGATGACGTG GCCGTCGATC
GGCTGGGAAG ACCTCGTCGA TTCCGATCTC CTCGGCGTCT GGGAAACGAC GGACGAACGG
CTCTACTTCT CATACAATCG TGAGTTCGTC GTGAATGGGG CAGATTACCA GTACAGTGGC
GGGTACGCCA CACGGAACGG CTATCTCTAT CTCAAATACG AGTCAGGGGC CTCGCAGGAG
TTCCAGTACC GAATCGAGGG CGGTCGATCA GCACCGATCC TCGAGCTATA CCAGAACGGG
GAACGCAAGG CCACGTACGA ACAGACGGAG ACGGAAGACG ATCAGCGAGG TCCAGTTGCG
GTAGCCGAAG ATCAGATCGC CGTTCCCGAA CAGAATGCGA CCACGAAACG CGAAGACGTC
CAGACCGGTG CAGTGGGATC AGGCTTTATC GTCTCGCCTG ACGGTACCGT CGTCACGAAC
GCACACGTCG TCGGCGCGCA TCAAAACTCG GAAGAGACGG CGTACGAGCG GTTCGCAGTG
AAGCAGAGCG AAGCGCTCCG ACAGGACCTC TCGTCGAGTG GAAGCTTGAC CGACGAGCAG
GTGGAGGAGA CGGGTCGGAT ACTGTACGAG GAGATAATGG GCTACTACGA GGAAAACGGA
ACCCTCCGAG ACGTCTCCGA GTCGGTACAC GTCCTGAACG GGAAGGCGAC GACCGACGAC
GACCTCGAAG TCGAGAGCTG GTCCGCCGAG GTCGAGACCG CGGGGACCGT CTACAAGGAG
GACGACGGAG AGCCGTCGAT GGGCCGCGAC ATCGCGGTAC TGGACATCGA CGGGGAGAAC
CTCCCGACGG TGACGCTCGG TAGCGCGAAC GACCTCAGCA CCGGCGAAAA CCTCTACATC
ATCGGCTATC CGGACATCGG TATCAGCGAG TTCTTCGATA CCACGAACAC TACTCTCGAG
CCCACGATGA CGACCGGCAT CGTCAGTGCG CGGCGCGAAC TCAACACCGG AATCAACTCG
ATCCAGACCG ACGCGGCGAT CAACGGCGGC AACAGCGGCG GTCCGATGTA CAACAGCGAC
GGTGAGGTCG TCGGCGTCGC GACGTTCAGC CCCAACGATG CTCAGATCCA GGACATCCAG
TTCGGCCTCC CGATCGAAAT CACGACGGGA TTCCTGACCG AACTTGGCAT CGAGAACACC
ACGGGCGAGA TGCAGTCCGC CTACGAAGCG GGTCTCGATG CCTACTGGCG TGGCGACTGC
GAGACGGCGA CGGCGAAGAT GGAGACCGCC CTCGATCTGT ATCCAGATCA TCCGCAAGCG
CAGTCGTACA TCACGGACTG CGAGAACGGC GAGGCGCCCG GACAGGGGTC GTAA
 
Protein sequence
MDSEGDITRR RLLRTVPVAG SVGLAGCAEQ TGDETGNGTD TTGNESDEEP TQTESETETD 
DEPEETPDPP NVESQIIQRD KPAITNVRHV VEGTMTWPSI GWEDLVDSDL LGVWETTDER
LYFSYNREFV VNGADYQYSG GYATRNGYLY LKYESGASQE FQYRIEGGRS APILELYQNG
ERKATYEQTE TEDDQRGPVA VAEDQIAVPE QNATTKREDV QTGAVGSGFI VSPDGTVVTN
AHVVGAHQNS EETAYERFAV KQSEALRQDL SSSGSLTDEQ VEETGRILYE EIMGYYEENG
TLRDVSESVH VLNGKATTDD DLEVESWSAE VETAGTVYKE DDGEPSMGRD IAVLDIDGEN
LPTVTLGSAN DLSTGENLYI IGYPDIGISE FFDTTNTTLE PTMTTGIVSA RRELNTGINS
IQTDAAINGG NSGGPMYNSD GEVVGVATFS PNDAQIQDIQ FGLPIEITTG FLTELGIENT
TGEMQSAYEA GLDAYWRGDC ETATAKMETA LDLYPDHPQA QSYITDCENG EAPGQGS