Gene HY04AAS1_1164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_1164 
Symbol 
ID6743981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp1077140 
End bp1078534 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content37% 
IMG OID642750974 
Productprotease Do 
Protein accessionYP_002121828 
Protein GI195953538 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0251777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA ATTTTTTCGC TTTGGGTTTA GCTTTATCTA TCCTAGGGTC AAACGTTAGT 
GCAGTATCTG GTCCAATTTT AGAGCAATTT CAAAAAGAAG TAGAACAAAT AGTAGATAAA
GTCTCACCTT CAGTAGTAAC AATTTACGCT ACTCAGATAG TGAAAGCTCC TTTAAACACT
CAAATATTTC CAGGATTTCC ACCGTTTATG ATGCCAGGTA TTCCAACTCC AGAAATCCCA
GAGAAAGAAA AAGATTTGGG TTCTGGTATT ATAATTAAAT ACATTCAATC TAAAAATGCC
TTTATAATTT TAACAAACAA TCATGTTGTA GGAAATTCAA AAGATGTAAT GGTAAAATTA
TCTAGAACTA TTGAAAGAAA AGCTAAGGTT TTAGGTAGGG ACCCTAAAAC AGATTTAGCA
GTGCTAGAAG TTAGCGCTGA GGGCATAGAT AACCCTAGTT CTAGAGTGGC TACTCTTGGG
GATTCTTCTC ATGTAAGAAT AGGACAACTT GTCATAGCTA TAGGGAATCC ATACGGTTTT
AGTAGAACTG TTACAATGGG AGTAATATCT GCTTTAAACA GAAGGCTTGG CCTTTCTCAA
TATGAAGATT ATATACAAAC GGATGCTGCT ATAAACCCAG GCAACAGCGG TGGTCCTCTC
ATAAATATAG AAGGCAAAGT GATAGGTATA AATACTGCAA TGGTGAAAGG GGGTCAAGGG
CTTAGTTTTG CCATACCTAT AAATTTAGCT AAATGGGTTT ATCATCAAAT AATGGAGTAT
GGTAAAGTTA TAAGAGGCTG GCTTGGTGTA TCTATACAGC AGATAACACC TCAGATGGCC
TCCTCTTTAG GTGTAAACTA CGGCGCTATC GTCGCTCAAG TATTTCCAGG CTCACCTGCT
CAAAAGTACG GTCTTAAAGT AGGGGATATA ATAGTATCAG TGGATGGAAA ACCCCTTGAA
AGCATAGACG AGCTCCAATT TAAAACCATG GAATCTCCAC CTGGCACTGT CTTGACTTTG
GGTGTTATAA GAAATCATAA GCTTATTACG ATTAAAGTGA AAACTGCAAA AATGCCAAGC
AACACTAGCA ATATAGGTGT TGTATCGGAA GCTAAAGATT TGGGCCTTAT GGTAAGACCT
CTAAATCCTC AAGAGCAAAG ACGTTACGAA GTAAAAGGCG GCCTTTTGGT GGAAGATGTA
CTGATGGGTT CTCCTGCTTA CGAAGCCGGT ATAAGAGCCG GTGATATCAT TCTTAGCATA
AACCTTCATA GAATCTATAC AAAAGCCGAG ATGGACAATA TATTAGAACG CTTAATATCT
GAACATAAAG ATACTGCTAC TTTCCTAGTA GATAGAAACG GTCAAAACAT CTTTGTTACA
GTGAGATTAA AATAA
 
Protein sequence
MKKNFFALGL ALSILGSNVS AVSGPILEQF QKEVEQIVDK VSPSVVTIYA TQIVKAPLNT 
QIFPGFPPFM MPGIPTPEIP EKEKDLGSGI IIKYIQSKNA FIILTNNHVV GNSKDVMVKL
SRTIERKAKV LGRDPKTDLA VLEVSAEGID NPSSRVATLG DSSHVRIGQL VIAIGNPYGF
SRTVTMGVIS ALNRRLGLSQ YEDYIQTDAA INPGNSGGPL INIEGKVIGI NTAMVKGGQG
LSFAIPINLA KWVYHQIMEY GKVIRGWLGV SIQQITPQMA SSLGVNYGAI VAQVFPGSPA
QKYGLKVGDI IVSVDGKPLE SIDELQFKTM ESPPGTVLTL GVIRNHKLIT IKVKTAKMPS
NTSNIGVVSE AKDLGLMVRP LNPQEQRRYE VKGGLLVEDV LMGSPAYEAG IRAGDIILSI
NLHRIYTKAE MDNILERLIS EHKDTATFLV DRNGQNIFVT VRLK