Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1164 |
Symbol | |
ID | 6743981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 1077140 |
End bp | 1078534 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642750974 |
Product | protease Do |
Protein accession | YP_002121828 |
Protein GI | 195953538 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0251777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA ATTTTTTCGC TTTGGGTTTA GCTTTATCTA TCCTAGGGTC AAACGTTAGT GCAGTATCTG GTCCAATTTT AGAGCAATTT CAAAAAGAAG TAGAACAAAT AGTAGATAAA GTCTCACCTT CAGTAGTAAC AATTTACGCT ACTCAGATAG TGAAAGCTCC TTTAAACACT CAAATATTTC CAGGATTTCC ACCGTTTATG ATGCCAGGTA TTCCAACTCC AGAAATCCCA GAGAAAGAAA AAGATTTGGG TTCTGGTATT ATAATTAAAT ACATTCAATC TAAAAATGCC TTTATAATTT TAACAAACAA TCATGTTGTA GGAAATTCAA AAGATGTAAT GGTAAAATTA TCTAGAACTA TTGAAAGAAA AGCTAAGGTT TTAGGTAGGG ACCCTAAAAC AGATTTAGCA GTGCTAGAAG TTAGCGCTGA GGGCATAGAT AACCCTAGTT CTAGAGTGGC TACTCTTGGG GATTCTTCTC ATGTAAGAAT AGGACAACTT GTCATAGCTA TAGGGAATCC ATACGGTTTT AGTAGAACTG TTACAATGGG AGTAATATCT GCTTTAAACA GAAGGCTTGG CCTTTCTCAA TATGAAGATT ATATACAAAC GGATGCTGCT ATAAACCCAG GCAACAGCGG TGGTCCTCTC ATAAATATAG AAGGCAAAGT GATAGGTATA AATACTGCAA TGGTGAAAGG GGGTCAAGGG CTTAGTTTTG CCATACCTAT AAATTTAGCT AAATGGGTTT ATCATCAAAT AATGGAGTAT GGTAAAGTTA TAAGAGGCTG GCTTGGTGTA TCTATACAGC AGATAACACC TCAGATGGCC TCCTCTTTAG GTGTAAACTA CGGCGCTATC GTCGCTCAAG TATTTCCAGG CTCACCTGCT CAAAAGTACG GTCTTAAAGT AGGGGATATA ATAGTATCAG TGGATGGAAA ACCCCTTGAA AGCATAGACG AGCTCCAATT TAAAACCATG GAATCTCCAC CTGGCACTGT CTTGACTTTG GGTGTTATAA GAAATCATAA GCTTATTACG ATTAAAGTGA AAACTGCAAA AATGCCAAGC AACACTAGCA ATATAGGTGT TGTATCGGAA GCTAAAGATT TGGGCCTTAT GGTAAGACCT CTAAATCCTC AAGAGCAAAG ACGTTACGAA GTAAAAGGCG GCCTTTTGGT GGAAGATGTA CTGATGGGTT CTCCTGCTTA CGAAGCCGGT ATAAGAGCCG GTGATATCAT TCTTAGCATA AACCTTCATA GAATCTATAC AAAAGCCGAG ATGGACAATA TATTAGAACG CTTAATATCT GAACATAAAG ATACTGCTAC TTTCCTAGTA GATAGAAACG GTCAAAACAT CTTTGTTACA GTGAGATTAA AATAA
|
Protein sequence | MKKNFFALGL ALSILGSNVS AVSGPILEQF QKEVEQIVDK VSPSVVTIYA TQIVKAPLNT QIFPGFPPFM MPGIPTPEIP EKEKDLGSGI IIKYIQSKNA FIILTNNHVV GNSKDVMVKL SRTIERKAKV LGRDPKTDLA VLEVSAEGID NPSSRVATLG DSSHVRIGQL VIAIGNPYGF SRTVTMGVIS ALNRRLGLSQ YEDYIQTDAA INPGNSGGPL INIEGKVIGI NTAMVKGGQG LSFAIPINLA KWVYHQIMEY GKVIRGWLGV SIQQITPQMA SSLGVNYGAI VAQVFPGSPA QKYGLKVGDI IVSVDGKPLE SIDELQFKTM ESPPGTVLTL GVIRNHKLIT IKVKTAKMPS NTSNIGVVSE AKDLGLMVRP LNPQEQRRYE VKGGLLVEDV LMGSPAYEAG IRAGDIILSI NLHRIYTKAE MDNILERLIS EHKDTATFLV DRNGQNIFVT VRLK
|
| |