Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0929 |
Symbol | |
ID | 6743741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | - |
Start bp | 876362 |
End bp | 877783 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642750735 |
Product | protease Do |
Protein accession | YP_002121594 |
Protein GI | 195953304 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000000572701 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA TTTTTGCAAT ATTATCTGTA TTTGCCCTTT TGTTTTTCAC AAACTCCTGT GTGAAGAAAA GTAGAGTAGA ACAAGCTACC TCGACTCAGC AACCGTCTAG CCAGTACCAA CTCAAATTAA ATGTACCAGT ATTGGCACAG ATGCAAGATG AACTTGTGCA GATTGTAAAA AGAGTCTCTC CTTCTGTGGT AACGATATTT TCTACCCAAG AGGTAAACGT ACCGCTTTTT CCACAAATAC CTGGTTTTGA CCTTCCAACA CCTTCGATAC CACAAGAAAC AAAGGCTCTT GGGTCCGGTG TTATATTTGA ATACAACAAG CAAAACGATA CGTTTTTTAT ACTTACAAAC AACCATGTTA TAGCTCATAG CAAAAGCGTA GTGGTGAATT TTGGAAAAAA TGAACAGCAT CAAGCTAAAG TATTAGGTGC AGATCCAAAA ACAGATTTGG CAGTATTAGA AGTTAGTGCA AAGGGGATAA AAGATCCAGA TTCAAGAGTG GCAACGCTTG GTAATTCAGA TACACTCCAA GTGGGCCAAA TCGTGCTAGC TATAGGTAAT CCTTATGGCC TAGATAGGAC TGTGACGATG GGGGTTATAT CTGCTTTACA TAGGAGTATA GGTTTAACTC AATATGAAAA TTACATACAA ACGGATGCTG CTATAAACCC AGGCAACAGC GGCGGGCCTC TTGTAAATAT ACAAGGCCAG GTAATAGGTA TAAACTCTGC TATGGTAGAA GGCGGTCAAG GTCTTGGCTT TGCCATCCCT ATAAATTTAG CAAAATGGGT ATCTTCTCAA ATTATAAAAC ATGGTTCTGT TACAAGGGGA TGGATAGGTG TGATGATACA ACAAGTTACG CCAAGCTTAG CAAAAGCTCT AAAAGTTCAA AACGGTGCTG TGGTAGTTCA AGTTATGCCA AATGGCCCTG CAGATAAAGC TGGTATAAAA GTGGGTGATG TTATAGTAGG TATAGACAAC GAAAATATAA GCACTATACA ACAGCTGCAA TTTAAAGTGA TGGAGACAAA ACCTGGTACT ACTCTCACAT TTCACATAAT AAGAAATGGA AAGCCCATGG ACTTAAAAGT AACTATAGGG AAAATGCCTA CAAACCCAAC GTCTGTTAGT GAAACTCAAA CTACCACAGA CCTTGGCATA TCTGTAGCAA ACCTAACTCC ACAACAGATG CAAACTTACG GCGGTGGTGT TTATGTGGTA AGCGTAGGCC CAAACAGTCC AGCAGCTAGT TCTCTTCAGC CTGGAGACGT GATACTAATG GTAAACAACC ATCCTGTGAA CTCTGTAAAT GATTTTAAAT CGCTTGTATC TCAATATGTA AAATCTGGAT ATGTGTTGTT TTTGGTGGCA AGGGATGGGC AAAGATTTTA CGTAAGTATA CAAACAAGGT GA
|
Protein sequence | MRKIFAILSV FALLFFTNSC VKKSRVEQAT STQQPSSQYQ LKLNVPVLAQ MQDELVQIVK RVSPSVVTIF STQEVNVPLF PQIPGFDLPT PSIPQETKAL GSGVIFEYNK QNDTFFILTN NHVIAHSKSV VVNFGKNEQH QAKVLGADPK TDLAVLEVSA KGIKDPDSRV ATLGNSDTLQ VGQIVLAIGN PYGLDRTVTM GVISALHRSI GLTQYENYIQ TDAAINPGNS GGPLVNIQGQ VIGINSAMVE GGQGLGFAIP INLAKWVSSQ IIKHGSVTRG WIGVMIQQVT PSLAKALKVQ NGAVVVQVMP NGPADKAGIK VGDVIVGIDN ENISTIQQLQ FKVMETKPGT TLTFHIIRNG KPMDLKVTIG KMPTNPTSVS ETQTTTDLGI SVANLTPQQM QTYGGGVYVV SVGPNSPAAS SLQPGDVILM VNNHPVNSVN DFKSLVSQYV KSGYVLFLVA RDGQRFYVSI QTR
|
| |