Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SYO3AOP1_1160 |
Symbol | |
ID | 6331817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfurihydrogenibium sp. YO3AOP1 |
Kingdom | Bacteria |
Replicon accession | NC_010730 |
Strand | + |
Start bp | 1207285 |
End bp | 1208781 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642657442 |
Product | protease Do |
Protein accession | YP_001931326 |
Protein GI | 188997075 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000336016 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATAA AAGCGGAGGT TATAAAAATT ATGAAAAAGG TAGTTTTACC AATTGCTTTA ATTTTAGCAG TTGCATATGT TGTTTTTGCT CAGCTTGGAA AACAGCAAGA AAGTTATAAT CAAAGTATAA AAGGTCAAAC TTTATCACTT CCAAATCTCG AAGCTATAGA AAATGAAAGG ATAAAGCTTA TTGAAATGGT TTCTCCAGGT GTTGCAACCG TATTTACCAC TCAAGAAGTT AAAATTCCTA ATCCTTTTAG TGATATTCCT TTTGGTGATT TTTTTGGCAT TCCAAACACA CCAGAATTTA AACAAAAAAG ACAAGGTCTT GGTTCTGCTT TTATTGTAGA TGTAGATTAT AATAAAAAAG TTGTGTTTCT TCTAACAAAC AATCACGTAA TAGAAAATGC AAAAGATATT CAAGTTGCTT TTAAAAACAA AGTTGTACTA AAAGGAAGAG TTGTTGGTGG CGATAAATTA AGCGATGTTG CATTAATAGA AGTTCCATTT AAAAAAGGAA TAGAAGATTT TGCATCTAAG CATGTTTTAA AACTTGGAGA TTCTGACCAG CTAAAAGTAG GTGCAACAGT TATTGCAATC GGAAGCCCCC TTGGACTTTC AGATACAGTT ACAATAGGAA TTGTATCAGC CAAAAACAGA CAGATAGAAG ATAGACCGGG AGAAGGATTT ATTCAAACAG ACGCAGCAAT TAATCCAGGA AACTCCGGCG GCCCACTGAT CAACATCAAA GGTGAAGTTG TAGGAATCAA TACTGCTATA ATTGCCGGAG CACAAGGTCT TGGTTTTGCA ATTCCAATAA ATCAAGCAAA ATGGGTTATG GATCAAATAT TAAAATATGG AAAAGTAAAA AGAAGTAAAA TTGGTGTAAT CGTTCAACCT TTAACTCCGG AGCTTGCAGA ACACTTTGGA GTAAACGAAG GAGTACTTGT TTCTAATGTG CAACCAGGAG GACCGGCAGA CAAAGCTGGT ATTAGAGCAG GAGATATTAT AGTAGAAGTA AATGGCAAAA AAATTTCTGA AGTTCAAGAT TTACAAAATC AGATAATGAA AAACCCGCCG GGAACAAAAA TTAATCTAAA AGTTATAAGA AATGGTAAAG AGCTGACATT TACAGTAATT ACCGTTCCAT TAGAAGGTAG TGATACACAA GAGCAAACTA CTGATGAAAG TTTGTCTTCA ATAGAAAACA GCATAGGATT AATAGTAAAA GATTTAACAC CAGGATTGAT ACAAAAATAT GGATTGCCTA AGGTGTCTAA TGGAGTTTTG GTATATGGCG TTAAAGAAGG CAGTGCTGCA GAAGATGCAG GTTTACAAGC TGGAGATATT ATCCTTTCTG TGAATAATAT TCCTGTAAAA TCTGCTTCCG AATTTTGGTC TATAATATCA AAAGCTAAAA AAGAAGGAAA AGATAATGTT CTTCTTTATT TACAAAAAGG AGATAATAGA ATATATCTAA CATTACCTTT AAGATGA
|
Protein sequence | MNIKAEVIKI MKKVVLPIAL ILAVAYVVFA QLGKQQESYN QSIKGQTLSL PNLEAIENER IKLIEMVSPG VATVFTTQEV KIPNPFSDIP FGDFFGIPNT PEFKQKRQGL GSAFIVDVDY NKKVVFLLTN NHVIENAKDI QVAFKNKVVL KGRVVGGDKL SDVALIEVPF KKGIEDFASK HVLKLGDSDQ LKVGATVIAI GSPLGLSDTV TIGIVSAKNR QIEDRPGEGF IQTDAAINPG NSGGPLINIK GEVVGINTAI IAGAQGLGFA IPINQAKWVM DQILKYGKVK RSKIGVIVQP LTPELAEHFG VNEGVLVSNV QPGGPADKAG IRAGDIIVEV NGKKISEVQD LQNQIMKNPP GTKINLKVIR NGKELTFTVI TVPLEGSDTQ EQTTDESLSS IENSIGLIVK DLTPGLIQKY GLPKVSNGVL VYGVKEGSAA EDAGLQAGDI ILSVNNIPVK SASEFWSIIS KAKKEGKDNV LLYLQKGDNR IYLTLPLR
|
| |