Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0100 |
Symbol | htrA |
ID | 5135611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 90423 |
End bp | 91793 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640531560 |
Product | protease DO |
Protein accession | YP_001216065 |
Protein GI | 147673156 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000000901293 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA AACCTTTACT TGTTTTAACT GCTCTGTCTC TTAGTTTGAG CGCGATTCTC TCGCCTTTGC CTGCAACTGC AGCGCTTCCT CTCTCAGTCA ATGGAGAGCA GATTCCTAGC CTAGCCCCCA TGCTTGAAAA AGTCACACCC GCCGTGGTGA GCATTGCTGT GGAAGGGACT CAAGTTTCAA GACAGCGTCT GCCGGATCAG TTTCGTTTTT TCTTCGGACC GGATTTTCCG ACCGAACAAC TCCAAGAGCG ACCTTTCCGT GGCTTAGGTT CTGGGGTCAT CATTAACGCT GATAAAGGGT ATGTCGTCAC TAACTACCAT GTCATTAATG GTGCTGAAAA AATTCGCGTC AAACTGTATG ACGGTCGCGA GTTTGATGCA GAACTTGTCG GTGGTGATGA GATGTCTGAT GTCGCCTTGC TCAAGCTAAA CAAAGCGAAA AACCTCACTG AGATCCGTAT CGCGGACTCC GATAAACTGC GAGTCGGTGA TTTTGCAGTG GCCATCGGTA ACCCATTTGG CTTAGGGCAA ACTGTGACCT CTGGCATTGT CTCAGCCTTA GGGCGTAGTG GTTTGAATAT CGAAAACTTT GAAAACTTCA TCCAGACCGA TGCCGCCATC AACAGCGGCA ACTCAGGAGG AGCTCTGGTT AACCTTAATG GTGAACTCAT CGGTATCAAC ACCGCGATCC TTGGTCCAAA CGGTGGCAAC GTCGGTATAG GTTTTGCCAT CCCATCGAAT ATGATGAAAA ATCTGACCGA TCAAATTCTT GAGTTTGGTG AAGTGAAACG CGGCATGCTG GGTGTACAAG GCGGTGAAAT CACTTCCGAA CTGGCTGATG CGCTCGGCTA TGAATCCTCA AAAGGTGCTT TTGTCAGCCA AGTGGTTCCT GACAGTGCTG CGGACAAAGC GGGCATCAAA GCGGGTGACA TCATTACGTC GCTGAATGGT AAAAAAATCG ATACCTTCTC TGAGCTACGC GCGAAAGTCG CGACCCTAGG CGCAGGAAAA ACCATTACCC TTGGAGTGCT GCGTGATGGT AAGAATCAAA ATATTGATGT AACGCTTGGG GAGCAGCAAA ATGCCAAGAC CAAAGCAGAA TCACTGCATC AAGGTTTGAG CGGCGCGGAG TTAAGCAACA CCACTGACAG CGATCCTATT CAGGGCGTTA AGGTTACTGA GGTTCAAAAA GGCTCTGCCG CTGAATCTTA CCAGCTACAA AAAGACGACA TTATCATTGG CGTTAACCGT AAGCGGGTGA AAAATATCGC CGAGTTGCGT GCGATTATGG AAAAATCACC GAATATTTTG GCATTAAATA TCCAACGTGG AGAGAGAACG CTTTACTTGG TTGTTCGTTA A
|
Protein sequence | MMKKPLLVLT ALSLSLSAIL SPLPATAALP LSVNGEQIPS LAPMLEKVTP AVVSIAVEGT QVSRQRLPDQ FRFFFGPDFP TEQLQERPFR GLGSGVIINA DKGYVVTNYH VINGAEKIRV KLYDGREFDA ELVGGDEMSD VALLKLNKAK NLTEIRIADS DKLRVGDFAV AIGNPFGLGQ TVTSGIVSAL GRSGLNIENF ENFIQTDAAI NSGNSGGALV NLNGELIGIN TAILGPNGGN VGIGFAIPSN MMKNLTDQIL EFGEVKRGML GVQGGEITSE LADALGYESS KGAFVSQVVP DSAADKAGIK AGDIITSLNG KKIDTFSELR AKVATLGAGK TITLGVLRDG KNQNIDVTLG EQQNAKTKAE SLHQGLSGAE LSNTTDSDPI QGVKVTEVQK GSAAESYQLQ KDDIIIGVNR KRVKNIAELR AIMEKSPNIL ALNIQRGERT LYLVVR
|
| |