Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_2812 |
Symbol | htrA |
ID | 7385942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 2341285 |
End bp | 2342871 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643651851 |
Product | serine protease |
Protein accession | YP_002550036 |
Protein GI | 222149079 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.498431 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCA CGATGGGGAA GCTTATGATC AAGTGTTGCG GACCGGCCAC CCGCCTTCTG ACCTCTCTTG TTGTCGGGGG CATGGTCAGC TTTGGGCCTG CGGCCACGAG CACGGCTGCC TTCGGCCAGG TCAAGACCGC AGCCCCGGTT CCCCCGCCGC CGCCAGCCTC TGGCGCGTCC GCAACGCCTG GACCCGCGCA GGGTGGCCCC GCGCCCGTCG CTGATCTCGC CGAGGGACTT CTCGGTGCGG TGGTCAACAT CGCGACGTCG CAGAATGTCG ATGATGACGA TGCAGCGCCG CTGCCACAGG TGCCGAAAGG GTCGCCCTTT GAGGATCTGT TTGAAGATTT CTACAAGAAT CGCGAGGGCA AAGGCAGCAA CCATAAGGTC AATTCGCTCG GCTCCGGCTT TGTTATCGAT CCCGCTGGTT ACATCGTCAC CAACAACCAT GTCATCGAGA ATGCCGATGA TATCGAAGTG ATTTTCTCCG ACGGTTCGAA ATTGCAGGCC AAGTTGATTG GCACCGACAC CAAGACCGAT CTTTCGCTGT TGAAGGTTGA ACCAACGGAA CCGCTGACGG CGGTAAAATT CGGCGATTCG AAGGTGATGC GGATCGGCGA TTGGGTGATG GCGATCGGCA ATCCCTTCGG GCTTGGCGGT TCGGTGACGC TCGGCATCGT TTCGGCCCGG GGCCGAAATA TCAATGCCGG ACCTTATGAC AATTTCATTC AGACCGATGC GGCGATCAAC AAAGGCAATT CCGGTGGCCC GTTGTTCAAC ATGCGAGGCG AGGTGATTGG TATCAATACC GCGATCATTT CGCCGAGTGG CGGCTCCATT GGTATTGGCT TTGCCGTGCC GTCGGAACTG GCCGAAAATG TCATCAAGCA GCTCCGCGAC TTTGGCGAGA CGCGGCGCGG CTGGCTGGGC GTGCGCATTC AGCCGGTGCC GGATGATCTG GCCAAGTCGG CAGGAATCAA ATTGGGCCGG GGTGCGCTGG TCAGCAGCAT TATTGAAGAC GGTCCGGTGG CCAAGGGGCC GCTTAAAACC GGAGACGTGA TTATTTCCTT CGGCGGCAAG GATATTGCCG AAAGCCGCGA TCTGGTTCGC ACGGTGGCCG AAAGCCCGAT CAATCAGGAT ATCGATGTCG TGGTGTTTCG TGACGGCAAG CGGGAAACAC TGAAAGTGAA GCTGGCGCAA TTGCCTGACG ACAAGGCGAC GGAAGCCAAG GACAGCGAAC AGGCCGATCC AAAAGCCAGC GACAGCGAGG ACACGGATGA AGCGGCCAGC GGCATGGTGC TTGGCATGAG TGTCGAGGCG CTTGATGATG AGAAGCGGGC CGCCAATTCG ATTGCCAAGA GCGTCGAGGG CTTGCTGATA ACAGATGTGC AGCAGGGCTC CGCCGCTGAC CAGAAGGGCT TGAAAACCGG TGAAGTGATC GTGGAAGTGG CGCAGGAATT CGTCGCCACG CCAGAAGCCT TGGCGGAGAA AATCGACAAG CTGAAATCAG ATGGTCGTCG TGCCATTCAC CTGATGGTGG CAACCCCGCA GGGCGATCTG CGTTTCGTGG CCGTGCCGCT GGAATAG
|
Protein sequence | MKLTMGKLMI KCCGPATRLL TSLVVGGMVS FGPAATSTAA FGQVKTAAPV PPPPPASGAS ATPGPAQGGP APVADLAEGL LGAVVNIATS QNVDDDDAAP LPQVPKGSPF EDLFEDFYKN REGKGSNHKV NSLGSGFVID PAGYIVTNNH VIENADDIEV IFSDGSKLQA KLIGTDTKTD LSLLKVEPTE PLTAVKFGDS KVMRIGDWVM AIGNPFGLGG SVTLGIVSAR GRNINAGPYD NFIQTDAAIN KGNSGGPLFN MRGEVIGINT AIISPSGGSI GIGFAVPSEL AENVIKQLRD FGETRRGWLG VRIQPVPDDL AKSAGIKLGR GALVSSIIED GPVAKGPLKT GDVIISFGGK DIAESRDLVR TVAESPINQD IDVVVFRDGK RETLKVKLAQ LPDDKATEAK DSEQADPKAS DSEDTDEAAS GMVLGMSVEA LDDEKRAANS IAKSVEGLLI TDVQQGSAAD QKGLKTGEVI VEVAQEFVAT PEALAEKIDK LKSDGRRAIH LMVATPQGDL RFVAVPLE
|
| |