Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_1368 |
Symbol | dop |
ID | 7389105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 1143352 |
End bp | 1144827 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643650776 |
Product | serine protease DO-like protease |
Protein accession | YP_002548982 |
Protein GI | 222148025 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.200414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGCGG GTGCCGTGAC GGTAACGCCT GCGCTGGCCG CGCCCGTCGA GGTTCAGGCG CCGCAGGTGC CAAGCTTTGC CGATCTGGTC AGTGCCGTTT CGCCCGCCGT CGTCTCCATC CGCGTCAAAT CGGATGTGCA GCAGGCCTCC GAGGATGGCA GCAATTTCTC CTTCAATGGT CGTGACTTCG ACCAGCTTCC CGATCCTCTG AAGCGCTTCT TCAAGGAATG GGGCATGCCA GGCCCCGGCG GTCCAGGTGG TCCAGGAGGC CCCAAGGGTG GACCGCATGC CGAGCGTCAT GGCAAGCTGC GTCCGATTGC CCAAGGGTCG GGCTTCTTCA TCTCCGAGGA CGGCTATGTT GTGACCAACA ATCACGTAGT TTCCGATGGC CAGGCCTATA CAGTGGTGAT GAATGACGGC ACCGAATACG ATGCCAAGCT GGTCGGTAAG GACCCGCGCA CTGACCTCGC CGTTTTGAAG GTCGATCAGC CGACCAAGAA ATTCACCTAT GTCGAATGGG CGCAGGACGA GAAGATCCGC GTTGGTGACT GGGTCGTGGC CGTCGGCAAT CCTTTCGGTC TCGGCGGAAC CGTGACGTCG GGTATCGTTT CGGCTTTTGG CCGTGATATC GGCTCCGGCC CTTATGACGA TTACATCCAG ATCGATGCAC CGGTAAACCG GGGCAATTCG GGTGGGCCGG ACTTCAACCT CAGCGGCAAG GTGGTCGGGA TCAACACGGC GATCTTCTCG CCATCGGGCG GTAGCGTCGG CATCGCCTTC GCTATTCCGG CGGCGACTGC CAAGGATGTC GTTGCTGAAT TGATCAAGCA TGGCTCGGTG CAGCGCGGCT GGCTTGGCGT GCAGATCCAG CCTGTCACCA AGGATATTGC CGAATCGCTC GGTCTGGCCG ATGCCAAGGG CGCACTGGTG GCTGAGCCGC AAACCGGTTC TCCCGGTGAA AAGGCCGGTA TCAAGCAGGG CGACGTGATT ACCGCCGTGA ATGGCGATCC GGTCAAGGAC CCGCGTGACC TCGCCAAGCG CATTGCTGCC TTCCCGCCCA ATACCAAGGT CGATATTTCT ATCTGGCGCA ATGGCAAGCC GACTGCCGTC AAGGTCGATC TCGGCACCTT GCCTGCTGAA AAGGATACGG CCAGCAGTGA TGAGGATCAG GGCGCGCCCG AGCAGAACGC ACCGGCCACC GAGCAGGCGC TTGCCAATCT CGGAGTCACT GTCCAGCGTG CCGATGACGG CAAAGGCCTG ACGATCACCA ATGTCGATCC GGATTCCGAC GCTGCCGACA AGGGGCTGAA GACCGGCCAG AAGATCACGT CCGTTAACAA CCAGCAGGTC TCCAGCGCCG CCGAGGTCAA GAAGATCCTT GATCAGGCCA AGAAGGACGG TCGCACCAAG GCGCTCTTCC AGGTGGAAAC CGACAATGGC AGCCGCTTCA TCGCCCTGCC GATCAACCAG GGCTGA
|
Protein sequence | MLAGAVTVTP ALAAPVEVQA PQVPSFADLV SAVSPAVVSI RVKSDVQQAS EDGSNFSFNG RDFDQLPDPL KRFFKEWGMP GPGGPGGPGG PKGGPHAERH GKLRPIAQGS GFFISEDGYV VTNNHVVSDG QAYTVVMNDG TEYDAKLVGK DPRTDLAVLK VDQPTKKFTY VEWAQDEKIR VGDWVVAVGN PFGLGGTVTS GIVSAFGRDI GSGPYDDYIQ IDAPVNRGNS GGPDFNLSGK VVGINTAIFS PSGGSVGIAF AIPAATAKDV VAELIKHGSV QRGWLGVQIQ PVTKDIAESL GLADAKGALV AEPQTGSPGE KAGIKQGDVI TAVNGDPVKD PRDLAKRIAA FPPNTKVDIS IWRNGKPTAV KVDLGTLPAE KDTASSDEDQ GAPEQNAPAT EQALANLGVT VQRADDGKGL TITNVDPDSD AADKGLKTGQ KITSVNNQQV SSAAEVKKIL DQAKKDGRTK ALFQVETDNG SRFIALPINQ G
|
| |