Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5759 |
Symbol | |
ID | 7381639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 789772 |
End bp | 791022 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643649311 |
Product | polygalacturonase |
Protein accession | YP_002547548 |
Protein GI | 222106757 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTGCCT CTGGCGGCGG CAGGGTTTCG CTCAAAGCTG GATCGCATCA GGTTGGCGGG CTGACGCTTC GCTCCGGTGT CGAGCTGCAT CTGGGGGAGG GAGCAGTCTT GCGGCCCGTG CCGGATTATC AGGCCTATGG CGACAATATC GTCTCGGTCA TTGCCGAAAA ATCCAACCGG GCCATGCTGG TGGCACGCGG TGCGGACACG ATCGCGCTGA CCGGGTCGGG CCGTATTGAT GCTGGCGGAG ACGCATTCAT TGCCGGTGAC GATGTCAGTG TCGGCGTATT CATTCCGGCG GAATTTCGTC CCCGTGTTCT GGTGCTGGAG GGGTGCCGGG GCGTTCGTCT CGACCATATT AAGGTCGAGA ACTCGCCAAT GTGGACGCTG CATTTCGTCA ATTGTGAGGA TGTCAGCCTC GCCAATGTGG CGATCCGCAA CAATTGCCGC CTGCCCAATA CCGATGGCAT CGTGCTGGAC GCCTGCCGCC GGGTGTTGAT TGAGGATTGC ACGATCTCCA CCGCCGATGA TGGTATCTGC CTGAAAACCA GCGTCGGGCC GGATGGAAAG GCGATTGGCA CCTGCGAGGA TGTGCTGGTG CGCCGCTGCA CCGTTTCCAG CCAGAGCTGC GCGCTGAAGC TGGGAACTGA AAGTTTCGGC GATTTCTCCC GCGTGGTGTT TGAGGATTGC CGGGTTGAAC AATCCAACCG GGGCCTGGGG ATTTTTTCCC GCGATGGCGG AAAGGTCTCC GATGTCAGGT TCTCCCGGAT CAGCCTGGAC TGCCGGGAGA CGCCGGACGG GTTCTGGGGT TCCGGCGAGG CGCTGACGGT CACGGTGGTG GACCGGGTGC CGGAGCGCCC GGCGGGTGCG GTGCGAGGAC TTGTTATTGA GGACATCACC GGGCGGATGG AAGGCACGGT CAACCTCGTC TCCATGGCCG GTGCCGGTAT TCACGATGTG GCGTTGCATC GGGTGCATCT AGCCCAGCAA ACCGGGGTAC TTGGCACCGC GTTGCGCTAT GATATGCGTC CGACCAATGC CGATCTGGCA CCTTCGCCCG AGGCTCATGG CCGGGCCAAT GCCTGGACGC GTGGGGCTGA CGGACGCATC GTGGGGCTCA TCGACTATCC GGGCGGAATG CCGGGCGTGT TTGCCCTTGG GGTGGACGGG CTGACTATAG ACGATCTCGT TATCGACAGG CCGCAGCCCC TGCCGGAGGG CTGGAACCCG CTGGATTTCG TCCAACAGTA A
|
Protein sequence | MSASGGGRVS LKAGSHQVGG LTLRSGVELH LGEGAVLRPV PDYQAYGDNI VSVIAEKSNR AMLVARGADT IALTGSGRID AGGDAFIAGD DVSVGVFIPA EFRPRVLVLE GCRGVRLDHI KVENSPMWTL HFVNCEDVSL ANVAIRNNCR LPNTDGIVLD ACRRVLIEDC TISTADDGIC LKTSVGPDGK AIGTCEDVLV RRCTVSSQSC ALKLGTESFG DFSRVVFEDC RVEQSNRGLG IFSRDGGKVS DVRFSRISLD CRETPDGFWG SGEALTVTVV DRVPERPAGA VRGLVIEDIT GRMEGTVNLV SMAGAGIHDV ALHRVHLAQQ TGVLGTALRY DMRPTNADLA PSPEAHGRAN AWTRGADGRI VGLIDYPGGM PGVFALGVDG LTIDDLVIDR PQPLPEGWNP LDFVQQ
|
| |