Gene Avi_5759 details


Gene Information

Locus tag: Avi_5759
Symbol:
ID: 7381639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011988
Strand:
Start bp: 789772
End bp: 791022
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 62%
IMG OID: 643649311
Product: polygalacturonase
Protein accession: YP_002547548
Protein GI: 222106757
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
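
The start, end, gene length, and protein length fields in this record are mutually consistent, and a short Python sketch can check that. The function names are illustrative only and do not belong to any database API. The calculation assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(789772, 791022)
print(length, protein_length_aa(length))  # prints: 1251 416
```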


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCTGCCT CTGGCGGCGG CAGGGTTTCG CTCAAAGCTG GATCGCATCA GGTTGGCGGG 
CTGACGCTTC GCTCCGGTGT CGAGCTGCAT CTGGGGGAGG GAGCAGTCTT GCGGCCCGTG
CCGGATTATC AGGCCTATGG CGACAATATC GTCTCGGTCA TTGCCGAAAA ATCCAACCGG
GCCATGCTGG TGGCACGCGG TGCGGACACG ATCGCGCTGA CCGGGTCGGG CCGTATTGAT
GCTGGCGGAG ACGCATTCAT TGCCGGTGAC GATGTCAGTG TCGGCGTATT CATTCCGGCG
GAATTTCGTC CCCGTGTTCT GGTGCTGGAG GGGTGCCGGG GCGTTCGTCT CGACCATATT
AAGGTCGAGA ACTCGCCAAT GTGGACGCTG CATTTCGTCA ATTGTGAGGA TGTCAGCCTC
GCCAATGTGG CGATCCGCAA CAATTGCCGC CTGCCCAATA CCGATGGCAT CGTGCTGGAC
GCCTGCCGCC GGGTGTTGAT TGAGGATTGC ACGATCTCCA CCGCCGATGA TGGTATCTGC
CTGAAAACCA GCGTCGGGCC GGATGGAAAG GCGATTGGCA CCTGCGAGGA TGTGCTGGTG
CGCCGCTGCA CCGTTTCCAG CCAGAGCTGC GCGCTGAAGC TGGGAACTGA AAGTTTCGGC
GATTTCTCCC GCGTGGTGTT TGAGGATTGC CGGGTTGAAC AATCCAACCG GGGCCTGGGG
ATTTTTTCCC GCGATGGCGG AAAGGTCTCC GATGTCAGGT TCTCCCGGAT CAGCCTGGAC
TGCCGGGAGA CGCCGGACGG GTTCTGGGGT TCCGGCGAGG CGCTGACGGT CACGGTGGTG
GACCGGGTGC CGGAGCGCCC GGCGGGTGCG GTGCGAGGAC TTGTTATTGA GGACATCACC
GGGCGGATGG AAGGCACGGT CAACCTCGTC TCCATGGCCG GTGCCGGTAT TCACGATGTG
GCGTTGCATC GGGTGCATCT AGCCCAGCAA ACCGGGGTAC TTGGCACCGC GTTGCGCTAT
GATATGCGTC CGACCAATGC CGATCTGGCA CCTTCGCCCG AGGCTCATGG CCGGGCCAAT
GCCTGGACGC GTGGGGCTGA CGGACGCATC GTGGGGCTCA TCGACTATCC GGGCGGAATG
CCGGGCGTGT TTGCCCTTGG GGTGGACGGG CTGACTATAG ACGATCTCGT TATCGACAGG
CCGCAGCCCC TGCCGGAGGG CTGGAACCCG CTGGATTTCG TCCAACAGTA A
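
The GC content field can be recomputed from the gene sequence as printed here, in space-separated blocks of ten bases. The following is a minimal Python sketch. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the page does not say how the record's own percentage was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example on the first displayed line only; paste the full gene
# sequence to compare against the record's GC content field.
first_line = "GTGTCTGCCT CTGGCGGCGG CAGGGTTTCG CTCAAAGCTG GATCGCATCA GGTTGGCGGG"
print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```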
 
Protein sequence
MSASGGGRVS LKAGSHQVGG LTLRSGVELH LGEGAVLRPV PDYQAYGDNI VSVIAEKSNR 
AMLVARGADT IALTGSGRID AGGDAFIAGD DVSVGVFIPA EFRPRVLVLE GCRGVRLDHI
KVENSPMWTL HFVNCEDVSL ANVAIRNNCR LPNTDGIVLD ACRRVLIEDC TISTADDGIC
LKTSVGPDGK AIGTCEDVLV RRCTVSSQSC ALKLGTESFG DFSRVVFEDC RVEQSNRGLG
IFSRDGGKVS DVRFSRISLD CRETPDGFWG SGEALTVTVV DRVPERPAGA VRGLVIEDIT
GRMEGTVNLV SMAGAGIHDV ALHRVHLAQQ TGVLGTALRY DMRPTNADLA PSPEAHGRAN
AWTRGADGRI VGLIDYPGGM PGVFALGVDG LTIDDLVIDR PQPLPEGWNP LDFVQQ
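
The gene sequence begins with GTG while the protein begins with M. This follows from translation table 11, in which GTG is one of the permitted initiation codons and the initiator is read as methionine. The sketch below builds the table from the standard NCBI base ordering (TCAG) and translates the first 30 bases of the gene. The function name `translate_cds` and its `is_cds_start` parameter are illustrative assumptions, not part of any existing library.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(raw: str, is_cds_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Only unambiguous A/C/G/T input is handled; other symbols raise KeyError.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate_cds("GTGTCTGCCT CTGGCGGCGG CAGGGTTTCG"))  # prints: MSASGGGRVS
```

Passing `is_cds_start=False` translates an internal fragment without forcing the first codon to methionine.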