Gene Avi_5564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5564 
SymbolaglA 
ID7381472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp570563 
End bp573865 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content58% 
IMG OID643649144 
Productalpha-glucosidase 
Protein accessionYP_002547381 
Protein GI222106590 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.657284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGC AACATCAGCA GGCCGAGACC GATCCGCTTT GGTATAAGGA TGCAATTATC 
TATCAATTGC ATATCAAGTC CTTTTACGAT GGCAATGGCG ACGGGATCGG CGACTTCAAG
GGCTTGACGG AAAAGCTCGA CCATATCGCC TCGCTTGGCA TTACCGCCAT CTGGATCTTG
CCGTTCTTTC CCTCGCCGCG GCGCGATGAT GGCTATGACA TCGCCGATTA CGGCAATGTC
AGCCCGGACT ATGGCACGAT GGACGATTTC CGCGCCTTCG TGGATGCCGC TCATGCCCGT
GACATGCGCG TCATCATCGA ATTGGTGATC AACCATACAT CCGACCAGCA TCCCTGGTTT
GAGCGAGCCC GCAATGCGCC TGCCGGTTCG CCGGAACGGG ATTTCTATGT CTGGTCGGAG
ACCGATCAGA AGTTTCCCGA AACCCGGATC ATCTTTCTCG ATACGGAAAA GTCCAACTGG
ACCTGGGACC CGGTGGCTGG CGCCTACTAT TGGCACCGCT TCTATTCCCA CCAGCCGGAC
CTCAACTTCG ACAATCCGGC TGTGCTGGAC GAATTGATCA CGGTCATGCG CTTCTGGCTC
GATACCGGCA TTGATGCCTT CCGGCTCGAC GCCATTCCTT ACCTTGTCGA ACGCGAGGGG
ACCAATAACG AGAACCTGCC GGAAACCCAC GCGATCCTGA AGAAGATCCG CGCGGCAATG
GACGAAGGTC ATCCGGGCAA GATGCTGCTG GCCGAGGCCA ATCAATGGCC TGAAGATACC
CAGGAATATT TCGGTGACGG TGACGAATGC CATATGGCCT TTCACTTCCC GCTGATGCCG
CGCATGTATA TGGCGATTGC CAAGGAAGAT CGTTTTCCGA TCACCGACAT CATGCGCCAG
ACGCCTGAGA TCCCCGATAA TTGCCAATGG GCAATTTTCC TGCGCAATCA CGACGAACTG
ACGCTGGAAA TGGTCACCGA CGCCGAGCGG GACTATTTGT GGAATACCTA TGCCGCCGAC
CGCCGCGCCC GCATCAATCT CGGCATTCGC CGCCGGTTGG CGCCGCTGAT GGAGCGTGAC
CGCCGCCGCA TCGAGCTGAT GAACGCGCTG CTGCTGTCGA TGCGCGGCAC GCCGGTGATC
TATTACGGCG ACGAGATTGG CATGGGCGAT AATATCTATC TCGGTGACCG CGACGGGGTG
CGCACGCCGA TGCAATGGTC GCCTGATCGC AATGGCGGGT TTTCCCGCGC CGACCCGGCC
CGTCTGGTCC TGCCGCCGCT GATGGACCCG CTTTATGGGT ATGAGGCGGT CAATGTTGAA
GCACAGTCGG CTGATGCCCA TTCGCTGCTG AACTGGAGCC GCCATATGCT GGCGCTCCGG
CGTAAATTTA CTGCATTCGG ACGAGGCACA TTGCGGTTCC TGTCACCGGC CAACCGCAAG
ATCATCGCCT ATCTGCGGGA ATATGAAGGC GAAGTGCTGC TGTGCGTCGC CAATTTGTCG
CGCCTGCCGC AGGCGGTTGA GCTGGATCTC GCTGAATTCG AAAAGCGCAT TCCCATCGAA
CTAACAGGCA TGTCGGCATT CCCCCCCATC GGCCAGCTGA CTTATCTGCT GACCCTGCCA
CCCTATGGTT TCTTCTGGTT CAAGCTGTCG GGCGAGACGG ATGCGGATGG GCCAACCTGG
CGCACCGAGC CACCGGAGCA ATTGCCGGAT TTCGTTACCA TCGTCATGCG GCGCGAGCTT
GCCCAGTTGC TGGACGAGCC TGGTATCCAA GAGACGATTT CGCGTGAAAT CCTGCCTGCC
TATCTTGCCA AGCGCCGTTG GTTTGCCTCC AAGGGCGAAA GGGTGAAGCG CGCCTCGCTG
ATCTCGACCA TTCCGATGCC CTTCGGCAAT GACCTGCTGC TGGGTGAGTT GGAAACCGAG
TTGGAAGATC GCGTTGAGCG GTATTTTCTA CCGTTCGCCA TCGCCTGGGA CGATGAGAAT
CCCCATGCGC TTGCCCAGCA ACTGGCCTTC GCAAGAGTGC GGAAAGGACG CCGGGTTGGC
TTCCTGACGG ATGGCTTTGC CATGCAAAGC ATGGCACGCG GCGTCATCAG GGCTTTGCGC
GAACGCTCCA GCATTTCCAG TAAAAGCGGC TCCATCGATT TCATCGGCAC CGAACAACTC
GATGGCATTG AGCTGTCCGA CGACATGCAA GTCAAATGGC TCTCGGCTGA ACAATCCAAT
AGTTCGTTGA TCATCGGCGA CATGGCGATG ATCAAGCTCA TCCGTCATAT CCATCAGGGC
ATCCACCCGG AAGTGGAAAT GACCCGCCAC CTGACGCGGC TTGGCTACGC CAACACTGCC
CCGCTGCTGG GAGAGGTCGT CAGGGTTTCA CCTGAAGATG AGCGCTCGAC ATTGATCATC
ATCCAGGGCG CCATCCGCAA TCAGGGCGAT GCCTGGACCT GGATGCTCGA CACGCTCAGG
CAAACGCTCG AACAAAGCAT GGTTGCCGGT CAGGATGATG ACGCCGAGCA AGAGCGGTTC
AACCCGCTGT TCAATCAAGC AGCCGTGATC GGCAAGCGGC TTGGCGAGTT GCATGTGGCA
CTGGCCAAGC CAACAGACGA CCCGGCTTTC GCGCCGGTCG CAATGAGTGA TGAGGATGTC
TCCGTTTGGG CGCGATCAGT CATGGCCCGC GTTGCCGACT GCCTGGACAG AGTGTCTGAG
ATTGGTCATG GCAGCGACAG TGATGGTCTC GATACTGAAA CCATCAGGGT CAGCACCATG
TTGGCGGACC GTCGGGAGCA GATCTTGAGC GCGGTCGGTA CGCTGGCCAG TGTTGCCAAG
GACACGCTGA TGATCCGCAA CCACGGCGAT TTCCATCTGG GCCAAATCCT GGTTGCCGAA
GACGATGTTT ATATCATCGA TTTCGAAGGC GAACCGGCCC GCCCCCTCGC CGAACGGCGG
GCAAAGACCA ATCCGCTGCG CGACGTGGCC GGGCTGATCC GGTCACTGAG CTATCTCGCT
GCTTCCGCCG ATACGGGCCG CGAGATCGTC GTCCAGGGCG ACAGTGCAGC AAGAGCCGCG
CTGATCGAGG CGTTTCGCAA GAAGGCCGAG ACGCATTTCC TCGACAGCTA TTTCACGGCA
GTCGACGGAG AACCATCACT GGCAATGGAC CCTGAAAAAC GCCGCGAGAT CCTCGACCTG
TTCCTGCTGG AAAAGGCCGC CTATGAAATC GGCTATGAAG CCAGCAATCG GCCCGGCTGG
ATCGCTATTC CGCTTGCCGG GTTTGCCGAC ATTGCCGAAC GGCTGACGGA GAATTCCCTA
TGA
 
Protein sequence
MEQQHQQAET DPLWYKDAII YQLHIKSFYD GNGDGIGDFK GLTEKLDHIA SLGITAIWIL 
PFFPSPRRDD GYDIADYGNV SPDYGTMDDF RAFVDAAHAR DMRVIIELVI NHTSDQHPWF
ERARNAPAGS PERDFYVWSE TDQKFPETRI IFLDTEKSNW TWDPVAGAYY WHRFYSHQPD
LNFDNPAVLD ELITVMRFWL DTGIDAFRLD AIPYLVEREG TNNENLPETH AILKKIRAAM
DEGHPGKMLL AEANQWPEDT QEYFGDGDEC HMAFHFPLMP RMYMAIAKED RFPITDIMRQ
TPEIPDNCQW AIFLRNHDEL TLEMVTDAER DYLWNTYAAD RRARINLGIR RRLAPLMERD
RRRIELMNAL LLSMRGTPVI YYGDEIGMGD NIYLGDRDGV RTPMQWSPDR NGGFSRADPA
RLVLPPLMDP LYGYEAVNVE AQSADAHSLL NWSRHMLALR RKFTAFGRGT LRFLSPANRK
IIAYLREYEG EVLLCVANLS RLPQAVELDL AEFEKRIPIE LTGMSAFPPI GQLTYLLTLP
PYGFFWFKLS GETDADGPTW RTEPPEQLPD FVTIVMRREL AQLLDEPGIQ ETISREILPA
YLAKRRWFAS KGERVKRASL ISTIPMPFGN DLLLGELETE LEDRVERYFL PFAIAWDDEN
PHALAQQLAF ARVRKGRRVG FLTDGFAMQS MARGVIRALR ERSSISSKSG SIDFIGTEQL
DGIELSDDMQ VKWLSAEQSN SSLIIGDMAM IKLIRHIHQG IHPEVEMTRH LTRLGYANTA
PLLGEVVRVS PEDERSTLII IQGAIRNQGD AWTWMLDTLR QTLEQSMVAG QDDDAEQERF
NPLFNQAAVI GKRLGELHVA LAKPTDDPAF APVAMSDEDV SVWARSVMAR VADCLDRVSE
IGHGSDSDGL DTETIRVSTM LADRREQILS AVGTLASVAK DTLMIRNHGD FHLGQILVAE
DDVYIIDFEG EPARPLAERR AKTNPLRDVA GLIRSLSYLA ASADTGREIV VQGDSAARAA
LIEAFRKKAE THFLDSYFTA VDGEPSLAMD PEKRREILDL FLLEKAAYEI GYEASNRPGW
IAIPLAGFAD IAERLTENSL