Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5564 |
Symbol | aglA |
ID | 7381472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 570563 |
End bp | 573865 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643649144 |
Product | alpha-glucosidase |
Protein accession | YP_002547381 |
Protein GI | 222106590 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.657284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGC AACATCAGCA GGCCGAGACC GATCCGCTTT GGTATAAGGA TGCAATTATC TATCAATTGC ATATCAAGTC CTTTTACGAT GGCAATGGCG ACGGGATCGG CGACTTCAAG GGCTTGACGG AAAAGCTCGA CCATATCGCC TCGCTTGGCA TTACCGCCAT CTGGATCTTG CCGTTCTTTC CCTCGCCGCG GCGCGATGAT GGCTATGACA TCGCCGATTA CGGCAATGTC AGCCCGGACT ATGGCACGAT GGACGATTTC CGCGCCTTCG TGGATGCCGC TCATGCCCGT GACATGCGCG TCATCATCGA ATTGGTGATC AACCATACAT CCGACCAGCA TCCCTGGTTT GAGCGAGCCC GCAATGCGCC TGCCGGTTCG CCGGAACGGG ATTTCTATGT CTGGTCGGAG ACCGATCAGA AGTTTCCCGA AACCCGGATC ATCTTTCTCG ATACGGAAAA GTCCAACTGG ACCTGGGACC CGGTGGCTGG CGCCTACTAT TGGCACCGCT TCTATTCCCA CCAGCCGGAC CTCAACTTCG ACAATCCGGC TGTGCTGGAC GAATTGATCA CGGTCATGCG CTTCTGGCTC GATACCGGCA TTGATGCCTT CCGGCTCGAC GCCATTCCTT ACCTTGTCGA ACGCGAGGGG ACCAATAACG AGAACCTGCC GGAAACCCAC GCGATCCTGA AGAAGATCCG CGCGGCAATG GACGAAGGTC ATCCGGGCAA GATGCTGCTG GCCGAGGCCA ATCAATGGCC TGAAGATACC CAGGAATATT TCGGTGACGG TGACGAATGC CATATGGCCT TTCACTTCCC GCTGATGCCG CGCATGTATA TGGCGATTGC CAAGGAAGAT CGTTTTCCGA TCACCGACAT CATGCGCCAG ACGCCTGAGA TCCCCGATAA TTGCCAATGG GCAATTTTCC TGCGCAATCA CGACGAACTG ACGCTGGAAA TGGTCACCGA CGCCGAGCGG GACTATTTGT GGAATACCTA TGCCGCCGAC CGCCGCGCCC GCATCAATCT CGGCATTCGC CGCCGGTTGG CGCCGCTGAT GGAGCGTGAC CGCCGCCGCA TCGAGCTGAT GAACGCGCTG CTGCTGTCGA TGCGCGGCAC GCCGGTGATC TATTACGGCG ACGAGATTGG CATGGGCGAT AATATCTATC TCGGTGACCG CGACGGGGTG CGCACGCCGA TGCAATGGTC GCCTGATCGC AATGGCGGGT TTTCCCGCGC CGACCCGGCC CGTCTGGTCC TGCCGCCGCT GATGGACCCG CTTTATGGGT ATGAGGCGGT CAATGTTGAA GCACAGTCGG CTGATGCCCA TTCGCTGCTG AACTGGAGCC GCCATATGCT GGCGCTCCGG CGTAAATTTA CTGCATTCGG ACGAGGCACA TTGCGGTTCC TGTCACCGGC CAACCGCAAG ATCATCGCCT ATCTGCGGGA ATATGAAGGC GAAGTGCTGC TGTGCGTCGC CAATTTGTCG CGCCTGCCGC AGGCGGTTGA GCTGGATCTC GCTGAATTCG AAAAGCGCAT TCCCATCGAA CTAACAGGCA TGTCGGCATT CCCCCCCATC GGCCAGCTGA CTTATCTGCT GACCCTGCCA CCCTATGGTT TCTTCTGGTT CAAGCTGTCG GGCGAGACGG ATGCGGATGG GCCAACCTGG CGCACCGAGC CACCGGAGCA ATTGCCGGAT TTCGTTACCA TCGTCATGCG GCGCGAGCTT GCCCAGTTGC TGGACGAGCC TGGTATCCAA GAGACGATTT CGCGTGAAAT CCTGCCTGCC TATCTTGCCA AGCGCCGTTG GTTTGCCTCC AAGGGCGAAA GGGTGAAGCG CGCCTCGCTG ATCTCGACCA TTCCGATGCC CTTCGGCAAT GACCTGCTGC TGGGTGAGTT GGAAACCGAG TTGGAAGATC GCGTTGAGCG GTATTTTCTA CCGTTCGCCA TCGCCTGGGA CGATGAGAAT CCCCATGCGC TTGCCCAGCA ACTGGCCTTC GCAAGAGTGC GGAAAGGACG CCGGGTTGGC TTCCTGACGG ATGGCTTTGC CATGCAAAGC ATGGCACGCG GCGTCATCAG GGCTTTGCGC GAACGCTCCA GCATTTCCAG TAAAAGCGGC TCCATCGATT TCATCGGCAC CGAACAACTC GATGGCATTG AGCTGTCCGA CGACATGCAA GTCAAATGGC TCTCGGCTGA ACAATCCAAT AGTTCGTTGA TCATCGGCGA CATGGCGATG ATCAAGCTCA TCCGTCATAT CCATCAGGGC ATCCACCCGG AAGTGGAAAT GACCCGCCAC CTGACGCGGC TTGGCTACGC CAACACTGCC CCGCTGCTGG GAGAGGTCGT CAGGGTTTCA CCTGAAGATG AGCGCTCGAC ATTGATCATC ATCCAGGGCG CCATCCGCAA TCAGGGCGAT GCCTGGACCT GGATGCTCGA CACGCTCAGG CAAACGCTCG AACAAAGCAT GGTTGCCGGT CAGGATGATG ACGCCGAGCA AGAGCGGTTC AACCCGCTGT TCAATCAAGC AGCCGTGATC GGCAAGCGGC TTGGCGAGTT GCATGTGGCA CTGGCCAAGC CAACAGACGA CCCGGCTTTC GCGCCGGTCG CAATGAGTGA TGAGGATGTC TCCGTTTGGG CGCGATCAGT CATGGCCCGC GTTGCCGACT GCCTGGACAG AGTGTCTGAG ATTGGTCATG GCAGCGACAG TGATGGTCTC GATACTGAAA CCATCAGGGT CAGCACCATG TTGGCGGACC GTCGGGAGCA GATCTTGAGC GCGGTCGGTA CGCTGGCCAG TGTTGCCAAG GACACGCTGA TGATCCGCAA CCACGGCGAT TTCCATCTGG GCCAAATCCT GGTTGCCGAA GACGATGTTT ATATCATCGA TTTCGAAGGC GAACCGGCCC GCCCCCTCGC CGAACGGCGG GCAAAGACCA ATCCGCTGCG CGACGTGGCC GGGCTGATCC GGTCACTGAG CTATCTCGCT GCTTCCGCCG ATACGGGCCG CGAGATCGTC GTCCAGGGCG ACAGTGCAGC AAGAGCCGCG CTGATCGAGG CGTTTCGCAA GAAGGCCGAG ACGCATTTCC TCGACAGCTA TTTCACGGCA GTCGACGGAG AACCATCACT GGCAATGGAC CCTGAAAAAC GCCGCGAGAT CCTCGACCTG TTCCTGCTGG AAAAGGCCGC CTATGAAATC GGCTATGAAG CCAGCAATCG GCCCGGCTGG ATCGCTATTC CGCTTGCCGG GTTTGCCGAC ATTGCCGAAC GGCTGACGGA GAATTCCCTA TGA
|
Protein sequence | MEQQHQQAET DPLWYKDAII YQLHIKSFYD GNGDGIGDFK GLTEKLDHIA SLGITAIWIL PFFPSPRRDD GYDIADYGNV SPDYGTMDDF RAFVDAAHAR DMRVIIELVI NHTSDQHPWF ERARNAPAGS PERDFYVWSE TDQKFPETRI IFLDTEKSNW TWDPVAGAYY WHRFYSHQPD LNFDNPAVLD ELITVMRFWL DTGIDAFRLD AIPYLVEREG TNNENLPETH AILKKIRAAM DEGHPGKMLL AEANQWPEDT QEYFGDGDEC HMAFHFPLMP RMYMAIAKED RFPITDIMRQ TPEIPDNCQW AIFLRNHDEL TLEMVTDAER DYLWNTYAAD RRARINLGIR RRLAPLMERD RRRIELMNAL LLSMRGTPVI YYGDEIGMGD NIYLGDRDGV RTPMQWSPDR NGGFSRADPA RLVLPPLMDP LYGYEAVNVE AQSADAHSLL NWSRHMLALR RKFTAFGRGT LRFLSPANRK IIAYLREYEG EVLLCVANLS RLPQAVELDL AEFEKRIPIE LTGMSAFPPI GQLTYLLTLP PYGFFWFKLS GETDADGPTW RTEPPEQLPD FVTIVMRREL AQLLDEPGIQ ETISREILPA YLAKRRWFAS KGERVKRASL ISTIPMPFGN DLLLGELETE LEDRVERYFL PFAIAWDDEN PHALAQQLAF ARVRKGRRVG FLTDGFAMQS MARGVIRALR ERSSISSKSG SIDFIGTEQL DGIELSDDMQ VKWLSAEQSN SSLIIGDMAM IKLIRHIHQG IHPEVEMTRH LTRLGYANTA PLLGEVVRVS PEDERSTLII IQGAIRNQGD AWTWMLDTLR QTLEQSMVAG QDDDAEQERF NPLFNQAAVI GKRLGELHVA LAKPTDDPAF APVAMSDEDV SVWARSVMAR VADCLDRVSE IGHGSDSDGL DTETIRVSTM LADRREQILS AVGTLASVAK DTLMIRNHGD FHLGQILVAE DDVYIIDFEG EPARPLAERR AKTNPLRDVA GLIRSLSYLA ASADTGREIV VQGDSAARAA LIEAFRKKAE THFLDSYFTA VDGEPSLAMD PEKRREILDL FLLEKAAYEI GYEASNRPGW IAIPLAGFAD IAERLTENSL
|
| |