Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5411 |
Symbol | |
ID | 7381511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 409732 |
End bp | 412758 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643649019 |
Product | beta-galactosidase |
Protein accession | YP_002547256 |
Protein GI | 222106465 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.111624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGATCC AAACTTTGCC TGCTAAAGCA CATCTCGATT GGCTTTCCGA TCCCAAGGTC TTTGCCGTCG GGCGGCTGCC TGCCCATTCC GACCATATCG TCTATCCCGA TGAAGCCTCC GCCAAGGCGG GTTCTGCCTC GCCGCTTCGC CAGAGCCTTG ATGGCACCTG GAAGTTCCTC TCCGCCATAT CAGCCGCTGA GCGCCCGGAA GGTTTCCACC ATCACGATTT CGACCGCAGC GGCTTCGGCG ACATCACCGT ACCCGGTGCC ATGCAATTGC AAGGCCATGG CCGCCCGCAA TATGTCAACA CCCAATATCC CTGGGACGGC CATGAGGCGC TGGCGCTTGG GCAAGCCGCC GAGGCCAACC GAGTCGGCTG CTATGCGAGG ACTTTCCAGC TTGACCCCGC CCTTGCCGGA CAGCGGATCA TCCTCACTTT CGACGGTGTC GAGACCGCCT TCTATGTCTG GCTGAATGGC CGTTTTATCG GCTATGCCGA AGACAGTTTC ACGCCCTCAC GCTTCGATAT TACCGACGCG CTGATTGACG GCGATAACCT GCTGGCCGTT GAGGTCTATC ATCGGTCTTC CGGTGCCTGG CTTGAGGATC AGGATTTCTG GCGTCTGTCG GGCATCATGC GGCCTGTCCG GTTGGAGGCC TGGCGGAGCC TGCATATCCG CGATCTATTC GTCACCACCG ATCTTGCCGA TGATTTCAAC AGTGGCTCCC TGCGCTTGCG GCTGGCCCTC GACTTGGCGG AAGAGAGCCC TGGCAGTCTT GCCGTGACGC TCTACGATCC CAAGGGTGAA ACCGTCCTAA CAACGTCCTT CGATGCCGCA GCCGAAATGG ATATTGCCCT GCCCGTCACC GCCCCGCTGC TGTGGAGCGC TGAGGCGCCG CATCTCTATC GCCTGCTGCT GGTCGTAAAG GACAGCAAGG GCGCGACGAT CGAGGCCGTG CCGCAGCCGG TCGGCTTCCG TCGGTTTGAA ATGCGCGATG GGGTGATGCG TCTCAATGGC CAGCGCATCG TTTTCCGTGG CATCAACAGG CATGATTTCC ATCCACGCCG GGGACGCGCC CTGACAGTTG AGGACATGCT CTGGGATGTG CTGTTCTTCA AGCGCAACAA TATCAATGCG GTGCGCACCA GCCATTACCC AAACCGCAGC GAGTTCTACG CGCTCTGCGA CCAGCACGGC CTCTATGTCA TCGACGAGGC CAATCTGGAA ACTCACGGCA CATGGTCGGT CGAGACGCTG GACCCGGACA AAGTCCTGCC CGGCGACCGC GACGAATGGC GACCCGCCGT GCTCGACCGT GCCGCCAATA TGCTGGAACG CGACAAGAAC CATCCTTGCG TGCTGATCTG GTCTTGCGGC AATGAGAGCT ATGGCGGCAG CGTGATTGCC GATATGGCCG ATTGGTTTAG GGACCGCGAT CCCTCGCGCC TCGTCCATTA CGAAGGCGTG TTCCACGACC GCCGCTTTGA TGCGCGCTCC AGCGACATGG AAAGCCGGAT GTATGCGCGC CCGCAGGACA TCGAGGATTT TCTGCGCAGC AATCCATCCA AGCCCTTCGT GTCCTGCGAA TATACCCATG CGATGGGCAA TTCCTGCGGC GGCATGCATC TCTATACCGA CCTCACCTAT CGTTATGATC AGTGCCAGGG CGGCTTTATC TGGGAATATA TCGAACAGGC TCTCTACGGC ACCCGCCCGG ATGGCAGCGA AGGCCTGCTA TTTGGCGGCG ATTTTGGCGA CCGGCCAACC GATTACAGCT TCTGCTGCGA CGGCATTATC ACCGCTGACC GGCAATTGAC CCCGAAAGTC CAGGAAATCA AGGCGCTTTA CCAGCCTGTC CGGCTGATCC CGGACGCGCA CGGCGTCAAA GTCATCAATG ACAACCTGTT CTTGAACCTC GACGCTTTCT ATCTGTCCTA TCACCTGCTG AAGGATGGTG TCGCGGTGGA CGAAGGGCGA GCCGATATTG CGCTTTCAGC CCAGGAACAA ACCTATCTGC GGCTGTCCAT ACCCGTCACG CAGCAGCCGG GCGAATATGC CCTGCAATGC TCCCTGCGTG AACGCCATGA CCGCGCATGG GCACCGGCTG ACCACGAAGT GGCGTTTGGC GAACACGTCT GGACGGTGGC AGGTGAAAAA CCGAACCCGG TTTCCCTCCC CCTCACCCGC GCCGAAGGCA GCTATAATCT CGGCATTTCC GATGGCCACA GCCGCACCTT GTTCTGCCGT CGCTTTGGCG GGCCGGTGTC GCTGGTCAAT GCGGCGGGCG TGGAATTCCT GGAACGGCCA CCCCTGCCGA TCTTCTGGCG CGCGCCGACC GACAATGACC GGGGCGCTGG CTTCGGCTTC AAATTCGGCA GCTGGCGGCG GGCAAGCCTC GACCAGAAAC AAGCTTCCTA TGCCTATCGT GACGGCGGGG CCGATTATCG CTTTGCCCTG CCTGATAGCA GCATGCAGGC CACGGTGTCC TACCGCTATG AGGAGGACGG GGCGATTGCC GTCACCGCCC ATTGGCCGGG CGATGCCAGC CTGCCGTCGC TACCGCTGTT CGGCCTGACG ATTCCCATGC CAGCAACCTT TAACCGCTTC CGTTATTATG GACTTGGACC GGAGGAAAAT CACATCGACC GCGCCCATGG CGCACGGCTT GGCATTTACC AGCGCGCGGT GGCCGACAAT GTCACGCCCT ATGTCATCCC GCAGGAATCC GGCAACCGAA CCGGCGTGCG CTGGGCAGAA CTGTTCGACG CTGAGGGCAA CACACTGCGC TTCGAGGCGG TGGATATGCC GTTCGAGCTG GGCGTTTCGC CTTATACGGC TTTCCAGCTG GAAGCCGCCG CCCGGCCCTA TGACCTGCCG CCAGCCAACC GTACCATTGT CACGCTGATG GCCCGGCAGA TGGGCGTCGG TGGCGACGAT AGCTGGGGCG CGCTGCCGCA TCCTCAATAC ATGATCGAGC CGGGTGAGCC TCTGACGCTC ACCTTCAGGA TGAGGGTGAT GGGGTGA
|
Protein sequence | MTIQTLPAKA HLDWLSDPKV FAVGRLPAHS DHIVYPDEAS AKAGSASPLR QSLDGTWKFL SAISAAERPE GFHHHDFDRS GFGDITVPGA MQLQGHGRPQ YVNTQYPWDG HEALALGQAA EANRVGCYAR TFQLDPALAG QRIILTFDGV ETAFYVWLNG RFIGYAEDSF TPSRFDITDA LIDGDNLLAV EVYHRSSGAW LEDQDFWRLS GIMRPVRLEA WRSLHIRDLF VTTDLADDFN SGSLRLRLAL DLAEESPGSL AVTLYDPKGE TVLTTSFDAA AEMDIALPVT APLLWSAEAP HLYRLLLVVK DSKGATIEAV PQPVGFRRFE MRDGVMRLNG QRIVFRGINR HDFHPRRGRA LTVEDMLWDV LFFKRNNINA VRTSHYPNRS EFYALCDQHG LYVIDEANLE THGTWSVETL DPDKVLPGDR DEWRPAVLDR AANMLERDKN HPCVLIWSCG NESYGGSVIA DMADWFRDRD PSRLVHYEGV FHDRRFDARS SDMESRMYAR PQDIEDFLRS NPSKPFVSCE YTHAMGNSCG GMHLYTDLTY RYDQCQGGFI WEYIEQALYG TRPDGSEGLL FGGDFGDRPT DYSFCCDGII TADRQLTPKV QEIKALYQPV RLIPDAHGVK VINDNLFLNL DAFYLSYHLL KDGVAVDEGR ADIALSAQEQ TYLRLSIPVT QQPGEYALQC SLRERHDRAW APADHEVAFG EHVWTVAGEK PNPVSLPLTR AEGSYNLGIS DGHSRTLFCR RFGGPVSLVN AAGVEFLERP PLPIFWRAPT DNDRGAGFGF KFGSWRRASL DQKQASYAYR DGGADYRFAL PDSSMQATVS YRYEEDGAIA VTAHWPGDAS LPSLPLFGLT IPMPATFNRF RYYGLGPEEN HIDRAHGARL GIYQRAVADN VTPYVIPQES GNRTGVRWAE LFDAEGNTLR FEAVDMPFEL GVSPYTAFQL EAAARPYDLP PANRTIVTLM ARQMGVGGDD SWGALPHPQY MIEPGEPLTL TFRMRVMG
|
| |