Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5742 |
Symbol | |
ID | 7381623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 768427 |
End bp | 770355 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643649295 |
Product | beta-galactosidase |
Protein accession | YP_002547532 |
Protein GI | 222106741 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.585873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGGCG TTTGCTATTA TCCCGAACAT TGGCCGGAGA GCCAATGGCA TATCGATGCG CAAAACATGC GCAAGCTTGG CATTTCCTAT GTGCGCATCG GCGAATTCGC CTGGTCGCGG CTGGAGGCGA GGCGGGGTCA GTTCACCTTC GAATGGCTGG ACCGGGCCAT CGATATTCTG CATGAGGCGG GCCTGAAGGT GGTGCTTGGC ACCCCGACCG CCACACCGCC GAAATGGCTG ATGGACGAGC ATCCCGACAT TGCGCCCTAT GATGCCGACG GGAATGTGCG CGGCTTCGGC TCGCGTCGCC ATTATAGTTT TTCGAGCGAA ATCTGGTGGG CCGAAAGCGC TCGAATCGTT GAAGTCATCG CCAAGCGCTA CGGCACCCAT CCGGGCATTG CTGGCTGGCA GACCGATAAC GAATACGGCT GCCATGACAC CACCGTATCT TGGGGGCCTG AAGACCTGAA GGCCTTCCGC CGCTGGTTGC GGATGCGGTA TCAGACCACC GACCAGCTCA ATGAGGCCTG GGGCTCGGTG TTCTGGTCGA TGGAGGTCAA CAGTTTTGAT GAGGTCGAGC TACCCACCCG CACCGTGACC GAGCCGAACC CGGCGCAGCG GCTGGATTTC TGGCGCTTCT CCTCCGATCA GGTGGCCGCC TATGATAAAA TGCAGGTGGA CATCATCCGC AAGCATTCAC CGGGCCGCTG GATTACCCAT AACTTCATGG GTTTCATCAA TGATTTCGAT CATTTCCAGG TGGGCGACAA TCTCGATCTC GCCAGCTGGG ATAGCTATCC GATCGGCTTC GTCGAAAAAT TCCCGTTTAC CGAAGACGAG CGCAATCGTT GGGCGGAAAC CTCCCATCCC GACATCGCGC CCTTCCACCA TGATCTCTAC CGCGGTGTTG GCCGTGGCCG GTTCTGGGTG ATGGAGCAGC AGCCCGGTCC GGTCAATTGG GCACCGTGGA ACCCGGTTCC CAAGCCCGGC ATGGTCAGGC TGTGGACCTG GGAAGCGCTG GCGCATGGGG CAGAGGTGGT CAGCTATTTC CGCTGGCGCC AGGCACCGTT TGCGCAGGAG CAGATGCATG CAGGCCTCAA TCTGCCGGGG CTTGATGAAT GGTCGGTGGG CGGGTTGGAA GCGCATGCCG TTGCTGGCGA ACTGGCAGCG CTTGGCGATC TGCCAGAGAG CGTTCAGGCA CCCGTGGCGC TGGTCTATGA CTATGCCAGC TATTGGGCGA CAACCATCCA GCCGCAGGGC CGGGATTTCC GCTACGAGGA ATTGGCGTTT CGCTGGTACG AAGCAGCCCG CCGCCTGGGG CTGGACGTGG ATTTTGTGCG CCCCGGTGCT GATCTGTCCG GTTATAAACT GGTGCTCGTT CCCTGCCTGA TGCAGGTGAA CGAGGCGGCG CTTGCGGCCT TTAAAAACAC CGATGCCGTC GTGCTGTACG GTCCGCGCAC CGGTTCACGC GACGAGAAAT TCCGTATTCC GGAAAACCTG CCGCCCGGTC CGCTGGCTCC TTTGACCGAT ACGCGCCAAA CGCAAGTCGC CTCACTGCGG CCCCGTCTTG GTGATAGCGT TTCAGGCTCA GTTTCGGGCC GGGCCATCCG CTGGCGGGAA TATCTGGAAA CCACGGCAAC CGTCATCGCC ACATTCGAAA ACGGCGATCC GGCCCTGACG AGCCAGAACG ACCACCATTA TCTGGCCTGC TGGCCCGACG AGGCTTTGCT GTCTTCAACG CTGGCGGTGC TGGCGGAAAA GGCTGGTCTC GAAACCCTTG TTTTACCAGA GCATATCCGC ATCCGCCGCC GTGGCAACCT CACATTCGCC TTCAATTATG GGGCTGAGGA CTGGACCATT CCGGTATCCG GCGAGTTTGT CTTAGGCGGA CAAGTGCTTA AGCCGCAGCA ATTGGCGGTA TGGGTTTGA
|
Protein sequence | MLGVCYYPEH WPESQWHIDA QNMRKLGISY VRIGEFAWSR LEARRGQFTF EWLDRAIDIL HEAGLKVVLG TPTATPPKWL MDEHPDIAPY DADGNVRGFG SRRHYSFSSE IWWAESARIV EVIAKRYGTH PGIAGWQTDN EYGCHDTTVS WGPEDLKAFR RWLRMRYQTT DQLNEAWGSV FWSMEVNSFD EVELPTRTVT EPNPAQRLDF WRFSSDQVAA YDKMQVDIIR KHSPGRWITH NFMGFINDFD HFQVGDNLDL ASWDSYPIGF VEKFPFTEDE RNRWAETSHP DIAPFHHDLY RGVGRGRFWV MEQQPGPVNW APWNPVPKPG MVRLWTWEAL AHGAEVVSYF RWRQAPFAQE QMHAGLNLPG LDEWSVGGLE AHAVAGELAA LGDLPESVQA PVALVYDYAS YWATTIQPQG RDFRYEELAF RWYEAARRLG LDVDFVRPGA DLSGYKLVLV PCLMQVNEAA LAAFKNTDAV VLYGPRTGSR DEKFRIPENL PPGPLAPLTD TRQTQVASLR PRLGDSVSGS VSGRAIRWRE YLETTATVIA TFENGDPALT SQNDHHYLAC WPDEALLSST LAVLAEKAGL ETLVLPEHIR IRRRGNLTFA FNYGAEDWTI PVSGEFVLGG QVLKPQQLAV WV
|
| |