Gene Avi_5742 details

Gene Information

Locus tag: Avi_5742
Symbol:
ID: 7381623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011988
Strand:
Start bp: 768427
End bp: 770355
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 60%
IMG OID: 643649295
Product: beta-galactosidase
Protein accession: YP_002547532
Protein GI: 222106741
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
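
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and do not come from the database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 768427
end_bp = 770355


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1929 642
```

Both values agree with the Gene Length and Protein Length fields of the record.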


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.585873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGGCG TTTGCTATTA TCCCGAACAT TGGCCGGAGA GCCAATGGCA TATCGATGCG 
CAAAACATGC GCAAGCTTGG CATTTCCTAT GTGCGCATCG GCGAATTCGC CTGGTCGCGG
CTGGAGGCGA GGCGGGGTCA GTTCACCTTC GAATGGCTGG ACCGGGCCAT CGATATTCTG
CATGAGGCGG GCCTGAAGGT GGTGCTTGGC ACCCCGACCG CCACACCGCC GAAATGGCTG
ATGGACGAGC ATCCCGACAT TGCGCCCTAT GATGCCGACG GGAATGTGCG CGGCTTCGGC
TCGCGTCGCC ATTATAGTTT TTCGAGCGAA ATCTGGTGGG CCGAAAGCGC TCGAATCGTT
GAAGTCATCG CCAAGCGCTA CGGCACCCAT CCGGGCATTG CTGGCTGGCA GACCGATAAC
GAATACGGCT GCCATGACAC CACCGTATCT TGGGGGCCTG AAGACCTGAA GGCCTTCCGC
CGCTGGTTGC GGATGCGGTA TCAGACCACC GACCAGCTCA ATGAGGCCTG GGGCTCGGTG
TTCTGGTCGA TGGAGGTCAA CAGTTTTGAT GAGGTCGAGC TACCCACCCG CACCGTGACC
GAGCCGAACC CGGCGCAGCG GCTGGATTTC TGGCGCTTCT CCTCCGATCA GGTGGCCGCC
TATGATAAAA TGCAGGTGGA CATCATCCGC AAGCATTCAC CGGGCCGCTG GATTACCCAT
AACTTCATGG GTTTCATCAA TGATTTCGAT CATTTCCAGG TGGGCGACAA TCTCGATCTC
GCCAGCTGGG ATAGCTATCC GATCGGCTTC GTCGAAAAAT TCCCGTTTAC CGAAGACGAG
CGCAATCGTT GGGCGGAAAC CTCCCATCCC GACATCGCGC CCTTCCACCA TGATCTCTAC
CGCGGTGTTG GCCGTGGCCG GTTCTGGGTG ATGGAGCAGC AGCCCGGTCC GGTCAATTGG
GCACCGTGGA ACCCGGTTCC CAAGCCCGGC ATGGTCAGGC TGTGGACCTG GGAAGCGCTG
GCGCATGGGG CAGAGGTGGT CAGCTATTTC CGCTGGCGCC AGGCACCGTT TGCGCAGGAG
CAGATGCATG CAGGCCTCAA TCTGCCGGGG CTTGATGAAT GGTCGGTGGG CGGGTTGGAA
GCGCATGCCG TTGCTGGCGA ACTGGCAGCG CTTGGCGATC TGCCAGAGAG CGTTCAGGCA
CCCGTGGCGC TGGTCTATGA CTATGCCAGC TATTGGGCGA CAACCATCCA GCCGCAGGGC
CGGGATTTCC GCTACGAGGA ATTGGCGTTT CGCTGGTACG AAGCAGCCCG CCGCCTGGGG
CTGGACGTGG ATTTTGTGCG CCCCGGTGCT GATCTGTCCG GTTATAAACT GGTGCTCGTT
CCCTGCCTGA TGCAGGTGAA CGAGGCGGCG CTTGCGGCCT TTAAAAACAC CGATGCCGTC
GTGCTGTACG GTCCGCGCAC CGGTTCACGC GACGAGAAAT TCCGTATTCC GGAAAACCTG
CCGCCCGGTC CGCTGGCTCC TTTGACCGAT ACGCGCCAAA CGCAAGTCGC CTCACTGCGG
CCCCGTCTTG GTGATAGCGT TTCAGGCTCA GTTTCGGGCC GGGCCATCCG CTGGCGGGAA
TATCTGGAAA CCACGGCAAC CGTCATCGCC ACATTCGAAA ACGGCGATCC GGCCCTGACG
AGCCAGAACG ACCACCATTA TCTGGCCTGC TGGCCCGACG AGGCTTTGCT GTCTTCAACG
CTGGCGGTGC TGGCGGAAAA GGCTGGTCTC GAAACCCTTG TTTTACCAGA GCATATCCGC
ATCCGCCGCC GTGGCAACCT CACATTCGCC TTCAATTATG GGGCTGAGGA CTGGACCATT
CCGGTATCCG GCGAGTTTGT CTTAGGCGGA CAAGTGCTTA AGCCGCAGCA ATTGGCGGTA
TGGGTTTGA
 
Protein sequence
MLGVCYYPEH WPESQWHIDA QNMRKLGISY VRIGEFAWSR LEARRGQFTF EWLDRAIDIL 
HEAGLKVVLG TPTATPPKWL MDEHPDIAPY DADGNVRGFG SRRHYSFSSE IWWAESARIV
EVIAKRYGTH PGIAGWQTDN EYGCHDTTVS WGPEDLKAFR RWLRMRYQTT DQLNEAWGSV
FWSMEVNSFD EVELPTRTVT EPNPAQRLDF WRFSSDQVAA YDKMQVDIIR KHSPGRWITH
NFMGFINDFD HFQVGDNLDL ASWDSYPIGF VEKFPFTEDE RNRWAETSHP DIAPFHHDLY
RGVGRGRFWV MEQQPGPVNW APWNPVPKPG MVRLWTWEAL AHGAEVVSYF RWRQAPFAQE
QMHAGLNLPG LDEWSVGGLE AHAVAGELAA LGDLPESVQA PVALVYDYAS YWATTIQPQG
RDFRYEELAF RWYEAARRLG LDVDFVRPGA DLSGYKLVLV PCLMQVNEAA LAAFKNTDAV
VLYGPRTGSR DEKFRIPENL PPGPLAPLTD TRQTQVASLR PRLGDSVSGS VSGRAIRWRE
YLETTATVIA TFENGDPALT SQNDHHYLAC WPDEALLSST LAVLAEKAGL ETLVLPEHIR
IRRRGNLTFA FNYGAEDWTI PVSGEFVLGG QVLKPQQLAV WV
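
The protein sequence above is the translation of the gene sequence, and the GC content field is a property of the same nucleotide string. The sketch below shows one way to reproduce both from the text of this record. It is an illustration and not the database's own pipeline. It assumes the sequence is pasted in with its spaces and line breaks, and it uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 additionally allows alternative start codons; these are not handled here, which is harmless for this gene because it starts with ATG. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCTGGGCG TTTGCTATTA TCCCGAACAT"))  # MLGVCYYPEH
print(translate("TGGGTTTGA"))  # WV
```

The two fragments reproduce the first ten and last two residues of the protein sequence shown above. Passing the complete gene sequence to `gc_fraction` gives the value to compare against the GC content field.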