Gene Rleg_3701 details


Gene Information

Locus tag: Rleg_3701
Symbol:
ID: 8014539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3749922
End bp: 3751295
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 61%
IMG OID: 644826264
Product: beta-galactosidase
Protein accession: YP_002977483
Protein GI: 241206387
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
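
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted as a residue. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3749922
end_bp = 3751295

# Assumption: coordinates are 1-based and inclusive,
# so the gene length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1374 457
```

Both results agree with the Gene Length and Protein Length fields.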


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATC CGAAGAAACT CGCGGAGCGC TTTCCCGGCG ACTTCATTTT CGGCGTTGCC 
ACCGCTGCCT TCCAGATCGA AGGCGCCAGC AAAGCTGATG GCCGCAAGCC ATCCATCTGG
GATGCTTTCT GCAATATGCC TGGCCGCGTC CATAATCGCG ATAACGGCGA TGTTGCCTGC
GATCACTACA ATCGCCTGGA ACAGGACCTC GATCTCATCA AGGAGATGGG TGTCGAAGCC
TACCGGTTCT CGATTGCCTG GCCGCGCATT ATCCCGGACG GCACGGGTCC GGTGAACGAG
GCCGGCCTCG ATTTCTACGA CCGGCTGGTC GACGGCTGCA AGGCGCGCGG TATCAAGACC
TTCGCGACGC TCTACCACTG GGATCTGCCT CTGTTGCTCG CCGGCGAGGG CGGGTGGACC
GCGCGCTCGA CCGCCTATGC GTTTCAGCGC TATGCCAAGA CGGTGATGAA CCGCCTTGGC
GATCGCCTCG ACCGGGTCGC GACCTTCAAC GAACCCTGGT GCATCGTCTG GCTCAGCCAC
CTCTACGGCA TCCATGCTCC GGGCGAGCGC AACATGCAGG CTGCCCTTCA CGCCATGCAT
TACATGAACC TTGCCCACGG CCTCGGCGTC GAGGCGATCC GCTCGGAAGC GCCCAATGTG
CCGGTCGGCC TCGTGCTCAA TGCCGCCTCG ATCATTCCCG GCTCCGACAG CCCGGCCGAC
CTTGCCGCGG GCGAGCGGGC GCATCAGTTC CACAACGGCG CTTTCTTCGA TCCCGTCTTC
AAGGGTGAAT ATCCGAAGGA ATTCGTCGCG GCGCTCGGCG ACCGCATGCC TGTCATCGAG
GATGGCGACC TGAAGCTCAT CAGCCAGAAA CTCGATTGGT GGGGCCTGAA CTATTATAAG
CCCGAACGCG TCACCGACGA TGCCGAACGC AAAGGCGATT TCCCCTGGAC GGTGGAGGCG
CCGCCGGCAA GCGACGTCAA GACCGATATC GGCTGGGAAA TCTATGCGCC GGGCCTGAAG
CTCTCGATCG AGGATCTCTA CCGCCGCTAC GAACTGCCGG AATGCTACAT CACCGAGAAC
GGCGCCTGCG ACAACACCGA TGTCATCGAC GGCGAGGTCG ACGATACGAT GCGCCTCGAC
TATGTCGGAG ATCACCTCGA AATCGTTGCC GGCCTCATCA AGGACGGCTA TCCCCTACGC
GGCTATTTCG CCTGGAGCCT GATGGATAAT TTCGAATGGG CGGAAGGCTA CCGCATGCGC
TTCGGCCTGG TCCATGTCGA TTATGAAACC CAGCTGCGCA CGGTGAAGAA GAGCGGCAAG
TGGTATCGCC AACTCGCGGC GCAATTCCCG AAGGGCAATC ACAAGGCGGT TTAG
 
Protein sequence
MIDPKKLAER FPGDFIFGVA TAAFQIEGAS KADGRKPSIW DAFCNMPGRV HNRDNGDVAC 
DHYNRLEQDL DLIKEMGVEA YRFSIAWPRI IPDGTGPVNE AGLDFYDRLV DGCKARGIKT
FATLYHWDLP LLLAGEGGWT ARSTAYAFQR YAKTVMNRLG DRLDRVATFN EPWCIVWLSH
LYGIHAPGER NMQAALHAMH YMNLAHGLGV EAIRSEAPNV PVGLVLNAAS IIPGSDSPAD
LAAGERAHQF HNGAFFDPVF KGEYPKEFVA ALGDRMPVIE DGDLKLISQK LDWWGLNYYK
PERVTDDAER KGDFPWTVEA PPASDVKTDI GWEIYAPGLK LSIEDLYRRY ELPECYITEN
GACDNTDVID GEVDDTMRLD YVGDHLEIVA GLIKDGYPLR GYFAWSLMDN FEWAEGYRMR
FGLVHVDYET QLRTVKKSGK WYRQLAAQFP KGNHKAV
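
The relationship between the two sequences can be illustrated by translating a block of the gene sequence and comparing it with the protein sequence. The following is a minimal sketch, assuming the standard codon assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs in which start codons it permits). The `translate` helper and the table construction are illustrative, not a tool provided by the source database.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G;
# third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First and last lines of the gene sequence as listed in the record.
first_line = "ATGATCGATC CGAAGAAACT CGCGGAGCGC TTTCCCGGCG ACTTCATTTT CGGCGTTGCC"
last_line = "TGGTATCGCC AACTCGCGGC GCAATTCCCG AAGGGCAATC ACAAGGCGGT TTAG"

print(translate(first_line))  # MIDPKKLAERFPGDFIFGVA
print(translate(last_line))   # WYRQLAAQFPKGNHKAV*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 17 residues followed by the stop codon.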