Gene Rleg2_3403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3403 
Symbol 
ID6982157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3515223 
End bp3516596 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content61% 
IMG OID643398121 
Productbeta-galactosidase 
Protein accessionYP_002282896 
Protein GI209550979 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.968269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.135443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGATG CGAAGACGCT TGCAAGCCGC CTTCCCGGCG ATTTCACCTT CGGCGTCGCC 
ACCGCCGCCT TCCAGATCGA GGGTGCTGGT AAGGCCGACG GCCGCAAGCC ATCGATCTGG
GATGCTTTCT GCAATATGCC CGGCCGTGTC TATAATCGCG ACAATGGCGA CGTCGCCTGC
GACCACTATA ACCGGCTAGA GCAGGATCTC GATCTCATCA AGGATATGGG TGTCGAAGCC
TACCGCTTCT CGATCGCCTG GCCGCGCATC ATCCCCGACG GCACCGGTGC GGTGAACGAG
GCCGGGCTCG ATTTCTACGA TCGGCTGGTC GACGGCTGCA AGGCGCGCGG GATCAAGACC
TTTGCGACGC TCTATCACTG GGACCTGCCA CTAATGCTTG CCGGCGACGG CGGCTGGACG
GCGCGCTCGA CCGCCTATGC CTTTCAGCGC TACGCCAAGA CGGTGATGAA CCGGCTTGGC
GATCGTCTCG ATGCCGTCGC GACCTTCAAC GAGCCCTGGT GCATCGTCTG GCTGAGCCAC
CTCTACGGCA TCCACGCGCC GGGCGAGCGC AATATTCAGG CCGCCCTTCA CGCCATGCAC
TACATGAACC TCGCCCACGG TCTCGGCGTC GAGGCGATCC GTGCGGAAGC CCCTGCGGTG
CCCGTCGGGC TCGTGCTCAA CGCTGCCTCG ATCATCCCCG GTTCCGAGGG CCCGGCCGAT
CTTGCCGCCA CTGAGCGCGC GCATCAGTTT CACAACGGCG CTTTCTTCGA TCCCGTCTTC
AAGGGCGAAT ACCCCAAGGA ATTCGTTGAG GCGCTCGGCG ACCGCATGCC TGTCATCGAG
GACGGCGACA TGACGCTGAT CAGCCAGAAA CTCGACTGGT GGGGTCTGAA TTATTACACG
CCCGAGCGCG TCACTGACGA TGCCGAACGC AACGGCGATT TCCCCTGGAC GGTGAAAGCG
CCGCCGGCAA GCGACGTCAA AACCGATATC GGCTGGGAAA TCTATGCGCC GGGATTGAAG
CTGCTGGTCG AAAACCTTTA CCGCCGCTAC GAACTGCCGG AATGCTACAT CACTGAGAAC
GGCGCTTGCG ACAACACCGG TGTCGTCGAC GGCGAAGTCG ACGATACGAT GCGTCTCGAT
TATCTCGGCG ACCATCTCGA TGTCGTGGCC GGCCTTATCA AGGACGGTTA TCCCATGCGC
GGCTATTTCG CCTGGAGCCT GATGGACAAT TTCGAATGGG CAGAAGGCTA CCGCATGCGC
TTCGGCCTCG TCCATGTCGA TTATCAGACC CAGTTGCGTA CGGTGAAGAA GAGCGGCAAG
TGGTATCGCG AACTCGCAGC ACAATTCCCG AAGGGCAATC ACAAGGCGGG TTAG
 
Protein sequence
MIDAKTLASR LPGDFTFGVA TAAFQIEGAG KADGRKPSIW DAFCNMPGRV YNRDNGDVAC 
DHYNRLEQDL DLIKDMGVEA YRFSIAWPRI IPDGTGAVNE AGLDFYDRLV DGCKARGIKT
FATLYHWDLP LMLAGDGGWT ARSTAYAFQR YAKTVMNRLG DRLDAVATFN EPWCIVWLSH
LYGIHAPGER NIQAALHAMH YMNLAHGLGV EAIRAEAPAV PVGLVLNAAS IIPGSEGPAD
LAATERAHQF HNGAFFDPVF KGEYPKEFVE ALGDRMPVIE DGDMTLISQK LDWWGLNYYT
PERVTDDAER NGDFPWTVKA PPASDVKTDI GWEIYAPGLK LLVENLYRRY ELPECYITEN
GACDNTGVVD GEVDDTMRLD YLGDHLDVVA GLIKDGYPMR GYFAWSLMDN FEWAEGYRMR
FGLVHVDYQT QLRTVKKSGK WYRELAAQFP KGNHKAG