Gene Rleg_3952 details


Gene Information

Locus tag: Rleg_3952
Symbol:
ID: 8014767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4027618
End bp: 4029246
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 60%
IMG OID: 644826521
Product: glycoside hydrolase family 39
Protein accession: YP_002977732
Protein GI: 241206636
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3664] Beta-xylosidase
TIGRFAM ID:
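
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state outright: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(4027618, 4029246)
print(length)                     # 1629
print(protein_length_aa(length))  # 542
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.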


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTTC AAGAGGACAC CAGAATGAGC AGCCCCTATG ACGTGACAAT CTCCGTTCAC 
GCCAACCGGC CAACCGGCAA ATACCAACCT TTGTGGAACT GGTTCGGCTA CGACGAGCCG
AACTACACCT ACTCCCCGCA CGGCAGGAAG CTGCTGAAGG AACTCACCGA GCTGAGCCCG
GAGCCGCCCC GCATCCGTGC GCACAATCTT CTGACCAGCG GCGACGGCCT GCCTGCCCTC
AAATGGGGTT CGACCAATGC CTATACCGAG GATGCGAACG GCAATCCGGT CTATGATTGG
ACGATCATCG ACAAGATCTT CGACACCTAT GTCGAAGCCG GCAACATTCC GCTCATCCAG
ATCGGCTTCA CGCCGGAGAC GCTCTCCGAT TTCGACGGCC CTTATCGCCA CCAATGGGAG
CCGGCCGACA AATACGCGAC GATCACCACC GGCTGGACGG CCCCGCCGAC GGACCTGAAG
AAATGGGGCG ACCTGATCGA AGCATGGGCA CGGCATCTTG CCGAGAGATA TGGCGAGGAC
GTCGTTTCAA GCTGGCCGTG GGAAGTGTGG AACGAGCCGG ACGGCCACTA CTGGACCGGC
ACCATCCCGC AATTCTGCGA AATGTATGAT GTCAGCGCCA AGGCCCTGAA GCGTGCCTTG
CCCAATGCCC GCGTCGGCGG CCCGCACACC TGCGGCGCCT TTGCCAATGA AAAGGCCCAG
ACCTTCCTGC GCGCCTTCCT CAGGCACGTG GTCGAGAACA AGTCGCCGAT CGATTTCCTC
GCCTTCCACG CCAAGGGCAA TCCTGTGATC TACGAGGGTC ACGTGCGGAT GGGCCTGCAC
AAGCAGCTCC GCGACATCGA GACGAACCTT GCCATCATCA ATGAGTTTCC GGAACTCCGG
CACCTGCCTG TGGTGATCGG CGAATCCGAT CCGGAAGGCT GTGCGGCCTG CTCCGCACGC
GTCCATCCCC AGAACGGCTA TCGCAACGGT CCGCTTTACG GCGTCTATGT GGTCGAGAGT
ATGATCCGCA CTTACGAGCT TGCAAGACGC GCCAACATCC ACATCGAAGG CGCGGTGACC
TGGGCCTTCC TCTTCGACGA CCAGCCCTAT TTCGACGGTT TCCGCGATCT TGCCACCAAC
GGCGTCGATA AGGCTGTTCT CAACGGCTTC CGCATGCTCG GCAAACTCGG CGGCGAATGG
CTGGAATCGG AAAGCGATTA TCGCCGCGAC ATCGAAGATA TCATGAGCCA TGGCGTTCGC
GGCAAGCCCG ATGTCAACGT CGTGGCAACC CGCGACGACA AGGGCGTTTC CATTCTCGTC
TGGCACTACC ACGACGATGA TGAAGCCGGG CCGTCCGCCA ATATCACGGT CAATCTCGAC
GGCTGGGACG GCAAGTTCGC CTCGCTCAAG CATTTCAGGA TGGACGAGGA GCATTCCAAC
GCTTTCGGGG TCTGGAAGGC GATGGGCAAG CCGCAGAATC CGGCCGGCGA AGACTATGCC
AGGCTGGAAG CCTCGGGAAA ACTCGCCGAG ATCGATGGTC AGGCGTCGGT CAAGGTCGAC
GACGGCAAGA TCGAGCTCAA GGTCTCCCTA CCGCGTCAAG GCGTGTCTCT CCTGCGTCTC
GATTGGTGA
 
Protein sequence
MTVQEDTRMS SPYDVTISVH ANRPTGKYQP LWNWFGYDEP NYTYSPHGRK LLKELTELSP 
EPPRIRAHNL LTSGDGLPAL KWGSTNAYTE DANGNPVYDW TIIDKIFDTY VEAGNIPLIQ
IGFTPETLSD FDGPYRHQWE PADKYATITT GWTAPPTDLK KWGDLIEAWA RHLAERYGED
VVSSWPWEVW NEPDGHYWTG TIPQFCEMYD VSAKALKRAL PNARVGGPHT CGAFANEKAQ
TFLRAFLRHV VENKSPIDFL AFHAKGNPVI YEGHVRMGLH KQLRDIETNL AIINEFPELR
HLPVVIGESD PEGCAACSAR VHPQNGYRNG PLYGVYVVES MIRTYELARR ANIHIEGAVT
WAFLFDDQPY FDGFRDLATN GVDKAVLNGF RMLGKLGGEW LESESDYRRD IEDIMSHGVR
GKPDVNVVAT RDDKGVSILV WHYHDDDEAG PSANITVNLD GWDGKFASLK HFRMDEEHSN
AFGVWKAMGK PQNPAGEDYA RLEASGKLAE IDGQASVKVD DGKIELKVSL PRQGVSLLRL
DW
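
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation; the helper names are hypothetical and the code assumes the input contains only sequence letters and whitespace.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record above.
first_line = "ATGACCGTTC AAGAGGACAC CAGAATGAGC AGCCCCTATG ACGTGACAAT CTCCGTTCAC"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Applying `join_blocks` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.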