Gene Rleg2_3954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3954 
Symbol 
ID6982718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4102276 
End bp4103496 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID643398677 
Producttryptophan synthase subunit beta 
Protein accessionYP_002283442 
Protein GI209551525 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases
[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.291144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGAAA CGCCTAAACC GAATTCCTTC CGTTCCGGGC CTGATGAAGA CGGCCGCTTC 
GGCATCTATG GCGGTCGTTT CGTTGCCGAA ACGTTGATGC CTTTGATTCT CGATCTGCAG
GACGAGTGGA ACAAGGCGAA GAACGATCCG GCATTCCAGG CCGAATTGAA GCATCTCGGC
GCCCATTATA TCGGCCGCCC GAGCCCGCTC TATTTCGCCG AGCGGCTGAC GGCCGAACTC
GGCGGCGCGA AGATCTATTT CAAGCGCGAG GAGCTGAACC ACACTGGCTC GCACAAGATC
AACAACTGCA TCGGCCAGAT CCTGCTCGCC AAGCGCATGG GCAAGACCCG GATCATCGCC
GAAACCGGCG CCGGCCAGCA TGGTGTTGCC TCGGCTACCG TTGCCGCCCG CTTCGGCCTT
CCCTGCGTCG TCTATATGGG CGCCACCGAC GTCGAGCGTC AGGCGCCGAA CGTTTTCCGC
ATGAAGCTGC TCGGCGCGGA AGTGAAGCCG GTGACTGCCG GCAGCGGCAC GCTGAAGGAC
GCGATGAACG AGGCGCTTCG CGACTGGGTC ACCAATGTCG AAGATACCTA CTACCTGATC
GGAACGGCGG CCGGTCCGCA TCCCTATCCG GAGATGGTCC GCGATTTCCA GTCGGTGATC
GGCACCGAAG CGAAAGAGCA GATGCTGGCA GCCGAAGGCC GCCTGCCGGA TCTCGTCATC
GCTGCCGTCG GCGGCGGTTC GAACGCGATC GGCATTTTTC ATCCTTTCCT CGATGATCCC
ACCGTGAAGA TTGTCGGCGT CGAAGCAGGC GGCAAGGGCC TGCAGGGCGA CGAGCATTGC
GCCTCGATCA CCGCCGGTTC GCCGGGTGTG CTGCACGGCA ACCGCACCTA TCTGCTGCAG
GATGGCGATG GCCAGATCAA GGAAGGCCAT TCGATTTCGG CCGGTCTCGA TTATCCCGGC
ATCGGCCCCG AGCATTCCTG GCTGAACGAT ACCGGCCGCG TCGACTATGT TCCGATCATG
GATCACGAAG CGCTCGAGGC GTTCCAGACG CTGACCCGCC TCGAAGGCAT CATCCCGGCG
CTCGAGCCCT CGCACGCGAT TGCCGAGGTG ATCAAGCGCG CACCGAAGAT GGGCAAGGAC
GAGATCATCC TGATGAACCT CTCCGGCCGC GGCGACAAGG ATATCTTCAC CGTCGGCAAG
ATTCTGGGAA TGGGACTGTA A
 
Protein sequence
MNETPKPNSF RSGPDEDGRF GIYGGRFVAE TLMPLILDLQ DEWNKAKNDP AFQAELKHLG 
AHYIGRPSPL YFAERLTAEL GGAKIYFKRE ELNHTGSHKI NNCIGQILLA KRMGKTRIIA
ETGAGQHGVA SATVAARFGL PCVVYMGATD VERQAPNVFR MKLLGAEVKP VTAGSGTLKD
AMNEALRDWV TNVEDTYYLI GTAAGPHPYP EMVRDFQSVI GTEAKEQMLA AEGRLPDLVI
AAVGGGSNAI GIFHPFLDDP TVKIVGVEAG GKGLQGDEHC ASITAGSPGV LHGNRTYLLQ
DGDGQIKEGH SISAGLDYPG IGPEHSWLND TGRVDYVPIM DHEALEAFQT LTRLEGIIPA
LEPSHAIAEV IKRAPKMGKD EIILMNLSGR GDKDIFTVGK ILGMGL