Gene Rleg2_5074 details


Gene Information

Locus tag: Rleg2_5074
Symbol:
ID: 6978168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 719894
End bp: 721105
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 59%
IMG OID: 643394212
Product: Cystathionine gamma-synthase
Protein accession: YP_002279030
Protein GI: 209547112
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:
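
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and do not come from any database tooling.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
gene_length = span_length(719894, 721105)
print(gene_length)                           # 1212
print(expected_protein_length(gene_length))  # 403
```

Both results agree with the Gene Length and Protein Length fields of the record.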


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.131911
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAAG ACGTGTTGAA TGATGACGAT GGCGATCAGG CTGGTTTCGA TCTTGGATTT 
GCCACGCGTG CCATATTGGG CGGCCAGGCA ACCACGCTCG CTCCGGCAGG CACGCGTGGC
AGAACCATCC CCCTTTGCGG TCGTTTAATC GACGATCATG ATCGCATCGG ATCGTCTTCG
ACCAAACAGG GGGACATCTC CATCGCTGCA AACGCCGGGC TGGCGTCAAA CCTCGCAGGT
CTCGAGGGCG CTGAAGCGGG ATTGGTCTCG GGCTCTGGGC TCGCGGCACT CACCACCCTT
TTTCGGGCGA CGACATCCCC AGGGGATCGC GTCCTCGTGC AAAAATCTGC GTGCACGGCG
ACCACCGCCC TCATGCAGGC GACGCTTTCC AGCATGAAGG TAGAAATGGC GGTCGTAGAC
TTTGCCGCCG AACTAGAGCT GCAAAACGAC CTAAACGGCC GTACGCGGCT CGTCTATCTC
CAGACACCAA GCGATCCGTT GAGCGGCATC GTCGATATCA CCGCCGTGTG CGCACAAGCG
CACGAGCACG GACTGACCGT CGCGGTAGAC AACACGTTCG CCTCCCCCGT CCTTCAACGG
CCGATCGAAC ATGGCGCCGA TGTCGTCTTC CATTCCTTTG CAAAATACAT CAACGGTCAC
GGCGATGCGG TCGGTGGGGG CGTTTTCGGG GACCGCGATC TGATCTTGCG GATGCAAGAG
ATGGCGGCGG GCATTGGCAA TCAGACTGGT CTCAACCTCG ATGCGGCGCA TCTGATCCAG
CGCGGCCTCA AGACCCTGGC GCTTCGTATG GAAAAGCACA GCTCGTCAGC CCATGCCGTT
GCCCTGACGC TGGAATCGCA TCCGGCCGTA AACTGGGTCC GCTATCCGTT TCTTTCATCC
CACCCTTACG CGGCCACCGC AAGGCGCCAG ATGACGGGAG GCTCAGGCAT GATTGCCTTT
GGCCTCAACG CTGGCGACAT TGCAACCCGT CACGTCGTCG AAAGACTTCG TCTGTTTAGA
CCGTCTATCG CATCAGGCGA GGTAGGAAGT CTCGTCTGCA CGTCTGCAGA TCTATCTAGC
GCCCGTAACA TTTCGCTTGA AGGGTCAGAG CTATGCGAGA CGCTCGGACA GGACGTTATC
CGGTTATCCG TCGGTCTGGA AGATGCCGAG GACCTTGTCG AAGATCTCTT CGAAGCCCTC
TCTGGCCTTT GA
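
The GC content reported for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as text in the space-separated 10-base groups used here; `gc_percent` is an illustrative name, and the short input is only the first two groups of the sequence, not the whole gene.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence (not the whole gene).
print(gc_percent("ATGACCAAAG ACGTGTTGAA"))  # 40.0
```

Passing the full 1212 bp sequence to the same function gives the whole-gene figure to compare with the GC content field.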
 
Protein sequence
MTKDVLNDDD GDQAGFDLGF ATRAILGGQA TTLAPAGTRG RTIPLCGRLI DDHDRIGSSS 
TKQGDISIAA NAGLASNLAG LEGAEAGLVS GSGLAALTTL FRATTSPGDR VLVQKSACTA
TTALMQATLS SMKVEMAVVD FAAELELQND LNGRTRLVYL QTPSDPLSGI VDITAVCAQA
HEHGLTVAVD NTFASPVLQR PIEHGADVVF HSFAKYINGH GDAVGGGVFG DRDLILRMQE
MAAGIGNQTG LNLDAAHLIQ RGLKTLALRM EKHSSSAHAV ALTLESHPAV NWVRYPFLSS
HPYAATARRQ MTGGSGMIAF GLNAGDIATR HVVERLRLFR PSIASGEVGS LVCTSADLSS
ARNISLEGSE LCETLGQDVI RLSVGLEDAE DLVEDLFEAL SGL
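
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted. The following is a minimal sketch under that assumption; it does not handle alternative start codons (the gene here starts with ATG), and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACCAAAG ACGTGTTGAA TGATGACGAT"))  # MTKDVLNDDD
print(translate("TCTGGCCTTT GA"))                     # SGL
```

The two outputs match the first ten and last three residues of the protein sequence listed above.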