Gene Rleg_5240 details

Gene Information

Locus tag: Rleg_5240
Symbol: eutB
ID: 8007414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 651138
End bp: 652139
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 65%
IMG OID: 644822148
Product: threonine dehydratase
Protein accession: YP_002973408
Protein GI: 241113573
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR02991] ectoine utilization protein EutB
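
The Start bp, End bp, Gene Length and Protein Length fields above are arithmetically related. The following is a minimal Python sketch of that relationship, assuming 1-based inclusive coordinates and a coding sequence that includes its stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(651138, 652139)
print(length_bp)                  # 1002
print(protein_length(length_bp))  # 333
```

Both results agree with the Gene Length and Protein Length fields of this record.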


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.303967
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAGCA CCTTGCCCGT TTCGCTGGAG GATATCCACG CCGCGGCCCG CCGGATTGCC 
GGCCGCGTGC TCTGTACATC GATGGTGCAG TCCGCTTCAC TTGGCGAATT GGCCGGCGCG
CCTGTCCATC TCAAGCTCGA ACATCATCAG ACGACCGGCA GTTTCAAGCT GCGCGGAGCG
ACCAACGCGG TGCTTTCTTT GTCGCCGGCG GAGCGCTCGC GCGGCTTCGT CGCGGCCTCG
ACCGGAAATC ACGGCCGTGC ACTTGCCTAT GCGGCAAAGG CGGAAGGTGC CGTCGCGACC
ATCTGCATGT CGCGGCTGGT GCCGGAGAAC AAGGTTTCGG AAATCCGCCG CCTCGGTGCC
GATGTCCGTA TCATCGGAAG GTCGCAAGAC GAAGCGCAGC AGGAGGTCGA CCGGCTGGTG
CGCGAGGAGG GGCTGGTGAT GGTCCCGCCC TTTGATGATC CTGATGTCGT GGCCGGGCAG
GGGACACTGG GGCTTGAAAT CATCGACACC TTGCCGGAGG CGGCAATCGT GCTGGTGCCG
CTCTCGGGCG GCGGCCTGGC GGCCGGCGTT GCCGCCGCGG TCAAAGGCAT CAGCTCGAAG
ACCAAAGTGA TCGGCCTGAC GATGGAGAAG GGTGCCGCGA TGAAGGCAAG CCTCGATGCT
AGACGGCCGG TGCAGGTCGA GGAGGTATCG AGCCTTGCCG ACTCGCTCGG CGGCGGCATC
GGCCTCGACA ATCGCGTGAC CTTGGCCATG TGCCGAGACC TTCTCGACGA GGTCATCCTG
CTGACGGAAG CGGAAATCGC CGCCGGCATG CGCCATGCCT ATGCCTGCGA ACGCCAAATC
GTCGAAGGCG CGGGCGCGGT CGGCATTGCA GCGCTTCTTG CCGGGAAAAT TGTGGGGAAC
GGTCCCATCG TCGCGATCCT GTCCGGGCAG AATGTCGACA TGGAACAGCA CAGGCGGGTG
ATCAATGGCA AGGCGGCACT CTGTGGGGAG GAGGGACCAT GA
 
Protein sequence
MVSTLPVSLE DIHAAARRIA GRVLCTSMVQ SASLGELAGA PVHLKLEHHQ TTGSFKLRGA 
TNAVLSLSPA ERSRGFVAAS TGNHGRALAY AAKAEGAVAT ICMSRLVPEN KVSEIRRLGA
DVRIIGRSQD EAQQEVDRLV REEGLVMVPP FDDPDVVAGQ GTLGLEIIDT LPEAAIVLVP
LSGGGLAAGV AAAVKGISSK TKVIGLTMEK GAAMKASLDA RRPVQVEEVS SLADSLGGGI
GLDNRVTLAM CRDLLDEVIL LTEAEIAAGM RHAYACERQI VEGAGAVGIA ALLAGKIVGN
GPIVAILSGQ NVDMEQHRRV INGKAALCGE EGP
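
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of that step together with a plain G+C fraction. How this record's GC content value was computed is not stated here, so the formula is an assumption, and the function names are hypothetical. The sample input is the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no argument drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases (assumed definition of GC content)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


first_line = (
    "ATGGTGAGCA CCTTGCCCGT TTCGCTGGAG GATATCCACG CCGCGGCCCG CCGGATTGCC"
)
print(len(clean_sequence(first_line)))    # 60
print(f"{gc_fraction(first_line):.1%}")   # GC share of this first line only
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the Gene Length field.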