Gene Rleg2_5519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5519 
SymboleutB 
ID6978613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1167636 
End bp1168637 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content66% 
IMG OID643394618 
Productthreonine dehydratase 
Protein accessionYP_002279436 
Protein GI209547518 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR02991] ectoine utilization protein EutB 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0114285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAGCA CCTTGCCCGT TTCGCTGGAG GATATTCGCG CGGCGGCGCG GCGGATCGCC 
GGCCGGATTG TCGAAACACC GATGGTACAG GCAGCATCGC TTTGCGACAT TGCCGGCGTT
CCCGTCTGGC TGAAGCTCGA ACATCATCAG ACGACCGGCA GCTTCAAGCT GCGCGGGGCG
ACCAATGCGG TACTCTCCTT ATCGCCGGCG GAACGCTCAC GCGGCGTCGT CGCCGCCTCG
ACCGGAAATC ACGGCCGGGC GCTTGCCTAT GCGGCGAAGG CTGAAGGTGC CGTCGCAACT
ATCTGCATGT CGCGCCTGGT GCCAGAGAAC AAGATCTCGG AGATCCGCCG CCTCGGTGCC
GAAATCCGCA TCGTCGGATC GTCGCAGGAC GAGGCGCAGC TAGAGGTCGA CCGGCTGGTC
GGCGAAGAAG GGGTGGTCAT GGTCCCGCCC TTCGATCATC CGGCTGTCGT AGCCGGGCAG
GGGACGCTGG GGCTCGAGAT TCTCGACGCT TTGCCGGAAG CGGCCACGGT TCTGGTGCCT
CTCTCCGGCG GGGGTCTTGC GGCGGGCGTT GCCGCTGCGA TCAAGGGCGT CAATCCGAAG
ACGAAGGTGA TCGGCCTGAC GATGGAACGG GGCGCGGCGA TGAAGGCGAG CCTCAATGCC
GGCCGGCCGG TGCAGGTCGA GGAAAGGCCG AGCCTTGCAG ACTCGCTCGG CGGCGGCATC
GGCCTCGACA ATCGCGTGAC CTTCGCCATG TGCCGCGCCC TTCTCGACGA CGTCATCCTG
CTGACGGAGG CGGAAATCGC CGCAGGTATG CGCCACGCCT ATGCCTGCGA GCGGGAGATC
GTCGAAGGTG CGGGCGCCGT CGGTATCGCG GCGCTGCTTG CGGGAAAGAT CCGCTCCGGC
GGTCCCGTCG TTGCGATCCT GTCGGGCCGA AATGTCGACA TGGAACAGCA CCGCCGGTTG
ATCAACGGCG AGGCGGCGAT GTTCGGGGAG GATGGGCGAT GA
 
Protein sequence
MVSTLPVSLE DIRAAARRIA GRIVETPMVQ AASLCDIAGV PVWLKLEHHQ TTGSFKLRGA 
TNAVLSLSPA ERSRGVVAAS TGNHGRALAY AAKAEGAVAT ICMSRLVPEN KISEIRRLGA
EIRIVGSSQD EAQLEVDRLV GEEGVVMVPP FDHPAVVAGQ GTLGLEILDA LPEAATVLVP
LSGGGLAAGV AAAIKGVNPK TKVIGLTMER GAAMKASLNA GRPVQVEERP SLADSLGGGI
GLDNRVTFAM CRALLDDVIL LTEAEIAAGM RHAYACEREI VEGAGAVGIA ALLAGKIRSG
GPVVAILSGR NVDMEQHRRL INGEAAMFGE DGR