Gene Rleg2_1545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1545 
Symbol 
ID6980276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1569994 
End bp1571319 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content62% 
IMG OID643396265 
Producthomoserine dehydrogenase 
Protein accessionYP_002281061 
Protein GI209549144 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.927693 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATG CCCTCAAAAT CGGCATTGCG GGCTTGGGCA CCGTTGGCGC CTCGCTAGTC 
CGCATCATTC AGCAGAAGAG CAACGAGCTT GCCGTCACCT GCGGGCGTCC GATCACCATC
ACGGCGGTCT CCGCGCGTGA CAAAACGAGA GACCGCGGTA TCGATCTTTC CGCTGTCACC
TGGTTCGATC GGCCGGAAGA TCTTGCCGAA AAGGGCGATA TCGACGTCTT CGTCGAGCTG
ATGGGCGGCG CCGAAGGAGC TGCCAACACC TCCGTACGCG CCGCACTTAA GCGTGGTCTC
CACGTGGTGA CAGCCAACAA GGCGCTGCTT GCCTATCACG GCGTCGAGCT TGCGACGATT
GCCGAGGAGA AGGGCGCGCT TCTAAACTTC GAGGCCGCGG TGGCCGGCGG CATCCCGGTC
ATCAAGGCGC TGCGCGAATC GCTGACCGGC AATGCCGTCT CGCGCATCTA TGGCATCATG
AACGGCACCT GCAATTACAT CCTGACCAAG ATGGAAAAGG AGGGGCTTTC CTTCGCCGAG
TGCCTGAAGG AAGCCCAGCG GCTGGGTTAT GCCGAGGCCG ATCCGGCCTT CGACATCGAG
GGCAACGACA CCGCCCATAA GCTTTCCATC CTGACGACGC TCGCCTTCGG CAATCGCATC
GCGGCCGACG ATATCTATCT CGAAGGCATC ACCAACATCT CGATCGAGGA TATCCACGCC
GCCGCCGAGC TCGGTTATCG TATCAAGCTC CTGGGCGTTG CCCAGCGCAC CGACACCGGC
ATCGAGCAGC GCGTGCATCC GACCATGGTG CCGGTCGATT CGGTCATTGC CCAGGTCGAC
GGCGTTACCA ATGCGGTGGC GATCGAATCC GACGTGCTCG GCGAACTGCT GATGGTCGGT
CCCGGCGCCG GCGGCAATTC GACGGCCTCG TCCGTACTGG GCGATATCGC CGATATCGCC
AAAAGCCAGC CGGGCGCACA ACGCGTGCCG GTGCTCGGCC ATCCCGCAAA AGCGCTGGAA
CCCTACCGCA AGGCGCAGAT GCAGAGCCAC GAGGGCGGCT ACTTTATCCG CCTGACCGTG
CTCGACCGCA CGGGCGTCTT TGCCAGCGTT GCAACCCGCA TGGCGGAAAA CAACATCTCG
TTGGAATCGA TCGTCCAGCG CTCCAAGCAA CATCTGGCGC CGTCGCACCA CCAGACGATC
ATTCTCGTCA CCCATGCGAC GATGGAAGAG TCGGTGCGCA AGGCGGTCGC CTCGATCAAG
TCGGAAGGCT ATCTCTTCGG CGAACCGCAG GTGATTCGTA TCGAGCGGCC GAAAGAAGAC
GCTTAA
 
Protein sequence
MADALKIGIA GLGTVGASLV RIIQQKSNEL AVTCGRPITI TAVSARDKTR DRGIDLSAVT 
WFDRPEDLAE KGDIDVFVEL MGGAEGAANT SVRAALKRGL HVVTANKALL AYHGVELATI
AEEKGALLNF EAAVAGGIPV IKALRESLTG NAVSRIYGIM NGTCNYILTK MEKEGLSFAE
CLKEAQRLGY AEADPAFDIE GNDTAHKLSI LTTLAFGNRI AADDIYLEGI TNISIEDIHA
AAELGYRIKL LGVAQRTDTG IEQRVHPTMV PVDSVIAQVD GVTNAVAIES DVLGELLMVG
PGAGGNSTAS SVLGDIADIA KSQPGAQRVP VLGHPAKALE PYRKAQMQSH EGGYFIRLTV
LDRTGVFASV ATRMAENNIS LESIVQRSKQ HLAPSHHQTI ILVTHATMEE SVRKAVASIK
SEGYLFGEPQ VIRIERPKED A