Gene Rleg_1739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1739 
Symbol 
ID8012800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1732062 
End bp1733387 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content62% 
IMG OID644824326 
Producthomoserine dehydrogenase 
Protein accessionYP_002975564 
Protein GI241204468 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.150104 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATG CCCTCAAAAT CGGCATTGCG GGCTTGGGCA CCGTTGGCGC CTCGCTTGTC 
CGCATCATTC AGCAGAAAAG CAACGAGCTT GCCGTCACCT GCGGGCGTCC GATCACCATC
ACCGCCGTTT CGGCACGTGA CAAGGCGAGG GACCGCGGCA TCGATCTTTC CGCTGTTACC
TGGTTCGACC GGCCGGAAGA GCTTGCCGAA AAGGGCGATA TCGACGTCTT CGTCGAGCTG
ATGGGCGGCG CCGAAGGGGC TGCCAACATC TCGGTGCGTA CGGCACTCCA GCGTGGTCTC
CATGTGGTGA CAGCCAACAA GGCGCTGCTT GCCTATCACG GCGTCGAGCT TGCGACGATC
GCCGAGGAGA AGGGGTCGCT GCTGAACTTC GAGGCGGCGG TCGCCGGCGG CATCCCGGTC
ATCAAGGCCC TGCGTGAATC GCTGACGGGC AATTCCGTCT CGCGCATCTA TGGCATCATG
AATGGCACCT GCAATTATAT CCTGACCAAG ATGGAGAAGG AGGGGCTTTC CTTCGCCGAA
TGCCTCAAGG AAGCGCAGCG GCTGGGTTAT GCCGAGGCCG ATCCGGCCTT CGATATCGAG
GGCAACGACA CGGCCCATAA GCTTTCCATC TTGACGACGC TCGCCTTCGG CAATCAGATC
GCCGCCGACG ACATCTATCT CGAAGGCATC ACCAACATCT CGATCGAGGA TATCCACGCC
GCTGCCGAAC TCGGTTATCG CATCAAGCTC TTGGGCGTTG CCCAGCGCAC CGATACCGGC
ATCGAACAGC GCGTCCATCC CACAATGGTG CCGGTCGATT CGGTCATCGC CCAGGTCGAC
GGCGTTACCA ATGCAGTGGC GATCGAATCC GACGTGCTCG GCGAACTGCT GATGGTCGGC
CCTGGCGCCG GCGGCAACGC CACAGCCTCG TCGGTGCTTG GTGATATCGC CGATATCGCC
AAGAGCCAGC CGGGCGCCCA GCGGGTGCCG GTGCTCGGCC ATCCCGCAAC CACACTGGAA
CCCTATCGCA AGGCGCAGAT GCAGAGCCAC GAGGGCGGCT ATTTCATCCG CCTGACCGTG
CTCGACCGCA CCGGCGTCTT TGCCAGCGTC GCAACCCGCA TGGCCGAAAA CAACATCTCG
CTGGAATCGA TCGTCCAGCG CTCCAAGCAG CACCTGGCGC CATCGCATCA CCAGACGATC
ATTCTCGTCA CCCACGCGAC GACGGAAGAC TCGGTGCGCA AGGCAGTCGC TTCGATCAAG
TCGGAAGGTT ACCTCTTCGG CGAGCCGCAG GTGATTCGCA TCGAGCGGCC CAAAGAGGAA
GGCTGA
 
Protein sequence
MADALKIGIA GLGTVGASLV RIIQQKSNEL AVTCGRPITI TAVSARDKAR DRGIDLSAVT 
WFDRPEELAE KGDIDVFVEL MGGAEGAANI SVRTALQRGL HVVTANKALL AYHGVELATI
AEEKGSLLNF EAAVAGGIPV IKALRESLTG NSVSRIYGIM NGTCNYILTK MEKEGLSFAE
CLKEAQRLGY AEADPAFDIE GNDTAHKLSI LTTLAFGNQI AADDIYLEGI TNISIEDIHA
AAELGYRIKL LGVAQRTDTG IEQRVHPTMV PVDSVIAQVD GVTNAVAIES DVLGELLMVG
PGAGGNATAS SVLGDIADIA KSQPGAQRVP VLGHPATTLE PYRKAQMQSH EGGYFIRLTV
LDRTGVFASV ATRMAENNIS LESIVQRSKQ HLAPSHHQTI ILVTHATTED SVRKAVASIK
SEGYLFGEPQ VIRIERPKEE G