Gene Rleg2_5063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5063 
Symbol 
ID6978157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp710868 
End bp711929 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content61% 
IMG OID643394201 
Producthomoserine dehydrogenase 
Protein accessionYP_002279019 
Protein GI209547101 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTCT ACAATATCGC ACTGATCGGC TTCGGCGGCG TCAACCGTGC GCTTGCCGAA 
TTAATTGCTT CGAAAAACCC GCTCTGGGAA CGTGACCTCG GCTTCCGTCT GAACATCGTT
GCCGTAAGTG ACCTTTACCT CGGCTCTGTC ATTTCACCGA ACGGCCTGGA CGCAACAACG
CTTGTTGAAT CCAAATTCGC CAAGGGCGGC TTCAGCCAAC TCTCTGGTGG AAGTGCCGAG
GCCAACAACG AAGTCGTCAT CAAGAATGCT CCGGCAGACA TCATCGTCGA AGCTACCTTC
ACCAATCCGA AAGACGGCGA GCCCGCAGTC TCCCACTGCC GCTGGGCTCT CGAGGGCGGC
AAGCACGTCG TGACGACCAA TAAGGGTCCG GTGGCGATCG CCGCGCAGGA GCTCAAGGCT
CTTGCGAAGA AGAATGGCGT TCGCTTCGAA TATGAAGGCT CCGTCATGAG CGGAACCCCG
GTTATCCGAA TGGTGGACAA GACGCTGGCG GGTGCGGAGC TGAATGGCTT CGAAGGCATC
CTCAATGGGA CGTCGAACTT CGTCCTCGGC CGGATGGAAA CGGGCATGGA CTTCTCCGCT
GCAGTGAAGG AAGCTCAGGA GCTCGGCTAT GCCGAAGCGG ACCCCACAGC CGACGTCGAG
GGGTTCGATG TGCGGCTCAA GGTCGTCATC CTCGCCAACG AGCTGCTCGG GGCGAACCTC
ACGCCGGACG ACGTCGCGCG CAAGGGCATC TCTGGCCTGA CCGCCGCCGA TATCGACACC
GCCAAGAAGG CCGGCAGCCG CTGGAAGCTC ATCGGCTCCG CCATTCGTAA CGCCGATGGC
TCCGTCACTG GCAGCGTCGA GCCCAAGTGC CTTCCGCTGG AGCACCCGCT TGCAGCAGTG
AGTGGCGCGA CCAATGCTGT GTCTCTGAAT ACCGAACTCC TCGGCTCCGT GACCGTCACT
GGTCCAGGCG CCGGCCGTAT CGAGACGGCA TACGCACTTC TCTCCGATAT AGTCGCCATC
CACAACCTCG CCGGCGCGAA CCTCAAGAAG GAGGCTGCAT GA
 
Protein sequence
MTVYNIALIG FGGVNRALAE LIASKNPLWE RDLGFRLNIV AVSDLYLGSV ISPNGLDATT 
LVESKFAKGG FSQLSGGSAE ANNEVVIKNA PADIIVEATF TNPKDGEPAV SHCRWALEGG
KHVVTTNKGP VAIAAQELKA LAKKNGVRFE YEGSVMSGTP VIRMVDKTLA GAELNGFEGI
LNGTSNFVLG RMETGMDFSA AVKEAQELGY AEADPTADVE GFDVRLKVVI LANELLGANL
TPDDVARKGI SGLTAADIDT AKKAGSRWKL IGSAIRNADG SVTGSVEPKC LPLEHPLAAV
SGATNAVSLN TELLGSVTVT GPGAGRIETA YALLSDIVAI HNLAGANLKK EAA