Gene Rleg_0652 details


Gene Information

Locus tag: Rleg_0652
Symbol:
ID: 8011830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 687497
End bp: 688480
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 63%
IMG OID: 644823242
Product: homoserine kinase
Protein accession: YP_002974495
Protein GI: 241203399
COG category: [R] General function prediction only
COG ID: [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID: [TIGR00938] homoserine kinase, Neisseria type
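
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 687497
END_BP = 688480
GENE_LENGTH_BP = 984
PROTEIN_LENGTH_AA = 327


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced CDS whose length is a multiple of 3.
    """
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


print(span_length(START_BP, END_BP))            # 984
print(expected_protein_length(GENE_LENGTH_BP))  # 327
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.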


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.252204
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.14086
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCGA GACCTCACTT GGCAGTCTAT ACCGATATCG CCGAAGACGA TCTGAAATGG 
TTCCTGACGG AATATGACGC GGGCACGCTG CTCTCCTACA AGGGCATTGC CGAAGGCGTC
GAAAACTCCA ACTTCCTGCT TCACACCTCC AGGGATCCGC TGATCCTGAC GCTCTATGAG
AAGCGGGTGG AAAAGAGCGA CCTGCCTTTC TTCCTCGGTT TCATGCAGCA TCTTTCCGCC
CGCGGCCTGT CCTGCCCGCT GCCGCTGCCG CGCCGCGATG GCGCGCTGCT CGGCTCACTG
TCCGGCCGTC CGGCGGCGCT GATCTCCTTC CTCGAAGGCA TGTGGCTGAG AAAGCCGGAG
GCAAAACACT GCCGCGAAGT CGGCAAGGCG CTGGCCGAGA TGCATGTGGC CGGCGATGGT
TTCGAGTTGA AGCGGGCGAA TGCGCTGTCG ATCGACGGCT GGCGGGGGCT GTGGGAGAAA
TCCGAAGCGC GCGCCGGCGA GGTTGAGTCC GGCCTGCAGA CCGAGATCCG CAGCGAACTC
GATTTCCTCT CCGCCGCCTG GCCGAGCGGC CTGCCGGCCG GCGTCATCCA CGCCGACCTC
TTCCCCGACA ACGTCTTCTT CCTCGGTGAC CAGCTCTCCG GCCTGATCGA TTTCTATTTC
GCCTGCAACG ACCTGCTCGC CTATGACGTC TCGATCTGCC TGAATGCCTG GTGCTTCGAG
AAGGACGGCG CCTATAACAT CACCAAGGGC ACGGCGATGC TCGAGGGTTA CCAGAGCGTC
AGGCCGCTGA GCGAGGCCGA AATCGCAGCC CTGCCGGTGC TGTCGCGCGG GTCTGCGCTG
CGCTTCTTCC TGACCCGGCT CTATGACTGG CTGACGACGC CGGAGGGCGC CATGGTCACC
AAAAAGGATC CGCTCGAATA TCTCCGCAAG CTGCGCTTCC ACCGCCAGAT CAAATCGCCC
GCCGAATACG GATTGAGCCT ATGA
 
Protein sequence
MKARPHLAVY TDIAEDDLKW FLTEYDAGTL LSYKGIAEGV ENSNFLLHTS RDPLILTLYE 
KRVEKSDLPF FLGFMQHLSA RGLSCPLPLP RRDGALLGSL SGRPAALISF LEGMWLRKPE
AKHCREVGKA LAEMHVAGDG FELKRANALS IDGWRGLWEK SEARAGEVES GLQTEIRSEL
DFLSAAWPSG LPAGVIHADL FPDNVFFLGD QLSGLIDFYF ACNDLLAYDV SICLNAWCFE
KDGAYNITKG TAMLEGYQSV RPLSEAEIAA LPVLSRGSAL RFFLTRLYDW LTTPEGAMVT
KKDPLEYLRK LRFHRQIKSP AEYGLSL
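
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is a minimal example, not the pipeline that produced this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard NCBI ordering. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the set of permitted start codons; since this gene starts with ATG, no special start-codon handling is included here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate in frame 1; optionally stop at the first stop codon ('*')."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 24 bases of the gene sequence, copied from the record.
# The last block starts at position 961, so it is in frame.
first_block = "ATGAAGGCGA GACCTCACTT GGCAGTCTAT"
last_block = "GCCGAATACG GATTGAGCCT ATGA"

print(translate(first_block))    # MKARPHLAVY
print(translate(last_block))     # AEYGLSL
print(gc_fraction(first_block))  # 0.5 (this 30-base fragment only)
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the whole protein and the GC content field to be checked.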