Gene Information |
Locus tag | Rleg_0652 |
Symbol | |
ID | 8011830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 687497 |
End bp | 688480 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823242 |
Product | homoserine kinase |
Protein accession | YP_002974495 |
Protein GI | 241203399 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR00938] homoserine kinase, Neisseria type |
|
|
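The coordinate and length fields above are linked by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that adds no residue to the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 687497
end_bp = 688480
protein_length_aa = 327

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop
# codon and contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)  # 984 0 327
```

Under these assumptions the computed values agree with the listed Gene Length (984 bp) and Protein Length (327 aa).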
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.252204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.14086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCGA GACCTCACTT GGCAGTCTAT ACCGATATCG CCGAAGACGA TCTGAAATGG TTCCTGACGG AATATGACGC GGGCACGCTG CTCTCCTACA AGGGCATTGC CGAAGGCGTC GAAAACTCCA ACTTCCTGCT TCACACCTCC AGGGATCCGC TGATCCTGAC GCTCTATGAG AAGCGGGTGG AAAAGAGCGA CCTGCCTTTC TTCCTCGGTT TCATGCAGCA TCTTTCCGCC CGCGGCCTGT CCTGCCCGCT GCCGCTGCCG CGCCGCGATG GCGCGCTGCT CGGCTCACTG TCCGGCCGTC CGGCGGCGCT GATCTCCTTC CTCGAAGGCA TGTGGCTGAG AAAGCCGGAG GCAAAACACT GCCGCGAAGT CGGCAAGGCG CTGGCCGAGA TGCATGTGGC CGGCGATGGT TTCGAGTTGA AGCGGGCGAA TGCGCTGTCG ATCGACGGCT GGCGGGGGCT GTGGGAGAAA TCCGAAGCGC GCGCCGGCGA GGTTGAGTCC GGCCTGCAGA CCGAGATCCG CAGCGAACTC GATTTCCTCT CCGCCGCCTG GCCGAGCGGC CTGCCGGCCG GCGTCATCCA CGCCGACCTC TTCCCCGACA ACGTCTTCTT CCTCGGTGAC CAGCTCTCCG GCCTGATCGA TTTCTATTTC GCCTGCAACG ACCTGCTCGC CTATGACGTC TCGATCTGCC TGAATGCCTG GTGCTTCGAG AAGGACGGCG CCTATAACAT CACCAAGGGC ACGGCGATGC TCGAGGGTTA CCAGAGCGTC AGGCCGCTGA GCGAGGCCGA AATCGCAGCC CTGCCGGTGC TGTCGCGCGG GTCTGCGCTG CGCTTCTTCC TGACCCGGCT CTATGACTGG CTGACGACGC CGGAGGGCGC CATGGTCACC AAAAAGGATC CGCTCGAATA TCTCCGCAAG CTGCGCTTCC ACCGCCAGAT CAAATCGCCC GCCGAATACG GATTGAGCCT ATGA
|
Protein sequence | MKARPHLAVY TDIAEDDLKW FLTEYDAGTL LSYKGIAEGV ENSNFLLHTS RDPLILTLYE KRVEKSDLPF FLGFMQHLSA RGLSCPLPLP RRDGALLGSL SGRPAALISF LEGMWLRKPE AKHCREVGKA LAEMHVAGDG FELKRANALS IDGWRGLWEK SEARAGEVES GLQTEIRSEL DFLSAAWPSG LPAGVIHADL FPDNVFFLGD QLSGLIDFYF ACNDLLAYDV SICLNAWCFE KDGAYNITKG TAMLEGYQSV RPLSEAEIAA LPVLSRGSAL RFFLTRLYDW LTTPEGAMVT KKDPLEYLRK LRFHRQIKSP AEYGLSL
|
| |
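As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates DNA codon by codon and computes a GC percentage. It assumes the space-separated 10-base blocks are display formatting only, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 and last 24 bases of the gene sequence in this record.
print(translate("ATGAAGGCGA GACCTCACTT G"))       # MKARPHL
print(translate("GCCGAATACG GATTGAGCCT ATGA"))    # AEYGLSL*
```

The two fragments reproduce the start (MKARPHL) and end (AEYGLSL) of the listed protein sequence, with the trailing `*` corresponding to the TGA stop codon. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.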