Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen_4497 |
Symbol | |
ID | 4094468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008061 |
Strand | + |
Start bp | 1751666 |
End bp | 1752664 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638017785 |
Product | homoserine kinase |
Protein accession | YP_624352 |
Protein GI | 107026841 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR00938] homoserine kinase, Neisseria type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.165757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGTTT TCACCGCTGT TTCCGACTCC GATCTCGCGC AATGGATGCG CCACTACGAA CTGGGCGACG TGCTTGCGTT CCGCGGCATT CCGTCCGGTA TCGAAAACAG CAATTTCTTC CTGACGACGA CGCGCGGCGA ATACGTCCTC ACGATCTTCG AAAAGCTCAC CGCGCAGCAG TTGCCGTTCT ACCTCGACCT GATGCGCCAC CTGGCCGCCC ACGGCGTGCC GGTGCCGGAC CCGATCCCGC GCGACGACGG CGCGCTGTTC GGCGAGTTGC ACGGCAAGCC GGCCGCGATC GTCACGAAGC TCGACGGCGC GGCCGAACTC GCGCCCGGGG TCGAACACTG CATCGAGGTC GGGCAGATGC TCGCGCGCCT GCACCTCGCG GGCCGCGACT ATCCGCGCAA CCAGCCGAAC CTGCGCAGCC TGCCGTGGTG GCAGGAGAAC GTGCCGGCAA TCGTGCCGTT CATCACGGAC GCGCAACGCG CGCTGCTCGA AGGCGAACTC GCGCACCAGG CCGGTTTCTT CGCGTCGGAC GACTACGCAG CGCTGCCGGC CGGCCCGTGC CATTGCGACC TGTTCCGCGA CAACGTGCTG TTCGCCCATG CGGCGCCCGG CACCGGTCAC GACGTGCGGC TCGGCGGCTT CTTCGACTTC TATTTCGCGG GCTGCGACAA GTGGCTGTTC GACGTCGCGG TCACCGTCAA CGACTGGTGC GTCGACCTCG CGACCGGCGT GCTCGACGTC GCGCGCGCCG ATGCGCTGCT GCGCGCGTAC CAGACCGTGC GGCCGTTCAC GGCCGAGGAG CGCCGCCATT GGAGCGACAT GCTGCGCGCC GGCGCGTACC GCTTCTGGGT GTCGCGCCTG TACGACTTCT ACCTGCCGCG CGCGGCCGAG ATGCTCAAGC CGCACGACCC CGGCCATTTC GAACGCATCC TGCGTGAGCG TATCGCGCAT ACGCCCGCGC TTCCCGAGAT CCAAACCGCA TGCAACTGA
|
Protein sequence | MAVFTAVSDS DLAQWMRHYE LGDVLAFRGI PSGIENSNFF LTTTRGEYVL TIFEKLTAQQ LPFYLDLMRH LAAHGVPVPD PIPRDDGALF GELHGKPAAI VTKLDGAAEL APGVEHCIEV GQMLARLHLA GRDYPRNQPN LRSLPWWQEN VPAIVPFITD AQRALLEGEL AHQAGFFASD DYAALPAGPC HCDLFRDNVL FAHAAPGTGH DVRLGGFFDF YFAGCDKWLF DVAVTVNDWC VDLATGVLDV ARADALLRAY QTVRPFTAEE RRHWSDMLRA GAYRFWVSRL YDFYLPRAAE MLKPHDPGHF ERILRERIAH TPALPEIQTA CN
|
| |