Gene Information |
Locus tag | A9601_06511 |
Symbol | thrB |
ID | 4717353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 574196 |
End bp | 575104 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640078364 |
Product | homoserine kinase |
Protein accession | YP_001009044 |
Protein GI | 123968186 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
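The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. Neither convention is stated in the record, although both agree with its values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(574196, 575104)   # Start bp, End bp from the record
    print(length, protein_length(length))  # compare with Gene Length, Protein Length
```

With the record's coordinates this gives 909 bp and 302 residues, matching the Gene Length and Protein Length fields.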
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTTCCA CAACTGCCAA TTTAGGGCCT GGATTCGATT GCCTTGGAGC TGCATTAGAT TTATATAATG AGTTTATTTT TACAAGAATT GAAGGTGGTG GAGATAGATT TGATTTAATA ATGGAAAGTA CAGATGGTAA TCATTTAAGA GGAGGACCTG AAAACTTAGT TTTTAGAGCA GCACAGAAAG TATGGGAAAG CGCAAATATG GATCCTTTTG CACTTGAAGC AAGAGTTAAG TTGGCAGTTC CACCTGCACG CGGACTTGGA AGTAGTGCTA CAGCAATAGT GGCGGGACTA ATCGGAGCAA ATGCAATCAT GAACTCTCCA TTGTCCAAAG AAAAACTTCT TGAACATGCC ATTGATATAG AAGGTCATCC AGATAATGTC GTTCCTTCTC TCCTGGGTGG GCTTTGCTTG ACAGCCAAGT CGGCTTCTCA AAGATGGAGA ATTATTAGAT GTGATTGGCA CGATTCAATC AAAGCTGTGG TGGCAATACC AGCAATCCGT CTAAGCACAA GTGAAGCAAG AAAGGTTATG CCCAAGAATG TACCTATATC TGATGCAGTG ACAAATATGG GGGCACTTAC TTTGTTACTA AATGGCTTAA AAGCAGGAAA TGAGGAACTT ATAAAAGAAG GAATGTTTGA TAAGTTACAT GAACCCTACA GATGGAAGCT TATTAAAGGT GGACTAGAAG TCAAAGATGC TGCACTTAAT GCAGGTGCTT TAGGATGCGC AATTAGTGGG GCTGGACCAA GTATATTAGC TTTATGTAAA AAAGAAAATG GTAAAAACGT CAGTCAAGCC ATGGTGAAAG CTTGGGAGAA GATAGGTGTA GCTAGCAGAG CACCATTCTT AAACGTACAA AAAACAGGTA GCCAATTTAG CAATATCTCC GGTAAGTAG
|
Protein sequence | MPSTTANLGP GFDCLGAALD LYNEFIFTRI EGGGDRFDLI MESTDGNHLR GGPENLVFRA AQKVWESANM DPFALEARVK LAVPPARGLG SSATAIVAGL IGANAIMNSP LSKEKLLEHA IDIEGHPDNV VPSLLGGLCL TAKSASQRWR IIRCDWHDSI KAVVAIPAIR LSTSEARKVM PKNVPISDAV TNMGALTLLL NGLKAGNEEL IKEGMFDKLH EPYRWKLIKG GLEVKDAALN AGALGCAISG AGPSILALCK KENGKNVSQA MVKAWEKIGV ASRAPFLNVQ KTGSQFSNIS GK
|
| |
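The relationship between the two sequences above can be sketched in code. The example assumes that the Gene sequence field is already written 5' to 3' on the coding strand (the gene is annotated on the minus strand, but the listed sequence begins with a GTG start codon and ends with TAG). It uses the amino acid assignments and start codons of NCBI translation table 11, in which an initiating GTG is read as methionine. The helper names `translate_cds` and `gc_fraction` are illustrative and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; each is read as Met when initiating.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the spacing used for display and normalise to upper case."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First 30 nt of the gene sequence, copied from the record.
    print(translate_cds("GTGCCTTCCA CAACTGCCAA TTTAGGGCCT"))
```

Applied to the first three blocks of the gene sequence, `translate_cds` yields MPSTTANLGP, the first block of the protein sequence. Passing the full Gene sequence field to `gc_fraction` gives a value to compare with the GC content field.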