Gene Information |
Locus tag | P9303_18581 |
Symbol | thrB |
ID | 4775974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1620202 |
End bp | 1621152 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087367 |
Product | homoserine kinase |
Protein accession | YP_001017865 |
Protein GI | 124023558 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
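The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose length includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 1620202, End bp 1621152.
length = gene_length_bp(1620202, 1621152)
print(length)                     # 951
print(protein_length_aa(length))  # 316
```

Both results match the Gene Length (951 bp) and Protein Length (316 aa) fields in the record.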
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.566516 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGC CGAGTATCGG CCAAAAAATT GTGGTTGACG TACCGTCCAC CACCGCCAAC CTTGGTCCTG GCTTCGACTG CCTTGGTGCT GCCCTTGACC TCAACAACCG TTTTGCCATG CGGCGGATCG AAGGGGACAG CGGACGCTTT GAACTCATTA TTGAAGGCAA TGAGGGAAGC CACCTACGCG GTGGGCCCAA CAACCTGATT TATCGCGCCG CCCAGAGGGT GTGGAAAGCC GCAGGGCTCG AACCGGTGGG ACTAGAAGCC AAAGTGAGGC TTGCGGTACC CCCCGCAAGA GGCCTAGGAA GCAGTGCCAG TGCCATCGTG GCAGGGTTGG TTGGAGCCAA TGCCTTGGTG GGCGAACCTC TCAGCAAAGA AAAACTGTTG GAACTGGCCA TTGACATTGA GGGACATCCC GACAATGTGG TGCCTTCTCT TCTAGGAGGT CTTTGCTTGA CTGCCAAGGC AGCCTCGCAA CGCTGGCGGG TTGTTCGTTG TGTCTGGATC AATTCCGTGA AAGTCGTTGT AGCAATCCCC TCTATTCGCC TAAGCACGAG CGAGGCAAGG CGCGCCATGC CTAAAGACAT TCCGATCAGC GATGCCGTAG AAAACCTTGG TGCCCTTACG CTCCTGCTGC AGGGACTGCG GACAGGAAAC GGCGACCTGA TTACAGACGG GATGCACGAT CGATTGCATG AGCCCTATCG CTGGCCATTA ATCAAAGGTG GTTTGGATGT TCGCGATGCG GCTCTGAATG CCGGGGCCTG GGGTTGTGCC ATCAGCGGAG CTGGCCCCAG CGTGCTGGCT CTGTGCCCGG AGGATAAAGG GCAAGCAGTC AGTCAGGCAA TGGTAAAAGC TTGGGAGGCC GAGGGTGTAG CAAGTAGGGC ACCACTGCTT AGCATTCAGA CAGGAGGGAG CCACTGGCAA CCTCAAATTG AGGATGAGTA G
|
Protein sequence | MAQPSIGQKI VVDVPSTTAN LGPGFDCLGA ALDLNNRFAM RRIEGDSGRF ELIIEGNEGS HLRGGPNNLI YRAAQRVWKA AGLEPVGLEA KVRLAVPPAR GLGSSASAIV AGLVGANALV GEPLSKEKLL ELAIDIEGHP DNVVPSLLGG LCLTAKAASQ RWRVVRCVWI NSVKVVVAIP SIRLSTSEAR RAMPKDIPIS DAVENLGALT LLLQGLRTGN GDLITDGMHD RLHEPYRWPL IKGGLDVRDA ALNAGAWGCA ISGAGPSVLA LCPEDKGQAV SQAMVKAWEA EGVASRAPLL SIQTGGSHWQ PQIEDE
|
| |
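The sequences above are printed in space-separated blocks of ten. A minimal sketch of how one might strip that grouping, compute GC content, and translate the gene sequence is given below. The helper names are hypothetical. The codon table is the standard assignment, which translation table 11 shares for internal codons; table 11 also permits alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that case does not arise).

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGCGCAGC CGAGTATCGG CCAAAAAATT"
print(translate(prefix))  # MAQPSIGQKI
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would be one way to check the record's protein sequence and GC content fields.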