Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_06511 |
Symbol | thrB |
ID | 4779655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 596190 |
End bp | 597137 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640083929 |
Product | homoserine kinase |
Protein accession | YP_001014478 |
Protein GI | 124025362 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.893933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCCGC CAAAAATAGG ACAAACTGTC GTAGTAGAAG TTCCTTCTAC TACAGCCAAT ATTGGTCCAG GGTTTGATTG CCTTGGTGCA GCATTAGATC TTTCCAATCA GTTCACTATT AAAAGAATTG AGGGTAATGC TGAAAGGTTT GAATTAATAA TGGAAAGTAC TGAAGGTAAT CATTTAAGAG GCGGCCCTGA GAATCTTTTT TATCGAGCCG CGCAAAGAGT TTGGAGAACT GCAGGTGTTG AGCCTGTTGC TCTTGAAGCA AGAGTAAAAC TGGCAGTACC TCCCGCAAGA GGACTTGGAA GCAGTGCTAC GGCAATCGTG GCAGGGCTAG TTGGAGCAAA TGCACTTGCT GGATATCCTT TACCTAAGGA AAAATTATTG GAGCTGGCAA TAGATATAGA AGGTCATCCA GACAATGTTG TTCCATCATT AATAGGAGGT CTTTGCGTAA CAGCTAAAAC TGCAACCGAC AGATGGCGAG TGGTCCGCTG TGATTGGGAT CAATCAATAA AAGCGGTAGT TGCAATTCCA TCTATTCGCC TTAGTACAAG CGAAGCGAGA CGTGTAATGC CCGAGAATAT TCCAGTCAAT GATGCAGTAA TCAATTTAGG TGCGCTTACT CTTCTGCTTC AAGGACTAAG GACTGGAAAT GAGGATTTAA TTGCAGATGG TATGCATGAC AAGCTTCATG AACCCTACAG ATGGGGTTTG ATCAAAGGTG GGTTAGAGGT AAGAGAAGCA GCAAAGGCTG CCGGAGCTTT AGGATGTGCA ATTAGTGGAG CAGGACCAAG CATTCTTGCC TTGTGCAAAG CGACTAAAGG CCGAGAAGTC AGTGTCGCGA TGGTCAAAGC TTGGGAAGCT GCTGGTGTAG CAAGTCGTGC TCCTTTAATG AGCCTCCAGC TCACAGGAAG CGAATGCATT TCAAACACTT TTGGGTAG
|
Protein sequence | MGPPKIGQTV VVEVPSTTAN IGPGFDCLGA ALDLSNQFTI KRIEGNAERF ELIMESTEGN HLRGGPENLF YRAAQRVWRT AGVEPVALEA RVKLAVPPAR GLGSSATAIV AGLVGANALA GYPLPKEKLL ELAIDIEGHP DNVVPSLIGG LCVTAKTATD RWRVVRCDWD QSIKAVVAIP SIRLSTSEAR RVMPENIPVN DAVINLGALT LLLQGLRTGN EDLIADGMHD KLHEPYRWGL IKGGLEVREA AKAAGALGCA ISGAGPSILA LCKATKGREV SVAMVKAWEA AGVASRAPLM SLQLTGSECI SNTFG
|
| |