Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_11401 |
Symbol | thrA |
ID | 5731354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1043585 |
End bp | 1044901 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641285508 |
Product | homoserine dehydrogenase |
Protein accession | YP_001551025 |
Protein GI | 159903681 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.814686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAGA ATATTGGTAT AGGCCTCCTA GGACTTGGAA CAGTAGGGAC TGGAGTAGCA CAGATAATCA ATTCTCCGGA AGGGCGCCAC CCTTTGACAT CTAGAGTCGA GTTAAAGCGA ATAGCTCTCA GAGATTCTAA GAAAATCAGA GCTCTGTCAA TACCTAACAA ATTAATCACA GAAGATGCCT GGGAAGTTGT CGAGGATCCT GATGTAGAAA TTGTTGTTGA AGTTATAGGA GGTCTTGAGC CCGCAAGAAG TCTGATTCTC AAGGCAATCA AGTCCGGCAA ATCTGTAGTT ACTGCAAACA AAGCTGTTAT CGCAAGACAT GGTGAGGAAA TTGCAGAAGC CGCAATTTCT TCAGGTGTTT ATGTACTCAT AGAAGCAGCA GTAGGTGGAG GGATCCCAAT TATCGAACCG TTAAAACAAT CACTAGGAGG CAACATAATT CAAAAGGTCA CTGGAATTGT GAATGGAACT ACGAACTACA TTCTGACCAG GATGGCAAAG GAAGGTGCTG ACTACGAAGC CGTATTAAAA GAAGCTCAAT CTCTTGGCTA TGCAGAGTCT GATCCCATGG CAGATGTAGA GGGCCTCGAT GCAGCAGATA AGATCTCGAT ACTCAGCAAT CTTGCTTTTG GTGGGCCAAT TAAAAGAGCA TCTGTACCAA CCAAAGGGAT AAGCACTCTT CAAAATAGAG ATGTTGACTA TGCAAATCAG TTGGGTTATG AAGTCAAGCT TTTAGCTATA GCTGAGAGAC TTGCTAGCAA TCTTGAAAAC AACTCTTCAC TCCCATTAGC AGTAAGAGTT GAGCCAACAC TATTACCAAC CGGCCATCCA CTTGCAGAAG TTAATGGAGT AAACAACGCA ATTCTTGTTG AAGGAGATCC AATCGGAGAA GTAATGTTCT ATGGACCTGG AGCAGGAGCA GGACCTACCG CCTCAGCAGT AGTGGCAGAC ATACTTAATA TTGCAGGGAT AAAACTTATG GGAGGGGAAA AGACGTCTCT AGACCCTCTA CTCTCAGCAT CTAGCTGGAG AGAATGCCAT TTAGCAAAGC CAAAAGAAAT TTTACAAAAG AACTATGTCC GTCTCATTGC TAAAGATGCT CCAGGGGTAA TTGGTCAAAT TGGGAAAATA TTTGGATCTC ACAATGTCTC AATTCAATCA ATAGTCCAAT TCGATGCTAG TGAAGAGGAT GCGGAAATCG TTGTAATTAC TCACAAGGTG TTCAAAGGTT TACTGACAGA TTCTCTTTCT GAGATACAGC AACTCCCAGA GATCAAACAA ATTGCAGCCC ATCTAAGTTG TCTTTAA
|
Protein sequence | MTKNIGIGLL GLGTVGTGVA QIINSPEGRH PLTSRVELKR IALRDSKKIR ALSIPNKLIT EDAWEVVEDP DVEIVVEVIG GLEPARSLIL KAIKSGKSVV TANKAVIARH GEEIAEAAIS SGVYVLIEAA VGGGIPIIEP LKQSLGGNII QKVTGIVNGT TNYILTRMAK EGADYEAVLK EAQSLGYAES DPMADVEGLD AADKISILSN LAFGGPIKRA SVPTKGISTL QNRDVDYANQ LGYEVKLLAI AERLASNLEN NSSLPLAVRV EPTLLPTGHP LAEVNGVNNA ILVEGDPIGE VMFYGPGAGA GPTASAVVAD ILNIAGIKLM GGEKTSLDPL LSASSWRECH LAKPKEILQK NYVRLIAKDA PGVIGQIGKI FGSHNVSIQS IVQFDASEED AEIVVITHKV FKGLLTDSLS EIQQLPEIKQ IAAHLSCL
|
| |