Gene Information |
Locus tag | NATL1_15361 |
Symbol | thrA |
ID | 4780390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1249605 |
End bp | 1250927 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640084818 |
Product | homoserine dehydrogenase |
Protein accession | YP_001015358 |
Protein GI | 124026242 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
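The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (an assumption that is consistent with the values shown) and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per three bases, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1249605, 1250927)
print(length_bp, protein_length(length_bp))  # 1323 440
```

Both results agree with the Gene Length and Protein Length fields of this record.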
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.261974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGA AAAAAGTTGG TATCGGTTTA CTTGGACTAG GAACTGTTGG CCAGGGTGTC GCAAATATAA TAAGCCAACC AAAAGATAGG CATCCTTTGG TTGGAGAACT TGAACTTTTA AGTGTCGCGG TAAGAAATCT CAAAAAGAAA AGAGATATAT CCATCCCAGA TTCAATACTT ACAACAAACC CAACTGAAAT AATAAATAAT CCCAATATTC AAATAGTTGT TGAAGTAATG GGCGGTATAG AACCAGCCAA ATCATTAATT ATCCAAGCGA TAAGAGCAGG TAAATCTGTT GTTACTGCTA ATAAAGCAGT AATTGCAAGA CATGGTGAAG AAATCTCAAA TGAAGCAAAA GCCGCTGGGG TTTATGTCCT CATTGAAGCA GCAGTTGGAG GGGGAATCCC AATAATTGAG CCTTTAAAAC AATCTTTAGG AGGGAATCAG ATAACAAAAG TAAGCGGAAT AATAAATGGA ACAACTAATT ACATACTCTC CAGAATGGAT AAAGAAGGAG TTAATTATGC TGAAGTTTTA AAAGATGCCC AAGTCCTTGG ATATGCAGAA AGTGATCCTG CCGCTGATGT AGAGGGATCA GACGCAGCTG ACAAAATTGC AATTCTTAGT GGCCTTGCTT TTGGTGGAGC AATTAATAGA GCTCACATAC CGACAACTGG AATAAACCTA CTAGAGGCTA TAGATGTTAA TTACGCTAGG AAGCTAGGGT ATGGAATCAA GCTTTTAGCA ATATCTGAGA AAGGAGCAAC TCAACCAAGC AAAGAAGACA GTCAACCACT CTCCATTTGG GTTGAGCCAA CATTAGTACC TGAAGACAAT CCATTAGCGG GAGTAAATGG AGTTAACAAT GCAATCCTTG TAGAAGGAAA TCCCATTGGC CAAGTAATGT TTTTGGGTCC AGGTGCAGGT TCCGGCCCAA CTGCATCAGC TGTTGTTGCA GACATACTTA ACATTGCAGG TATTCAGTCT ATGAGTGAAG ACAAAATCTT TAACTTAGAT CCTCTTCTAT CAGCGAAAGG TTGGAGAAGT TGTCACGTAG CAGAAAAGGA ACAAATAACT AAAAAGAATT ACATACGGCT TATTGCAGAA GATAGCCCAG GGGTAATTGG AGAAATCGGG ACTATCTTTG GAAAGAAGAA AATATCTATT GAATCAATTG TGCAATTTGA TGCAAAGGAT AAAAAAGCAG AGATAGTAGT TATTACCCAC AAAATCAATC AAGGTCAACT TGAAGAGGCT CTTTTAGATA TCAAAAACTT GCCACAAGTC AAAAGAATTG CAGCAAAGAT GGGTTGCCTT TAG
|
Protein sequence | MNMKKVGIGL LGLGTVGQGV ANIISQPKDR HPLVGELELL SVAVRNLKKK RDISIPDSIL TTNPTEIINN PNIQIVVEVM GGIEPAKSLI IQAIRAGKSV VTANKAVIAR HGEEISNEAK AAGVYVLIEA AVGGGIPIIE PLKQSLGGNQ ITKVSGIING TTNYILSRMD KEGVNYAEVL KDAQVLGYAE SDPAADVEGS DAADKIAILS GLAFGGAINR AHIPTTGINL LEAIDVNYAR KLGYGIKLLA ISEKGATQPS KEDSQPLSIW VEPTLVPEDN PLAGVNGVNN AILVEGNPIG QVMFLGPGAG SGPTASAVVA DILNIAGIQS MSEDKIFNLD PLLSAKGWRS CHVAEKEQIT KKNYIRLIAE DSPGVIGEIG TIFGKKKISI ESIVQFDAKD KKAEIVVITH KINQGQLEEA LLDIKNLPQV KRIAAKMGCL
|
| |
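The gene and protein sequences above can be cross-checked against each other. The sketch below uses only the Python standard library and assumes the space-separated 10-base blocks shown in this record. Translation table 11 assigns the same amino acids to codons as the standard code, so a standard codon table is used here. Alternative start codons are not handled, which is acceptable for this gene because it starts with ATG. All names are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAACATGA AAAAAGTTGG TATCGGTTTA"
print(translate(prefix))  # MNMKKVGIGL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.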