Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_11561 |
Symbol | thrA |
ID | 4717869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 971451 |
End bp | 972752 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640078871 |
Product | homoserine dehydrogenase |
Protein accession | YP_001009547 |
Protein GI | 123968689 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAT GCAAAATTGG TATTGTAGGT TTTGGAACTG TAGGTTCAGG GATTTATAAA ATATTAACTT CTGAACTCGA TTCACATCCA ATTCTAGAAG AAATAGAAAT TTCAAAAATA GCAGTTAAAG ATCTTGATAA AAAAAGAGAT ATTGAGCTAG ATAATAATTT ATTAACTGAT GATCCATTTA AATTAATTAA TGACCCATCT ATAGATGTTA TTGTTGAAGT AATGGGTGGG GTTGATTTAG CGAAAGAAAT TATTCTGCAA TCATTAAAAT TAGGTAAATC TGTAGTTACC GCAAACAAAG CAGTTATTGC AAGATATGGA GAAGAAATAT ATAAAACTGC ATCTAAAGAA GGTGTCTATA TATTGTCAGA AGCAGCTGTT TGCGGGGGGA TTCCAATAAT TGAACCCTTA AAAAGATCAT TAAAAAGTAA CAGTATAAAA AAAATGGTTG GGATAATAAA TGGCACAACA AATTTTATTC TTTCAAAGAT GGCAAATGAA AAAGCTGATT ATAAGGAAAC TTTAAAATTG GCCCAAAGCC TTGGTTACGC AGAATTTGAT CCAACTGCAG ATGTTGAGGG GCATGATGCT GCTGATAAGA TTTCAATCCT TAGTGAACTC GCATTTGGAG GGAAAATCAA AAGAGAGGAG ATACATTCTG AGGGTATCAG TAAAATTAAT CTCAAGGATA TCGAATATGC CAATAAATTA GGATTTGAAA TAAAACTTTT AGCGCTCTCT GAAAGGGGAC AAATTAATAG TAATGATTCA CTCGCTTTAA ATATTTGGGT AGGACCTTCT TTGATTCCAA AATCTCATCC ATTGTCAACA GTTAAGGGAG TTAACAATGC CTTGTTGATT GAGGCTGATC CTCTTGGAGA AATAATGTTA TATGGTCCAG GTGCAGGGAG TGGTCCAACT GCAGCATCAG TAGTATCAGA TATATTAAAT CTGCATGCCG CCTCAGTAAA AAATAATAAT TCAATCGATC CATTATTATC TTTTGATTTC TGGAGAAACT GCCATATCAT AGGATCTTCG CAAATAAACA AAAAAAATTA CCTTAGAATT ATTTGTCTTG ATAGTCCAGG TGTAATAGGA AAGATTGGAG ATATTTTTGG AAAGAATAAT GTATCAATCG AATCAATTGT TCAACTTGAT GCGAGTGAGG ACAAAGCTGA AATTGTCGTT ATTACTCATG AGGTGAATAA TGGAGATTTT GAGAGATCGA AAAATGAAAT AAATTCGTTA AATGAAGTAA AAATTATTGC AAGTCAATTA AGTTGTATTT AA
|
Protein sequence | MRKCKIGIVG FGTVGSGIYK ILTSELDSHP ILEEIEISKI AVKDLDKKRD IELDNNLLTD DPFKLINDPS IDVIVEVMGG VDLAKEIILQ SLKLGKSVVT ANKAVIARYG EEIYKTASKE GVYILSEAAV CGGIPIIEPL KRSLKSNSIK KMVGIINGTT NFILSKMANE KADYKETLKL AQSLGYAEFD PTADVEGHDA ADKISILSEL AFGGKIKREE IHSEGISKIN LKDIEYANKL GFEIKLLALS ERGQINSNDS LALNIWVGPS LIPKSHPLST VKGVNNALLI EADPLGEIML YGPGAGSGPT AASVVSDILN LHAASVKNNN SIDPLLSFDF WRNCHIIGSS QINKKNYLRI ICLDSPGVIG KIGDIFGKNN VSIESIVQLD ASEDKAEIVV ITHEVNNGDF ERSKNEINSL NEVKIIASQL SCI
|
| |