Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_11571 |
Symbol | thrA |
ID | 4912040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 970319 |
End bp | 971620 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640160743 |
Product | homoserine dehydrogenase |
Protein accession | YP_001091381 |
Protein GI | 126696495 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.292612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAT GCAAAATTGG GATTGTAGGT TTTGGAACTG TAGGTTCAGG GATTTATAAA ATATTAAGTT CTGAAGTTGA TTCACATCCA ATTCTAAAAG AAATAGAAAT TGCAAAAATA GCAGTTAAAG ATCTTAATAA AAAAAGGGAT ATTGAGCTAG ATAATAATTT ATTAACTGAT GATCCATTTA AATTAATTAA TGACCCCTCT ATTGATGTAA TTGTTGAAGT AATGGGTGGG GTTGATTTAG CTAAAGATAT TATTCTGCAA TCTTTAAAAT TAGGTAAATC TGTTGTTACA GCAAATAAAG CAGTTATCGC AAGATATGGA GAAGAAATAT ATAAAACTGC ATCTAAAGAA AGAGTTTATA TATTGTCAGA GGCAGCAGTT TGCGGAGGGA TTCCTATCAT TGAACCCTTA AAAAGATCAT TAAAAAGTAA CTGTATAAAA AGAATGGTTG GGATAATAAA TGGCACAACA AATTTTATTC TTTCAAAGAT GACAAATGAA AAAGCTGATT ACAAGGAGAC CTTAAAATTG GCTCAAAGCC TTGGATATGC AGAATTTGAT CCAACTGCAG ATGTTGAGGG CCATGATGCT GCTGATAAGA TTTCAATTCT TAGTGAACTT GCATTCGGAG GGAAAATCAA AAGAGAGGAG ATTCATTTTG AGGGCATTAG TAAAATTAAT CTAAAGGATA TTGAATATGC CAATAAATTA GGGTTTGAAA TAAAACTTTT AGCGCTATCC GAAAGGGGAC AAATTAATAG TAATGATTCA CTCGCTTTAA ATATTTGGGT AGGACCTTCT TTGATTCCAA AATCTCATCC ATTGTCAACA GTTAAGGGAG TTAATAATGC CTTATTGATT GAAGCTGATC CTCTTGGTGA AATAATGTTA TATGGTCCAG GTGCAGGGAG TGGCCCAACT GCAGCGTCAG TGGTATCAGA TATATTAAAT CTGCATGCCG CCAAAGAAAA AAATAATAAT TCAGTCGATC CATTATTATC TTTTGATTTC TGGAGAAACT GCCATATAAC AAGCTCCTCA CAAATAAATA AAAAAAATTA CCTTAGAATT ATTTGTCTTG ATAGTCCAGG TGTCATAGGA AAGATTGGAG ATATTTTTGG AAAGAATAAT GTATCAATTG AATCAATTGT TCAACTTGAT GCTAGTGAGG ACAAAGCTGA AATTGTCGTT ATTACTCATG AGGTGAATAA TGGAGATTTT GAGAGATCGA AAGATGAAAT AAATTCGCTA AATGAAGTCA AAATTATTGC AAGTCAATTA AGTTGTATTT AA
|
Protein sequence | MRKCKIGIVG FGTVGSGIYK ILSSEVDSHP ILKEIEIAKI AVKDLNKKRD IELDNNLLTD DPFKLINDPS IDVIVEVMGG VDLAKDIILQ SLKLGKSVVT ANKAVIARYG EEIYKTASKE RVYILSEAAV CGGIPIIEPL KRSLKSNCIK RMVGIINGTT NFILSKMTNE KADYKETLKL AQSLGYAEFD PTADVEGHDA ADKISILSEL AFGGKIKREE IHFEGISKIN LKDIEYANKL GFEIKLLALS ERGQINSNDS LALNIWVGPS LIPKSHPLST VKGVNNALLI EADPLGEIML YGPGAGSGPT AASVVSDILN LHAAKEKNNN SVDPLLSFDF WRNCHITSSS QINKKNYLRI ICLDSPGVIG KIGDIFGKNN VSIESIVQLD ASEDKAEIVV ITHEVNNGDF ERSKDEINSL NEVKIIASQL SCI
|
| |