Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_02161 |
Symbol | |
ID | 5731226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 208526 |
End bp | 209641 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641284560 |
Product | aminotransferase class-I |
Protein accession | YP_001550101 |
Protein GI | 159902757 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATAG AAGCTGAGAT TGATAATCAT CAACAAGTCT CAAATTTCTT TAAACATGGA GGGAATATTG CCCAAGAAGC AAAAAGATTG GGATTCCAGC CAAATGAACT AATAGATGCC AGTGCATCAT TAGTTCCCTT CCCTCCTCCT AAAGAAATTA AAAAGTGTAT TATCAATGCG CTTAAAGGAA TGGATATCAC AAGTTATCCA GACCCAAGTC TTACAACTCT AAGAGATGTA ATCAGCACTT GGCATGGTAT TGATCCCTCC TGTGTATTGC CTGGCAATGG AGCAGCAGAG TTAATTACAT GGGCTGCTCG CGATGCTGCG AAGCATGGCC TAAGTATCTT GCCTTCGCCA GGTTTTAGTG ACTATAAGCG CGCCTTAAAA TGCTGGAACT CCCTTTATAA GCAAACTCCT CTGCCATTGT CATGGGATTC TATCTTCCCT CAATCATTTC CGATTAGTAC ATCCTCTAAC GTGATTTGGG TAACGAATCC ACATAATCCA ACAGGTCAAT TATGGAGTCG AAATTCTCTG GAAAAATTAC TTGTATCTAA TCGGCTAGTG ATCTGTGACG AAGCATTTTT ACCGCTAGTT CCTAACGGCG ACAAACAATC ACTTATACCT CTTATTACTA GTCATTCAAA TCTGATTGTC ATAAGAAGTC TTACTAAGCT TTTCTCAATA GCTGGTTTAA GAGTTGGGTA TGCGATTAGT TCGAGGCACA GATTACATGA ATGGGAGAAA TGTAGAGATC CATGGCCAAT GAATGGCCTT GCAATAGCTG TAGGAACAAT GCTAATGAGT AACCAAACAT TAATGAACAG GCAAATTCAG AAAGTCCAAT CATGGGTATC AAATGAAGGG GCTTGGCTAC ATTCAAAACT TGAGGGTCTT CATGGCATAA AATCTTACCC ATCTTCAACT AACTTCCAAT TAATTCATAG TAGCAATTCA CTAAGTCTAT TACGCGAACA GCTAGCGCAA AGAAAGATTT TACTTAGAGA TTGTCAATCT TTTGAAGGTC TAGGAGCTAA CTGGCTACGA ATTAGCTTAA AAAGTCGACA TGAAAATCTG CGCATCTTGG ATGCAATGAA GGAAGTAATT AACTAG
|
Protein sequence | MDIEAEIDNH QQVSNFFKHG GNIAQEAKRL GFQPNELIDA SASLVPFPPP KEIKKCIINA LKGMDITSYP DPSLTTLRDV ISTWHGIDPS CVLPGNGAAE LITWAARDAA KHGLSILPSP GFSDYKRALK CWNSLYKQTP LPLSWDSIFP QSFPISTSSN VIWVTNPHNP TGQLWSRNSL EKLLVSNRLV ICDEAFLPLV PNGDKQSLIP LITSHSNLIV IRSLTKLFSI AGLRVGYAIS SRHRLHEWEK CRDPWPMNGL AIAVGTMLMS NQTLMNRQIQ KVQSWVSNEG AWLHSKLEGL HGIKSYPSST NFQLIHSSNS LSLLREQLAQ RKILLRDCQS FEGLGANWLR ISLKSRHENL RILDAMKEVI N
|
| |