Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_18831 |
Symbol | |
ID | 4912710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1610087 |
End bp | 1611403 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640161489 |
Product | putative p-aminobenzoate synthetase |
Protein accession | YP_001092107 |
Protein GI | 126697221 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.264202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA AAAAATTAAT TCTAGAAAAA TGGATAGATC CAGCACTGAT TACGCATCAT CTAACAAAAA AATTCGGAGA TAAAGGGTTA GCTTGGCTAG ACAGTGATGG TAAAGAAAAT GGGGAATGGT CAATAATAGG AATAAAACCT AAAAAAATAA TACAATCAAG AGATATCAAT AACTTAGACA AAACTAATAA TCCATTTAAC AATTTAAAAA ATATTGAAAA AGGATTTTGG ATCGGATGGT TAAGTTATGA AGCTGGAGTT TACATAGAAC CCAAAAACCC ATGGCGAAAA TCTAATATGG CAACTTTATG GATTGCGTCA TATGATCCAA TCATTAAATG TAATCTAATA AAAAAAGAAA TAATTATCGA AGGCACAAAC TCATCTGAAC TGATGAAATT TAAAAATACA ATAGACAATA TAAAAAATAT TGAAGAAGAA AATATTATTA AAACAAATTT AAATTTTGAT TTTTCAAAAA TAAATTTGGA CGAAATGGCT GAAAAATTTC AGAAAAATAT TCTAAAATTA AAGAAATTAA TTTCTGAAGG GGATATATTT CAAGCAAACC TAACAACTAA ATGCGAGATT GAATCTTCCA AAAATTATAA TCCTTTAGAT ATTTATTTAA AAATAAGAAG GAAATTAAGA GCTCCATTTG GAGGAATAAT AATCAATAAT GATAATCATA ATGAGGCGGT ATTATCTACC TCCCCAGAAA GATTTATAAA AATAGATAAT AAAAGTTTTG TAGAATCAAG ACCTATTAAA GGAACTAGAT CCAGAGATAA GGATTTAAAT CAAGACGCAC TAAATGCTAT CGATTTAATA ACGAACGAAA AAGATAGAGC CGAAAATATT ATGATTGTTG ACCTAATAAG AAATGATTTA AGTAAAGTTT GCGAAACAGG AAGTATTATG GTGCCAGAAA TATTAAAACT TGAAAGTTTC TTAAAAGTTC ATCATCTAAC TTCAGTAATC AGAGGCAAAT TAAAAAAAAA CAAGAACTGG ATTGATTTAC TAAAAGCTTG TTGGCCAGGG GGCTCAATAA CTGGAGCACC TAAATTAAGA TCATGCCAGA GACTTTTTGA ATTAGAAGAA TATGAACGCG GACCATACTG TGGCTCGTTT TTGAAGCTTG ACTGGAATGG AGAGTTTGAC AGCAATATAC TAATAAGATC ATTTTTAATT AAAGACAAAA AAATCAGTAT ATATGCTGGT TGCGGAATAG TTATTGACTC AAACCTTGAA GAGGAAACTA ATGAACTAAA GTGGAAACTT TTACCGTTAA TTGATTCACT AAAATGA
|
Protein sequence | MKIKKLILEK WIDPALITHH LTKKFGDKGL AWLDSDGKEN GEWSIIGIKP KKIIQSRDIN NLDKTNNPFN NLKNIEKGFW IGWLSYEAGV YIEPKNPWRK SNMATLWIAS YDPIIKCNLI KKEIIIEGTN SSELMKFKNT IDNIKNIEEE NIIKTNLNFD FSKINLDEMA EKFQKNILKL KKLISEGDIF QANLTTKCEI ESSKNYNPLD IYLKIRRKLR APFGGIIINN DNHNEAVLST SPERFIKIDN KSFVESRPIK GTRSRDKDLN QDALNAIDLI TNEKDRAENI MIVDLIRNDL SKVCETGSIM VPEILKLESF LKVHHLTSVI RGKLKKNKNW IDLLKACWPG GSITGAPKLR SCQRLFELEE YERGPYCGSF LKLDWNGEFD SNILIRSFLI KDKKISIYAG CGIVIDSNLE EETNELKWKL LPLIDSLK
|
| |