Gene Information |
Locus tag | A9601_19021 |
Symbol | |
ID | 4718641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1638093 |
End bp | 1639406 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640079637 |
Product | putative p-aminobenzoate synthetase |
Protein accession | YP_001010292 |
Protein GI | 123969434 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
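The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1638093, 1639406)
print(length_bp)                  # 1314
print(protein_length(length_bp))  # 437
```

Both results match the Gene Length (1314 bp) and Protein Length (437 aa) rows of the record.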
Plasmid Coverage Information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0758384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA AAAAAATAAT TCTAGAAAAA TGGATAGATC CAGCACTGAT TACGCATCAT CTAACAAAAA AATTTGGAGA TCAAGGATTA GCTTGGCTAG ACAGCGATGG CAAAGAAAAT GGGGAATGGT CAATAATAGG AATTAAACCT AAAAAAATAA TCCAATCAAG AGATATCAAT AACTTAGACA AAACTAATAA TCCATTTAAC AATTTAAGAA ATATTGAAAA AGGATTTTGG ATCGGATGGT TAAGTTATGA AGCCGGAGTT TACATAGAAC CAAAAAACCC ATGGAAAAAA TCTAATATGG CAACTTTATG GATTGCATCA TATGATCCAA TCATTAAATG TAATCTAATA AAAAAAGAAA TAATTATCGA AGGCACAAAC TCATCTGAAC TGATGAATTA TAAAAACATA ATCAACAATA TAAAAAATAT CGAAGAAGAA AATATTATTA AAACAAAGTT GAATTTTGAT TTTTCAAAAA TAAATTTGGA CGAAATGGCT GAAAAATTTC AGAAAAATAT TTTAAAATTG AAAAAATTAA TTTCCTTAGG AGATATATTT CAAGCAAACC TAACAACTAA ATGCGAAATT GAATCTTCCA AAAACTATAA TCCTCTAGAT ATTTATTTGA AAATAAGAAG GAAATTAAGA GCTCCCTTTG GAGGAATAAT AATAAATAAT AATTATAAAG AGGCTGTATT ATCTACCTCG CCAGAAAGGT TTTTAAAGAT AGATAATAAA AATTTTGTAG AATCAAGACC TATCAAAGGA ACTAGATCCA GAGATAAAGA TTTAAATCAA GACGCACTTA ATGCTATCGA TTTAATAACT AATGAGAAAG ATAGAGCCGA AAATATTATG ATTGTTGACC TAATAAGAAA TGATTTAAGT AAAGTTTGCG AAACAGGAAG TATTATGGTG CCAGAAATAT TAAAACTTGA AAGTTTCTTA AAAGTTCATC ATCTAACTTC AGTAATCAGA GGCAAATTAA AAAAAGACAC GAACTGGATT GATTTACTAA AAGCTTGTTG GCCTGGGGGC TCTATAACTG GAGCACCTAA ATTAAGATCA TGCCAGAGAC TTTTTGAATT AGAAAAATGT GAACGCGGAC CATACTGTGG GTCATTTTTG AAGCTTGACT GGAATGGAGA GTTTGACAGC AATATACTAA TAAGATCATT TTTAGTTAAA GACAAAAAAA TCAATATATA TGCTGGTTGC GGAATAGTTA TTGACTCAGA CCCGGAGGAA GAAACTGATG AACTAAAGTG GAAACTTTTA CCATTAATTG ATTCACTAAA ATGA
|
Protein sequence | MKIKKIILEK WIDPALITHH LTKKFGDQGL AWLDSDGKEN GEWSIIGIKP KKIIQSRDIN NLDKTNNPFN NLRNIEKGFW IGWLSYEAGV YIEPKNPWKK SNMATLWIAS YDPIIKCNLI KKEIIIEGTN SSELMNYKNI INNIKNIEEE NIIKTKLNFD FSKINLDEMA EKFQKNILKL KKLISLGDIF QANLTTKCEI ESSKNYNPLD IYLKIRRKLR APFGGIIINN NYKEAVLSTS PERFLKIDNK NFVESRPIKG TRSRDKDLNQ DALNAIDLIT NEKDRAENIM IVDLIRNDLS KVCETGSIMV PEILKLESFL KVHHLTSVIR GKLKKDTNWI DLLKACWPGG SITGAPKLRS CQRLFELEKC ERGPYCGSFL KLDWNGEFDS NILIRSFLVK DKKINIYAGC GIVIDSDPEE ETDELKWKLL PLIDSLK
|
| |
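The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The following is a minimal sketch, not the tool that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ only in permitted start codons), and it assumes the sequence starts with ATG, as this one does. The names `clean`, `translate`, and `gc_percent` are hypothetical helpers.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G ordering of each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGAAAATAA AAAAAATAAT TCTAGAAAAA"
print(translate(prefix))  # MKIKKIILEK
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values that can be compared with the Protein sequence and GC content rows. Removing the spaces from the Protein sequence field first makes the comparison a plain string equality.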