Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_16071 |
Symbol | psbD |
ID | 4780640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1313889 |
End bp | 1314965 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640084889 |
Product | photosystem II PsbD protein (D2) |
Protein accession | YP_001015429 |
Protein GI | 124026313 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01152] Photosystem II, DII subunit (also called Q(A)) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0432746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCG CTGTTGGAAG CGCAACAGAA CGAGGTTGGT TTGACGCCCT CGATGACTGG TTAAAGCGCG ACCGATTCGT ATTTGTTGGT TGGTCTGGAC TACTACTCTT CCCTACGGCT TTCCTAGCTA TTGGTGGATG GTTTACAGGT ACAACCTTCG TTTCTTCCTG GTACACCCAT GGTGTAGCCA GTTCTTACCT TGAGGGATGC AATTTCCTCA CAGCCGCTGT TAGTACTCCT GGCGATGCCA TGGGTCACAG TCTTCTATTC CTTTGGGGAC CAGAAGCTCA GGGCGATTTA ACACGTTGGT TCCAACTTGG TGGCCTTTGG AATTTCGTTG CTCTTCACGG CGCGTTTAGT CTTATCGGCT TCATGCTTCG TCAGTTCGAA ATTGCAAGAC TAGTTGGTAT CCGTCCATAT AACGCACTTG CTTTCTCAGC AGTTATTGCA GTATTTACTG CTTGCTTCCT TATCTATCCA TTAGGACAGC ACAGTTGGTT TTTCGCTCCT TCTTTTGGAG TTGCAGCAAT ATTCCGTTTC ATCCTCTTCA TTCAAGGATT CCACAACATT ACGCTTAACC CATTCCACAT GATGGGAGTA GCAGGAATTC TTGGTGGTGC TCTTCTTTGT GCTATTCACG GTGCAACAGT TCAGAACACT CTTTATGAAG ACTCAAGTGT TTACTCTGAA GGTAAGACTC AGAGTTCAAC ATTTAGAGGT TTCGATCCAG TTCAAGAAGA AGAAACTTAC TCATTTATTA CAGCAAACCG TTTCTGGAGT CAGATTTTCG GAATTGCTTT CTCAAATAAG CGTTTCCTTC ACTTCTTGAT GCTCTTCGTA CCAGTAACAG GTATGTGGGC TGCATCAATT GGAATTGTTG GATTAGCTCT AAACCTTCGT GCTTACGACT TTGTTAGCCA AGAAATCAGA GCTGCTGAAG ATCCTGAATT CGAAACTTTC TACACAAAAA ATATCCTTCT TAATGAAGGT ATGCGTGCAT GGATGTCTTC TGTAGACCAA CCACACGAAA ACTTTGTATT CCCTGAGGAG GTACTCCCAC GTGGAAACGC CCTTTAA
|
Protein sequence | MTIAVGSATE RGWFDALDDW LKRDRFVFVG WSGLLLFPTA FLAIGGWFTG TTFVSSWYTH GVASSYLEGC NFLTAAVSTP GDAMGHSLLF LWGPEAQGDL TRWFQLGGLW NFVALHGAFS LIGFMLRQFE IARLVGIRPY NALAFSAVIA VFTACFLIYP LGQHSWFFAP SFGVAAIFRF ILFIQGFHNI TLNPFHMMGV AGILGGALLC AIHGATVQNT LYEDSSVYSE GKTQSSTFRG FDPVQEEETY SFITANRFWS QIFGIAFSNK RFLHFLMLFV PVTGMWAASI GIVGLALNLR AYDFVSQEIR AAEDPEFETF YTKNILLNEG MRAWMSSVDQ PHENFVFPEE VLPRGNAL
|
| |