Gene NATL1_16071 details

Gene Information

Locus tag: NATL1_16071
Symbol: psbD
ID: 4780640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1313889
End bp: 1314965
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 44%
IMG OID: 640084889
Product: photosystem II PsbD protein (D2)
Protein accession: YP_001015429
Protein GI: 124026313
COG category:
COG ID:
TIGRFAM ID: [TIGR01152] Photosystem II, DII subunit (also called Q(A))
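
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention. All variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1313889
end_bp = 1314965
gene_length_bp = 1077
protein_length_aa = 358

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
total_codons, leftover_bases = divmod(span_bp, 3)
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(leftover_bases == 0)                 # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions, 1314965 - 1313889 + 1 = 1077 bp, which is 359 codons: 358 residues plus one stop codon.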


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0432746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCG CTGTTGGAAG CGCAACAGAA CGAGGTTGGT TTGACGCCCT CGATGACTGG 
TTAAAGCGCG ACCGATTCGT ATTTGTTGGT TGGTCTGGAC TACTACTCTT CCCTACGGCT
TTCCTAGCTA TTGGTGGATG GTTTACAGGT ACAACCTTCG TTTCTTCCTG GTACACCCAT
GGTGTAGCCA GTTCTTACCT TGAGGGATGC AATTTCCTCA CAGCCGCTGT TAGTACTCCT
GGCGATGCCA TGGGTCACAG TCTTCTATTC CTTTGGGGAC CAGAAGCTCA GGGCGATTTA
ACACGTTGGT TCCAACTTGG TGGCCTTTGG AATTTCGTTG CTCTTCACGG CGCGTTTAGT
CTTATCGGCT TCATGCTTCG TCAGTTCGAA ATTGCAAGAC TAGTTGGTAT CCGTCCATAT
AACGCACTTG CTTTCTCAGC AGTTATTGCA GTATTTACTG CTTGCTTCCT TATCTATCCA
TTAGGACAGC ACAGTTGGTT TTTCGCTCCT TCTTTTGGAG TTGCAGCAAT ATTCCGTTTC
ATCCTCTTCA TTCAAGGATT CCACAACATT ACGCTTAACC CATTCCACAT GATGGGAGTA
GCAGGAATTC TTGGTGGTGC TCTTCTTTGT GCTATTCACG GTGCAACAGT TCAGAACACT
CTTTATGAAG ACTCAAGTGT TTACTCTGAA GGTAAGACTC AGAGTTCAAC ATTTAGAGGT
TTCGATCCAG TTCAAGAAGA AGAAACTTAC TCATTTATTA CAGCAAACCG TTTCTGGAGT
CAGATTTTCG GAATTGCTTT CTCAAATAAG CGTTTCCTTC ACTTCTTGAT GCTCTTCGTA
CCAGTAACAG GTATGTGGGC TGCATCAATT GGAATTGTTG GATTAGCTCT AAACCTTCGT
GCTTACGACT TTGTTAGCCA AGAAATCAGA GCTGCTGAAG ATCCTGAATT CGAAACTTTC
TACACAAAAA ATATCCTTCT TAATGAAGGT ATGCGTGCAT GGATGTCTTC TGTAGACCAA
CCACACGAAA ACTTTGTATT CCCTGAGGAG GTACTCCCAC GTGGAAACGC CCTTTAA
 
Protein sequence
MTIAVGSATE RGWFDALDDW LKRDRFVFVG WSGLLLFPTA FLAIGGWFTG TTFVSSWYTH 
GVASSYLEGC NFLTAAVSTP GDAMGHSLLF LWGPEAQGDL TRWFQLGGLW NFVALHGAFS
LIGFMLRQFE IARLVGIRPY NALAFSAVIA VFTACFLIYP LGQHSWFFAP SFGVAAIFRF
ILFIQGFHNI TLNPFHMMGV AGILGGALLC AIHGATVQNT LYEDSSVYSE GKTQSSTFRG
FDPVQEEETY SFITANRFWS QIFGIAFSNK RFLHFLMLFV PVTGMWAASI GIVGLALNLR
AYDFVSQEIR AAEDPEFETF YTKNILLNEG MRAWMSSVDQ PHENFVFPEE VLPRGNAL
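
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not part of the original record. The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical helpers. It uses the standard codon-to-amino-acid assignments, on the understanding that translation table 11 shares them and differs only in the permitted start codons; since this gene starts with ATG, that difference should not matter here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGACGATCG CTGTTGGAAG CGCAACAGAA"
print(translate(first_blocks))  # MTIAVGSATE
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and the GC content field listed in this record.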