Gene NATL1_04051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04051 
SymbolpsbB 
ID4781127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp373317 
End bp374840 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content45% 
IMG OID640083674 
Productphotosystem II PsbB protein (CP47) 
Protein accessionYP_001014234 
Protein GI124025118 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.644071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTGC CCTGGTATCG GGTGCACACA GTCGTTATTA ACGACCCTGG CCGTCTCTTG 
GCCGTGCACC TCATGCATAC TGCTTTGTTA GCCGGCTGGG CCGGCTCAAT GGCCTTATAT
GAATTGGCCA TTTTTGATCC TTCTGATCCA GTACTTAATC CAATGTGGCG CCAAGGCATG
TATGTCATGC CCTTTATGGC CCGCCTAGGA ATTACAGGAA GCTGGAACGG ATGGGATATA
ACTGGAGCAA CCGGAGTTGA CCCCGGTTTC TGGAGCTTCG AAGGTGTTGC TGCAGCACAT
ATTGTTTTCA GTGGTCTCCT TATGCTCGCT GCCGTATGGC ACTGGACATA TTGGGACCTT
GAATTGTGGG AAGATTCCCG CACTGGAGAG CCAGCTTTAG ATCTTCCAAG AATATTTGGA
ATTCATCTTT TCTTAGCCGG CCTGACATGC TTTGGCTTTG GCGCATTTCA TTGTTCATCA
GTAGGCATAT GGGTTTCTGA TCCATATGGA TTAACTGGGC ATGTAGAAAA AGTTGCTCCA
GTCTGGGGTG CAGAAGGATT TAATCCATTC AATGCTGGAG GAATTGTCGC CAATCATGTT
GGGGCAGGAC TTTTAGGAAT AATTGGTGGA ATATTCCATA TCACCAATAG ACCTGGAGAA
AGGCTCTATA GGGCTCTAAA GCTAGGAAGT CTTGAAGGCG TTCTAGCAAG TGCTTTGGCA
GCTGTTCTTT TTGTATCCTT CGTAGTTGCT GGAACAATGT GGTATGGATC AGCAACAACT
CCAGTTGAAT TATTTGGTCC TACTAGATAT CAATGGGATT CTGGTTATTT CAAAACTGAA
ATCAACCGCA GAGTTCAAGA GTCTATGAAT GATGGTGCAT CGAAAGCTGA GGCTTATGCA
TCGATTCCTG AGCAACTTGC TTTCTACGAC TATGTAGGTA ATAGTCCCGC CAAAGGAGGA
CTATTTAGAG TTGGTGCAAT GGTTAATGGA GATGGAGTTC CGAGTGGATG GCAAGGTCAC
ATCTCATTTA CTGATAGCGA TGGCAATGAC TTAGAAGTAA GAAGAATTCC AAATTTCTTT
GAGAACTTCC CTGTCATTCT TGAAGACAAG GATGGGAACG TTAAAGCTGA CATTCCTTTC
CGTCGAGCCG AAGCTAAGTA CTCCATTGAA CAAACTGGAG TCACAGCTAC TGTTTATGGT
GGTGAATTAA ATGGACAAAC ATTTACTGAT CCAGTAATAG TTAAACGTTT AGCTAGAAAA
TCTCAGCTTG GAGAGGCCTT CAAGTTTGAC AGAGATAGAT ACAAGTCAGA TGGAGTATTC
AGAAGCTCTC CAAGAGCCTG GTTCACTTAT GCTCACTTGT GTTTTGGCTT ACTATTCCTC
TTCGGGCACT GGTGGCACGC AGCTAGGACT CTCTTTGGCG ACAGATTCTC AGGTATCGAT
CCAGAGCTTG GGGATCAGGT TGAGTTTGGT CTCTTCAAAA AACTTGGAGA CGAATCTACC
CGCCGAGTTC CTGGTCGAGC CTAA
 
Protein sequence
MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDP VLNPMWRQGM 
YVMPFMARLG ITGSWNGWDI TGATGVDPGF WSFEGVAAAH IVFSGLLMLA AVWHWTYWDL
ELWEDSRTGE PALDLPRIFG IHLFLAGLTC FGFGAFHCSS VGIWVSDPYG LTGHVEKVAP
VWGAEGFNPF NAGGIVANHV GAGLLGIIGG IFHITNRPGE RLYRALKLGS LEGVLASALA
AVLFVSFVVA GTMWYGSATT PVELFGPTRY QWDSGYFKTE INRRVQESMN DGASKAEAYA
SIPEQLAFYD YVGNSPAKGG LFRVGAMVNG DGVPSGWQGH ISFTDSDGND LEVRRIPNFF
ENFPVILEDK DGNVKADIPF RRAEAKYSIE QTGVTATVYG GELNGQTFTD PVIVKRLARK
SQLGEAFKFD RDRYKSDGVF RSSPRAWFTY AHLCFGLLFL FGHWWHAART LFGDRFSGID
PELGDQVEFG LFKKLGDEST RRVPGRA