Gene A9601_03391 details


Gene Information

Locus tag: A9601_03391
Symbol: psbB
ID: 4717027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 311035
End bp: 312558
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 41%
IMG OID: 640078042
Product: photosystem II PsbB protein (CP47)
Protein accession: YP_001008734
Protein GI: 123967876
COG category:
COG ID:
TIGRFAM ID: [TIGR03039] photosystem II chlorophyll-binding protein CP47
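
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the gene length includes one stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(311035, 312558)
print(length, protein_length(length))  # prints: 1524 507
```

Under these assumptions the computed values match the Gene Length (1524 bp) and Protein Length (507 aa) fields.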


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0452672
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATTGC CTTGGTATCG AGTTCACACA GTAGTTATTA ATGACCCAGG TCGACTACTT 
GCTGTGCATC TTATGCATAC TGCATTATTA GCCGGCTGGG CCGGTTCAAT GGCTCTTTAT
GAATTAGCCA TTTTTGATCC TTCTGATGCT GTTCTCAATC CAATGTGGAG ACAGGGGATG
TACGTTATGC CTTTCATGGC AAGACTAGGT ATCACAAGTA GTTGGAATGG ATGGGATATT
ACTGGTGCTA CTGGAGTTGA TCCTGGATTC TGGAGTTTCG AAGGGGTTGC GGCAGCACAC
ATAGTATTTA GTGGCCTATT GATGTTGGCA TCTATTTGGC ACTGGACATA TTGGGACCTA
GATTTATGGG AAGATTCAAG AACTGGTGAG CCTGCTCTTG ATTTACCAAG AATTTTTGGA
ATTCACCTTC TTTTAGCAGG CCTTACATGT TTTGGTTTTG GAGCTTTTCA TTGCGCAAAC
GTTGGGATTT GGGTTTCTGA TCCATATGGT TTAAGTGGTC ATGTAGAACC TGTAGCACCT
TCCTGGGGAG TTGAAGGATT TAATCCTTTT AATCCAGGAG GTATTGTGGC TAATCATATT
GCAGCTGGAC TTATGGGTAT TATTGGAGGT ATTTTTCACA TTACCAACAG GCCTGGAGAA
AGACTTTACA GAGCATTAAA ACTTGGAAGT CTCGAAGGAG TTCTTGCTAG TGCTTTAGCT
GCTGTATTGT TTGTTTCTTT CGTTGTTTCC GGAACAATGT GGTACGGTTC AGCAACCACC
CCAATAGAGC TTTTTGGTCC TACTAGATAT CAATGGGATT CAGGCTATTT CAAAACTGAA
ATTAACAGAA GAGTGCAAGC TGCTATTGAT GATGGTGCAA CTAAATCAGA GGCATATGCA
TCTATACCAG AAAAATTAGC CTTCTACGAT TATGTCGGTA ATAGTCCAGC TAAAGGGGGA
CTATTCAGAG TTGGAGCTCT TGTTAATGGT GATGGCTTAC CAACTGGTTG GCAAGGTCAC
ATTGCTTTCC AAGATAAGGA AGGTAACGAA TTAGAAGTTA GAAGAATTCC TAACTTCTTT
GAAAACTTCC CTGTCATTCT TGAAGATAAA GAAGGTAATG TAAGGGCAGA TATCCCATTT
AGAAGAGCTG AAGCAAAGTA CTCATTTGAG CAAACTGGTA TTACTGCAAC AATTTATGGA
GGAGACCTAG ATGGACAAAC ATTTACAGAT CCTGCAGTAG TGAAAAGATT AGCAAGAAAG
GCTCAACTTG GAGAAGCATT CAAGTTTGAC AGAGAGACAT ATAAATCTGA CGGTGTATTC
CGAAGTTCTC CAAGAGCATG GTTTACATAT GCACATTTAT GTTTCGGATT GCTATTCTTG
TTTGGGCACT GGTGGCACGC TTCAAGAACT CTTTACAGAA ATTCCTTTGC TGGTATTGAT
GCTGAAATTG GCGACCAAGT TGAATTTGGT TTATTCAAAA AGCTTGGAGA CGAAACCACA
AGAAGAATCC CTGGAAGGGT TTAA
 
Protein sequence
MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDA VLNPMWRQGM 
YVMPFMARLG ITSSWNGWDI TGATGVDPGF WSFEGVAAAH IVFSGLLMLA SIWHWTYWDL
DLWEDSRTGE PALDLPRIFG IHLLLAGLTC FGFGAFHCAN VGIWVSDPYG LSGHVEPVAP
SWGVEGFNPF NPGGIVANHI AAGLMGIIGG IFHITNRPGE RLYRALKLGS LEGVLASALA
AVLFVSFVVS GTMWYGSATT PIELFGPTRY QWDSGYFKTE INRRVQAAID DGATKSEAYA
SIPEKLAFYD YVGNSPAKGG LFRVGALVNG DGLPTGWQGH IAFQDKEGNE LEVRRIPNFF
ENFPVILEDK EGNVRADIPF RRAEAKYSFE QTGITATIYG GDLDGQTFTD PAVVKRLARK
AQLGEAFKFD RETYKSDGVF RSSPRAWFTY AHLCFGLLFL FGHWWHASRT LYRNSFAGID
AEIGDQVEFG LFKKLGDETT RRIPGRV
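
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch in Python, not part of the original record. The helper names are hypothetical, and the completeness check is only a rough heuristic: it assumes an ATG start codon and one of the three standard stop codons used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, recognised stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same block layout as the record (not the full gene).
toy = clean_sequence("ATGGGATTGC CT\nTAA")
print(len(toy), round(gc_percent(toy), 1), looks_like_complete_cds(toy))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field.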