Gene A9601_13351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_13351 
SymbolpsbC 
ID4718054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1109686 
End bp1111068 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content41% 
IMG OID640079054 
Productphotosystem II PsbC protein (CP43) 
Protein accessionYP_001009726 
Protein GI123968868 
COG category 
COG ID 
TIGRFAM ID[TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAACGC CCTTTAATAA TTTATTAAGA GCTCCAAACC AAAGTATTGA GGAAACTGGT 
TATGCCTGGT ATGTAGGCAA CGCTAGATTA ATCAATTTAT CTGGACGTTT ATTGGGAGCT
CACATTGCTC ACTCTGGACT AATAGTCTTT TGGGCGGGAG CAATGATGCT CTTTGAGGTT
AATCATTTCA CTTTTGATAA ACCAATGTGG GAGCAAGGTT TAATCTGTAT GCCACACGTT
GCAATGTTTG GCTACGGGAT AGGCCCAGGT GGTGAAGTTA CTGATATCAT GCCTTTCTTC
CAAGCAGGCG TGGTTCATTT GATAGCTTCT GCTGTTCTTG GTTTTGGTGG TATTTACCAT
TCATTAGCAG GCCCAGAAAA ACTTGAAGAA GATTTTCCAT TTTTCTCCAC CGATTGGAGA
GATAAAAATC AAATGACAAA TATTCTTGGA TATCATTTGA TTGTATTAGG TGTAGGTGCA
TTAGCATGGT CAGTTAACTG GTGTTTTATT GGCGGTGCAT ATGACACATG GGCACCTGGT
GGTGGTGAAG TCAGACTTGT TAATCCAACT TTAGATCCGA GAGTTATTCT TGGTTATCTA
TTTAGATCTC CATGGGGAGG AGCTGGTTCA ATAATCGGTG TCAACTCCAT CGAAGATATT
GTTGGTGGAC ACGTTTACGT GGGTATTACT GCAATTATTG GAGGAATATT CCATATCTTC
ACAAAACCCT TTGGATGGGC AAGAAGAGCA TTCATCTGGA ATGGTGAAGG ACTATTAAGT
TATGCTCTTG GTGGAATTTG TGTAGCAAGT TTCATTGCTT CAACATTCAT TTGGTTTAAC
AATACTGCTT ACCCTTCAGA ATTTTATGGC CCAACAAATG CTGAAGCTTC ACAGGCTCAA
AGCTTTACTT TCCTAGTGAG AGACCAAAGA ATTGGAGCTA ACGTAGGTTC AACAATGGGA
CCAACAGGTC TAGGTAAGTA TCTCATGAGA TCTCCTACTG GTGAAATTAT ATTTGGTGGT
GAAACAATGA GATTTTGGGA TTTCAGAGGT CCATGGTTAG AGCCTTTAAG AGGTCCTAAT
GGTTTGAGCC TTGAGAAAAT TCAAAATGAT ATTCAGCCTT GGCAGGTAAG AAGAGCAGCT
GAATATATGA CTCATGCTCC TAACGCTTCT ATCAACTCTG TTGGTGGAAT CATTACAGAG
CCAAATGCTG TTAACTTCGT TAACCTAAGA CAATGGTTAG CTGCAGCTCA ATTCTTCCTA
GGATGGTTTA CATTTATCGG TCATCTTTGG CATGCTGGAC GTGCTAGAGC TGCCGCTGCT
GGTTTCGAGA AAGGAATCGA CAGAAAGAGT GAACCAGCTC TAGAAATGCC TGATCTAGAT
TAA
 
Protein sequence
METPFNNLLR APNQSIEETG YAWYVGNARL INLSGRLLGA HIAHSGLIVF WAGAMMLFEV 
NHFTFDKPMW EQGLICMPHV AMFGYGIGPG GEVTDIMPFF QAGVVHLIAS AVLGFGGIYH
SLAGPEKLEE DFPFFSTDWR DKNQMTNILG YHLIVLGVGA LAWSVNWCFI GGAYDTWAPG
GGEVRLVNPT LDPRVILGYL FRSPWGGAGS IIGVNSIEDI VGGHVYVGIT AIIGGIFHIF
TKPFGWARRA FIWNGEGLLS YALGGICVAS FIASTFIWFN NTAYPSEFYG PTNAEASQAQ
SFTFLVRDQR IGANVGSTMG PTGLGKYLMR SPTGEIIFGG ETMRFWDFRG PWLEPLRGPN
GLSLEKIQND IQPWQVRRAA EYMTHAPNAS INSVGGIITE PNAVNFVNLR QWLAAAQFFL
GWFTFIGHLW HAGRARAAAA GFEKGIDRKS EPALEMPDLD