Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_16081 |
Symbol | psbC |
ID | 4781055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1314949 |
End bp | 1316331 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640084890 |
Product | photosystem II PsbC protein (CP43) |
Protein accession | YP_001015430 |
Protein GI | 124026314 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.548682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAACGC CCTTTAATAG TTTACTCAAC GCTCCTAATC AAAGCCTTGA AGAGACTGGT TACGCCTGGT ATGTAGGAAA TGCAAGGCTA ATCAATCTTT CAGGAAGACT TTTAGGTGCT CACATTGCTC ACGCAGGACT AATTGTGTTC TGGGCAGGCG CGATGATGCT TTTCGAAGTA AGTCACTTCA CCATGGATAA ACCCATGTGG GAACAAGGCT TAATTTGTAT GCCTCACGTA GCCATGTTTG GATACGGCAT TGGTCCAGGT GGAGAAGTCA CTGATGTATG GCCTTTCTTC ATTGCTGGTG TTATTCACCT AGTTGCATCT GGAATCCTTG GCTTTGGTGG TGTTTTTCAC TCCCTTGCAG GACCAGAGAA ACTTGAAGAA GATTTCCCAT TCTTCTCCAC TGACTGGAGA GATAAAAACC AAATGACCAA TATTCTTGGT TTTCATTTGG TTGTTCTTGG TGTTGGAGCT CTTCTATGGT CCATTAACTG GATGTATATA GGTGGTGCAT ACGACACTTG GGCTCCAGGT GGAGGAGAAG TTAGGTTGAT CAACCCAACA CTCGATCCAA GAATTATTTT TGGATATCTG CTATCAACCC CTTGGGGTGG TGGTGGTTGG ATGGTTGGTG TTAACTCAAT GGAAGATATT GTCGGAGGAC ATGTTTACCT GGGAGTGATT GAAATAATTG GTGGTCTTTT CCATATCTTC ACTCAGCCTT ATGGGTGGGC AAGGAGAGCC TTTATCTGGA ACGGTGAAGG ACTTCTAAGT TATGCATTAG GTGGAATCTG TGTCGCAAGT TTTGTTGCCT CATGTTTCAT CTGGTTTAAC AACACTGCTT ATCCATCTGA GTTCTACGGC CCAACAAACG CTGAAGCTTC TCAGGCCCAA AGTTTTACAT TCCTGGTTCG TGACCAACGA ATTGGAGCAA ATGTAGGTTC AACAATGGGT CCTACTGGTT TAGGAAAATA CCTAATGCGT TCTCCTACTG GTGAAATCAT CTTTGGTGGA GAAACTATGC GTTTTTGGGA TTTCCGAGGA CCATGGCTTG AGCCTCTTAG AGGACCAAAT GGTCTCAGCC TTGAGAAGAT TCAAAATGAT ATTCAGCCTT GGCAAGTTCG CCGTGCTGCT GAATACATGA CACATGCTCC AAACGCTTCT ATCAACTCAG TTGGTGGAAT CATTACTGAG CCTAACGCTG TTAACTTTGT TAACTTGCGT CAATGGCTAG CTGGTGCTCA ATTCTTCCTT GGTTGGTTTA CTTTTGTAGG TCATCTTTGG CATGCTGGTC GTGCTAGAGC CGCTGCAGCT GGATTTGAAA AAGGTATCAG TCGTTCACAA GAGCCTGCTC TTTCAATGCC TGATCTAGAT TAG
|
Protein sequence | METPFNSLLN APNQSLEETG YAWYVGNARL INLSGRLLGA HIAHAGLIVF WAGAMMLFEV SHFTMDKPMW EQGLICMPHV AMFGYGIGPG GEVTDVWPFF IAGVIHLVAS GILGFGGVFH SLAGPEKLEE DFPFFSTDWR DKNQMTNILG FHLVVLGVGA LLWSINWMYI GGAYDTWAPG GGEVRLINPT LDPRIIFGYL LSTPWGGGGW MVGVNSMEDI VGGHVYLGVI EIIGGLFHIF TQPYGWARRA FIWNGEGLLS YALGGICVAS FVASCFIWFN NTAYPSEFYG PTNAEASQAQ SFTFLVRDQR IGANVGSTMG PTGLGKYLMR SPTGEIIFGG ETMRFWDFRG PWLEPLRGPN GLSLEKIQND IQPWQVRRAA EYMTHAPNAS INSVGGIITE PNAVNFVNLR QWLAGAQFFL GWFTFVGHLW HAGRARAAAA GFEKGISRSQ EPALSMPDLD
|
| |