Gene NATL1_16081 details

Gene Information

Locus tag: NATL1_16081
Symbol: psbC
ID: 4781055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1314949
End bp: 1316331
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 45%
IMG OID: 640084890
Product: photosystem II PsbC protein (CP43)
Protein accession: YP_001015430
Protein GI: 124026314
COG category:
COG ID:
TIGRFAM ID: [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast
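
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 1314949
END_BP = 1316331


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1383 460
```

Both results agree with the Gene Length and Protein Length fields.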


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.548682
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAACGC CCTTTAATAG TTTACTCAAC GCTCCTAATC AAAGCCTTGA AGAGACTGGT 
TACGCCTGGT ATGTAGGAAA TGCAAGGCTA ATCAATCTTT CAGGAAGACT TTTAGGTGCT
CACATTGCTC ACGCAGGACT AATTGTGTTC TGGGCAGGCG CGATGATGCT TTTCGAAGTA
AGTCACTTCA CCATGGATAA ACCCATGTGG GAACAAGGCT TAATTTGTAT GCCTCACGTA
GCCATGTTTG GATACGGCAT TGGTCCAGGT GGAGAAGTCA CTGATGTATG GCCTTTCTTC
ATTGCTGGTG TTATTCACCT AGTTGCATCT GGAATCCTTG GCTTTGGTGG TGTTTTTCAC
TCCCTTGCAG GACCAGAGAA ACTTGAAGAA GATTTCCCAT TCTTCTCCAC TGACTGGAGA
GATAAAAACC AAATGACCAA TATTCTTGGT TTTCATTTGG TTGTTCTTGG TGTTGGAGCT
CTTCTATGGT CCATTAACTG GATGTATATA GGTGGTGCAT ACGACACTTG GGCTCCAGGT
GGAGGAGAAG TTAGGTTGAT CAACCCAACA CTCGATCCAA GAATTATTTT TGGATATCTG
CTATCAACCC CTTGGGGTGG TGGTGGTTGG ATGGTTGGTG TTAACTCAAT GGAAGATATT
GTCGGAGGAC ATGTTTACCT GGGAGTGATT GAAATAATTG GTGGTCTTTT CCATATCTTC
ACTCAGCCTT ATGGGTGGGC AAGGAGAGCC TTTATCTGGA ACGGTGAAGG ACTTCTAAGT
TATGCATTAG GTGGAATCTG TGTCGCAAGT TTTGTTGCCT CATGTTTCAT CTGGTTTAAC
AACACTGCTT ATCCATCTGA GTTCTACGGC CCAACAAACG CTGAAGCTTC TCAGGCCCAA
AGTTTTACAT TCCTGGTTCG TGACCAACGA ATTGGAGCAA ATGTAGGTTC AACAATGGGT
CCTACTGGTT TAGGAAAATA CCTAATGCGT TCTCCTACTG GTGAAATCAT CTTTGGTGGA
GAAACTATGC GTTTTTGGGA TTTCCGAGGA CCATGGCTTG AGCCTCTTAG AGGACCAAAT
GGTCTCAGCC TTGAGAAGAT TCAAAATGAT ATTCAGCCTT GGCAAGTTCG CCGTGCTGCT
GAATACATGA CACATGCTCC AAACGCTTCT ATCAACTCAG TTGGTGGAAT CATTACTGAG
CCTAACGCTG TTAACTTTGT TAACTTGCGT CAATGGCTAG CTGGTGCTCA ATTCTTCCTT
GGTTGGTTTA CTTTTGTAGG TCATCTTTGG CATGCTGGTC GTGCTAGAGC CGCTGCAGCT
GGATTTGAAA AAGGTATCAG TCGTTCACAA GAGCCTGCTC TTTCAATGCC TGATCTAGAT
TAG
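
A GC content figure like the one in the record can be recomputed from a sequence printed in this layout. This is a minimal sketch, assuming the value is the plain fraction of G and C bases; how the database rounds it is not stated here. `gc_percent` is a hypothetical helper, and the usage line passes only the first two 10-base groups of the gene sequence, so its result describes that fragment and not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces and line breaks from the 10-base grouping are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# Usage on a short fragment only (first two groups of the gene sequence).
fragment = "GTGGAAACGC CCTTTAATAG"
print(round(gc_percent(fragment), 1))
```

To check the record's value, paste the full gene sequence as one string; the whitespace does not need to be removed first.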
 
Protein sequence
METPFNSLLN APNQSLEETG YAWYVGNARL INLSGRLLGA HIAHAGLIVF WAGAMMLFEV 
SHFTMDKPMW EQGLICMPHV AMFGYGIGPG GEVTDVWPFF IAGVIHLVAS GILGFGGVFH
SLAGPEKLEE DFPFFSTDWR DKNQMTNILG FHLVVLGVGA LLWSINWMYI GGAYDTWAPG
GGEVRLINPT LDPRIIFGYL LSTPWGGGGW MVGVNSMEDI VGGHVYLGVI EIIGGLFHIF
TQPYGWARRA FIWNGEGLLS YALGGICVAS FVASCFIWFN NTAYPSEFYG PTNAEASQAQ
SFTFLVRDQR IGANVGSTMG PTGLGKYLMR SPTGEIIFGG ETMRFWDFRG PWLEPLRGPN
GLSLEKIQND IQPWQVRRAA EYMTHAPNAS INSVGGIITE PNAVNFVNLR QWLAGAQFFL
GWFTFVGHLW HAGRARAAAA GFEKGISRSQ EPALSMPDLD
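
The gene sequence begins with GTG while the protein sequence begins with M, which is what one would expect when an alternative start codon is read as the initiator methionine. The sketch below shows one way to reproduce the protein from the gene sequence. It is an illustration under stated assumptions, not the database's own method: it assumes translation table 11 assigns the same amino acids as the standard code for internal codons, that the first codon is a valid start and is therefore written as M, and that translation stops at the first stop codon. `translate_cds` and the table-building constants are hypothetical names.

```python
# Standard amino-acid assignments with bases ordered T, C, A, G.
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS given as text, ignoring spaces and line breaks."""
    dna = "".join(seq.upper().split())
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 bases")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    # Assumption: the first codon is a valid start and is read as Met.
    residues[0] = "M"
    protein = "".join(residues)
    # Stop at the first stop codon, if one is present.
    stop = protein.find("*")
    return protein if stop == -1 else protein[:stop]


# Usage: the first six codons of the gene sequence followed by its stop codon.
print(translate_cds("GTGGAAACGC CCTTTAAT TAG"))  # METPFN
```

The six residues printed match the start of the protein sequence listed above.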