Gene PMN2A_0722 details

Gene Information

Locus tag: PMN2A_0722
Symbol:
ID: 3606100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 1214932
End bp: 1216020
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 42%
IMG OID: 637687585
Product: chlorophyll a/b binding light harvesting protein PcbE
Protein accession: YP_291916
Protein GI: 72382561
COG category:
COG ID:
TIGRFAM ID: [TIGR03041] chlorophyll a/b binding light-harvesting protein
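
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive.
START_BP = 1214932
END_BP = 1216020


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1089 362
```

Both results agree with the Gene Length and Protein Length fields of the record.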


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.683508
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTCCT ACGGAAACCC AGACGTCACC TACGGGTGGT GGGTTGGTAA TTCTGTCGTA 
ACAAATAAGT CAAGCCGATT TATTGGCTCG CATGTTGCTC ATACAGGATT GATTTGTTTC
GCAGCTGGTG CCAACACACT TTGGGAGCTC GCTAGATACA ACCCAGATAT TCCAATGGGA
CACCAAGGAA TGGTGAGCAT CCCACACCTT GCTTCTATTG GTATTGGATT TGATCCAACT
GGAACAGTAT TCGACGGAAC ATCAATTGCT TTTATCGGAG TATTCCATCT GATTTGTTCA
ATGGTTTATG CGGGTGCAGG TCTATTGCAC TCTCTGATTT TTAGCGAAGA TACCCAAAAT
AGTTCAGGTT TGTTTGCTGA TGATCGTCCT GAACATCGTC AGGCAGCAAG ATACAAGCTT
GAATGGGATA ATCCAGATAA TCAGACTTTT ATTCTTGGTC ACCATTTGAT TTTCTTTGGT
GTTGCATGTA TTTGGTTTGT TGAGTGGGCT CGAATACATG GGATTTACGA TCCTGCAATA
GGAGCTGTTC GACAAGTCGA GTACAACTTA AACTTGACCA ACATTTGGAA TCATCAGTTT
GATTTCTTGG CTATTGATAG TCTGGAGGAT GTTATGGGTG GTCATGCATT CTTAGCATTT
GTTGAGATCA CAGGTGGTGC TTTCCATATC GCTACGAAGC AGACTGGAGA ATACACAGAA
TTCAAAGGGA AGAATATTCT TTCTGCTGAA GCAGTTCTTT CCTGGTCTCT TGCTGGTATT
GGTTGGATGG CAATTATTGC CGCTTTCTGG TGTGCAACCA ATACAACTGT TTATCCAGAG
GCTTGGTACG GAGAAACATT AGCTCTTAAG TTTGGAATCT CTCCATATTG GATTGATACT
GCTGATATGA CTGGTGTCGT TAGTGGTCAT ACTTCAAGAG CTTGGCTTGC GAATGTTCAT
TACTATCTTG GTTTCTTCTT TATTCAAGGA CACCTTTGGC ATGCAATACG TGCTCTAGGC
TTTGATTTCA AAAAGGTTAC TGATGCAATT AGTAATCTTG ATGGAGCAAG AGTTACTCTA
ACTGATTGA
 
Protein sequence
MQSYGNPDVT YGWWVGNSVV TNKSSRFIGS HVAHTGLICF AAGANTLWEL ARYNPDIPMG 
HQGMVSIPHL ASIGIGFDPT GTVFDGTSIA FIGVFHLICS MVYAGAGLLH SLIFSEDTQN
SSGLFADDRP EHRQAARYKL EWDNPDNQTF ILGHHLIFFG VACIWFVEWA RIHGIYDPAI
GAVRQVEYNL NLTNIWNHQF DFLAIDSLED VMGGHAFLAF VEITGGAFHI ATKQTGEYTE
FKGKNILSAE AVLSWSLAGI GWMAIIAAFW CATNTTVYPE AWYGETLALK FGISPYWIDT
ADMTGVVSGH TSRAWLANVH YYLGFFFIQG HLWHAIRALG FDFKKVTDAI SNLDGARVTL
TD
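
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the standard NCBI ordering; translation table 11 assigns the same amino acids as the standard code and differs in which start codons are permitted, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last rows of the gene sequence are used.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G)
# paired with their amino acids; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence above; the last row is in frame
# because the 1080 bases before it are a multiple of 3.
first_row = "ATGCAGTCCT ACGGAAACCC AGACGTCACC TACGGGTGGT GGGTTGGTAA TTCTGTCGTA"
last_row = "ACTGATTGA"

print(translate(first_row))  # MQSYGNPDVTYGWWVGNSVV
print(translate(last_row))   # TD
```

The two outputs match the first 20 residues and the final two residues of the protein sequence listed above; the trailing TGA is the stop codon.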