Gene A9601_13631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_13631 
Symbol 
ID4718083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1133998 
End bp1135056 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content37% 
IMG OID640079083 
Productchlorophyll a/b binding light harvesting protein PcbD 
Protein accessionYP_001009754 
Protein GI123968896 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTACAAA CTTACGGAAA ATCTGATGTC ACCTATGACT GGTACGCAGG GAATTCTGGT 
GTTGTTGGCC GTTCAGGTAA ATTCATAGCT GCTCATGCTG CCCATGCAGG CCTAATGATG
TTTTGGGCAG GAGCTTTTGG ATTATTTGAA TTGGCTCGTT ACGACGCCAG TATTCCAATG
GGCGCACAGA AAGCAATTGT TTTGCCTCAC CTAGCGGGTA TTGGAATTGG TGGCATTGAA
AATGGTGTTA TTACTGAACC ATATGGAATA GTTGTAATTT GCACATTACA TCTAATTTTC
TCAGCAGTAT TGGGTGCTGG TGGATTATTA CACTCCAATA AATTTGCAGG TGATCTTGGA
GACTATCCAG AAAATAGTAA GCCACAAAAA TTTGATTTTG AATGGGATGA TCCAGATAAA
TTAACTTTTA TTCTTGGTCA TCATCTAATC TTTCTTGGAC TTGGAGCAAT TATGTTCGTT
GAATGGGCTC GAATTCATGG AATTTACGAC CCAGCAATAG GATCCACGAG ACAAGTTATT
TACAACTTAG ATATTGCCGC TATCTGGAAT CATCAATTTG ATTTTTTAAA AATAGATAGT
TTGGAAGATG TTATGGGAGG ACATGCTTTC CTAGCTTTCC TCGAAATAAT TGGAGGAGTT
TTCCATATTT GTACTAAACA ATTTGGAGAA TATACAGAAT TTAAAGGAAA AGGATTACTT
GGCGCTGAGG CAATCTTGTC ATACTCAGTT GTTGGTGTTT CTTATATGGC TTTTGTTGCT
GCTTTTTGGT GTGCTTCTAA TACAACTATA TATCCAGTTG ATCTATATGG AGAACCCTTG
AAGCTTCAAT TTGAATTCGC CCCTTATTTT ACTGATACAG TAGATTTAGG TTCAGGAGCG
TACAGCTCAA GAGCTTGGCT TGCTAATACT CATTTTTATT TGGGTTTCTT TTTCTTACAA
GGTCATCTTT GGCACGCACT AAGAGCAATG GGATTTGACT TTAAGAAAAT TGGTCAAGCT
TTTGATAATA TTGAAAATAC AAAAATTACT CAAAACTAG
 
Protein sequence
MLQTYGKSDV TYDWYAGNSG VVGRSGKFIA AHAAHAGLMM FWAGAFGLFE LARYDASIPM 
GAQKAIVLPH LAGIGIGGIE NGVITEPYGI VVICTLHLIF SAVLGAGGLL HSNKFAGDLG
DYPENSKPQK FDFEWDDPDK LTFILGHHLI FLGLGAIMFV EWARIHGIYD PAIGSTRQVI
YNLDIAAIWN HQFDFLKIDS LEDVMGGHAF LAFLEIIGGV FHICTKQFGE YTEFKGKGLL
GAEAILSYSV VGVSYMAFVA AFWCASNTTI YPVDLYGEPL KLQFEFAPYF TDTVDLGSGA
YSSRAWLANT HFYLGFFFLQ GHLWHALRAM GFDFKKIGQA FDNIENTKIT QN