Gene P9303_17721 details

Gene Information

Locus tag: P9303_17721
Symbol:
ID: 4778975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1547562
End bp: 1548668
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 45%
IMG OID: 640087279
Product: light-harvesting complex protein
Protein accession: YP_001017779
Protein GI: 124023472
COG category:
COG ID:
TIGRFAM ID: [TIGR03041] chlorophyll a/b binding light-harvesting protein
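
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive (the record does not state this) and that the gene length includes the untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1547562
end_bp = 1548668
gene_length_bp = 1107
protein_length_aa = 368

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1
assert span == gene_length_bp

# Assumption: the CDS length includes one stop codon,
# which contributes no amino acid to the protein.
assert gene_length_bp % 3 == 0
codons = gene_length_bp // 3
assert codons - 1 == protein_length_aa

print(span, codons)  # prints "1107 369"
```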


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAACCT ACGGGAAAAC AGACGTTACC TACGCCTGGT ACGCGGGTAA CAGTGGAGTG 
ACCAACCGAT CAGGCCGATT CATTGCCTCG CATATTGGCC ATACAGGATT AATTTGCTTC
GGCGCTGGTG CCAACACCCT ATTTGAGCTA GCTCGTTACG ATTCTGCATT GCCAATAGGT
GACCAAGGAT TTGTAGTTCT TCCTCACCTA GCAGGACTTG GTATTGGTGG CATAGAGAAC
GGTGTGATTA CAGACTCCTA TGGGATGCTT GTCGTCGCAG TCTTCCATCT AATCTTTTCG
GCCGTTTATG CCGGCGGAGC GATGCTTCAC TCCTTTCGAT ACAAGGAGGA CCTGGGAGAA
TACCCGCAAG GATCCAGACC CAACAAATTT GATTTTAAAT GGGATGATCC AGACAGGCTC
ACCTTCATAC TTGGACATCA CCTGCTATTC CTAGGTCTTG GCTGTGTTCA ATTTGTTGAA
TGGGCTAAAT ACCATGGAAT TTATGACCCA GCAATGGGTG TTGTACGTAA GGTTGAATAC
AACCTTGACT TGTCAATGGT TTGGAATCAC CAGATTGATT TCCTTACGAT TAACAGTTTG
GAAGATGTGA TGGGCGGTCA TGCATTCTTG GCCTTCTTCT TGAGTGCTGG TGCTGTTTGG
CATATTTTCA GCAAGCCATT TGGGGAATAC ACTGAATTCA AAGGAAAAGG ACTCTTATCT
GCTGAATTTG TTCTTTCTAC CTCATTAGCA GGTGCAGCCT TTATTGCTTT CGTGGCAGCC
TTCTGGGCTT CTATGAACAC TACAATTTAT CCAACTGATC TGTATGGAGG TCCTCTCAAT
ATCGAATTGA ACTTCGCTCC ATATTTCTCA GATACGGATC CATTGTTTGG TGGAGACGTA
CACTCAGCCC GTTCATGGCT GTCAAACTTC CATTTCTACC TTGGATTCTT CTATCTTCAG
GGTCATTTCT GGCATGGATT GAGAGCGATG GGCTTTGACT TCAAGCGTGT TGAAAAATTG
TTCGATCAGC TAGAAAGCAA CGAAATTAGT CTTAACCCAG GTAAAAGTAC GACCGTGCCA
TCAACATCGA CAGATAACGC CACATAA
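
A GC content figure such as the one in the Gene Information section is the fraction of G and C bases in the sequence. The sketch below shows one way to compute it; `gc_fraction` is a hypothetical helper name, and only the first 60 bases of the gene sequence are used to keep the example short, so the result for this fragment need not match the whole-gene value.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First 60 bases of the gene sequence above, in the spaced layout of the record.
first_60 = "GTGCAAACCT ACGGGAAAAC AGACGTTACC TACGCCTGGT ACGCGGGTAA CAGTGGAGTG"
print(f"{gc_fraction(first_60):.0%}")
```

Pasting the full 1107 bp gene sequence into the same function gives the whole-gene fraction.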
 
Protein sequence
MQTYGKTDVT YAWYAGNSGV TNRSGRFIAS HIGHTGLICF GAGANTLFEL ARYDSALPIG 
DQGFVVLPHL AGLGIGGIEN GVITDSYGML VVAVFHLIFS AVYAGGAMLH SFRYKEDLGE
YPQGSRPNKF DFKWDDPDRL TFILGHHLLF LGLGCVQFVE WAKYHGIYDP AMGVVRKVEY
NLDLSMVWNH QIDFLTINSL EDVMGGHAFL AFFLSAGAVW HIFSKPFGEY TEFKGKGLLS
AEFVLSTSLA GAAFIAFVAA FWASMNTTIY PTDLYGGPLN IELNFAPYFS DTDPLFGGDV
HSARSWLSNF HFYLGFFYLQ GHFWHGLRAM GFDFKRVEKL FDQLESNEIS LNPGKSTTVP
STSTDNAT
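
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an allowed start codon and an initiator codon is translated as methionine. The sketch below illustrates this on the first 60 and last 27 bases of the gene sequence. It is a minimal example, not the database's own method: `translate` and its `is_cds_start` parameter are hypothetical, and it assumes the displayed sequence is already in coding orientation, since the Strand field is empty.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); identical for tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 and last 27 bases of the gene sequence above.
first_60 = "GTGCAAACCT ACGGGAAAAC AGACGTTACC TACGCCTGGT ACGCGGGTAA CAGTGGAGTG"
last_27 = "TCAACATCGA CAGATAACGC CACATAA"

print(translate(first_60))                     # MQTYGKTDVTYAWYAGNSGV
print(translate(last_27, is_cds_start=False))  # STSTDNAT
```

Both outputs agree with the start and end of the protein sequence listed above. Translating the same first codon with `is_cds_start=False` would give V instead of M.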