Gene P9211_14281 details


Gene Information

Locus tag: P9211_14281
Symbol:
ID: 5730686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1289835
End bp: 1290932
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 43%
IMG OID: 641285805
Product: hypothetical protein
Protein accession: YP_001551313
Protein GI: 159903969
COG category:
COG ID:
TIGRFAM ID: [TIGR03041] chlorophyll a/b binding light-harvesting protein
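
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1289835
end_bp = 1290932

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1098
print(protein_length)   # 365
```

Both results agree with the Gene Length (1098 bp) and Protein Length (365 aa) fields.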


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00517015
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAAACCT ATGGGAATCC AAACCCCACT TACGGGTGGT GGGTTGGTAA TTCTGTAGTA 
ACCAACAAGT CAAGTCGATT TATAGGCTCG CATGTTGCGC ATACAGGATT GATTGCATTT
ACCGCTGGCG CAAACACACT TTGGGAACTT GCCCGTTACA ACCCTGATAT CCCTATGGGA
CATCAAGGAA TGGTAAGCAT CCCTCATTTG GCATCTATTG GCATTGGTTT TGACCAAGCT
GGAGTATGGA CCGGGCAAGA TGTTGCTTTC ATTGGCATCT TTCACCTGAT TTGTTCATTT
GTATATGCCC TGGCTGGACT ATTGCACTCA ATAGTTTTCA GTGAAGACAC TCAAAACTCA
TCAGGCCTTT TTGCTGAAGG TCGTCCCGAG CATCGTCAAG CGGCTAGATA CAAGCTTGAA
TGGGATAACC CAGATAACCA AACCTTTATT CTTGGACACC ATTTGATTTT CTTTGGTGTT
GCATGTATTT GGTTTGTTGA ATGGGCAAGA ATTCATGGTA TTTACGATCC TGCTATTGGT
GCAGTTCGCC AGGTTGAGTA CAACTTGAAC TTGAATGCTA TTTGGAACCA TCAATTTGAC
TTCTTGACTA TAGATAGCCT TGAAGATGTA ATGGGAGGCC ATGCATTCTT GGCTTTTGCT
GAGATTCTTG GTGGAGCTCA CCACATTGCA ACCAAGATGG GTTCTGGAGC TCTTGGAGAA
TATACTGAAT TCAAAGGTAA GAATGTTTTG TCAGCTGAGG CCGTTCTTTC TTGGTCTTTA
GCTGGTATTG GCTGGATGGC AATTATTGCT GCATTCTGGT GCGCTACTAA CACAACTGTT
TACCCTGAAG CTTGGTATGG CGAACCTCTT GCTATCAAAT TTGGAATTTC TCCTTATTGG
ATAGACACAG GAAACATGGA TGGTGTTGTT ACCGGTCACA CATCTCGTGC ATGGCTGACT
AATGTTCATT ATTATCTTGG ATTCTTCTTT ATCCAAGGTC ATTTATGGCA TGCAATTCGT
GCATTGGGCT TTGACTTCAA GCGAGTTACA AATGCTATCG GTAACTTAGA CAATCAAAAA
ATTACTCTTA ATGGTTGA
 
Protein sequence
MQTYGNPNPT YGWWVGNSVV TNKSSRFIGS HVAHTGLIAF TAGANTLWEL ARYNPDIPMG 
HQGMVSIPHL ASIGIGFDQA GVWTGQDVAF IGIFHLICSF VYALAGLLHS IVFSEDTQNS
SGLFAEGRPE HRQAARYKLE WDNPDNQTFI LGHHLIFFGV ACIWFVEWAR IHGIYDPAIG
AVRQVEYNLN LNAIWNHQFD FLTIDSLEDV MGGHAFLAFA EILGGAHHIA TKMGSGALGE
YTEFKGKNVL SAEAVLSWSL AGIGWMAIIA AFWCATNTTV YPEAWYGEPL AIKFGISPYW
IDTGNMDGVV TGHTSRAWLT NVHYYLGFFF IQGHLWHAIR ALGFDFKRVT NAIGNLDNQK
ITLNG
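
As a rough illustration of how the two sequences relate, the sketch below strips the 10-character block spacing, computes a GC fraction, and translates codons until the first stop. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database tool. The codon table is the standard one; translation table 11 shares its amino-acid assignments and differs only in which codons may serve as start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base until a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
first_blocks = "ATGCAAACCT ATGGGAATCC AAACCCCACT"
last_line = "ATTACTCTTA ATGGTTGA"

print(translate(first_blocks))  # MQTYGNPNPT
print(translate(last_line))     # ITLNG
```

The two translated fragments match the first block and the last block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.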