Gene P9211_09521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_09521 
Symbol 
ID5731408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp848316 
End bp849374 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content43% 
IMG OID641285319 
Producthypothetical protein 
Protein accessionYP_001550837 
Protein GI159903493 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000722234 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCAGACCT ACGGGAACCC AAACGTTACC TATGACTGGT ACGCGGGTAA CTCTGGGGTG 
ACAAACCGTT CAGGAAAATT CATCGCCGCT CATGCAGCTC ATGCAGGTTT GATGATGTTC
TGGGCAGGTG CGTTCACTCT TTTCGAACTC GCTCGTTATG ACTCATCTAT TCCGATGGGT
AATCAAAACC TTATTTGTTT GCCTCATCTT GCTGGACTTG GCATAGGTGG TGTTTCTAAT
GGGGTTATTA CTGAACCTTA TGGCTGCACA GTAGTAGCTG TATTACACCT CATTTTCTCA
GGAGTACTTG GTGCAGGAGG ACTCTTGCAT TCAATGAGAT ATGAAGGAGA TCTAGGTAAT
TATCCTGACT CAGCCAGAGC TAAAAAGTTC GACTTCGAGT GGGACGATCC AGACAGATTG
ACCTTTATTC TTGGTCACCA CCTTATTTTC CTAGGATTAG GCAACATTCA GTTCGTTGAA
TGGGCAAGAA TTCATGGAAT TTATGATGCT GCCAAAGGTA TTACTAGAAC TGTTGATTAC
AACCTAGACC TTGGAATGGT ATGGAATCAC CAAGCTGATT TCCTATCAAT ATCTAGCCTG
GAAGACGTAA TGGGTGGTCA TGCATTCCTT GCATTCTTCC TAATTACTGG TGGAGCATTC
CACATTGCTA CCAAGCAATT TGGTGAATAC ACCGAATTTA AAGGAAAAGG ACTTTTATCA
GCAGAGTCAA TCCTCTCTTA TTCATTAGCT GGTGTTGCAT ATTGTGCATT TGTTGCAGCC
TTCTGGTGTT CAACAAATAC AACTGTTTAT CCAACAGACC TTTATGGTGA AGTTCTAAAA
CTTCAATTCG ATTTCGCACC ATACTTTGCA GATACAGCAT CTGACTTACC TGCAGATGTA
CATACTGCAA GAGCTTGGCT TGCTAACTCT CATTTCTTCC TTGGATTCTT CTACCTTCAA
GGACACCTAT GGCATGCCCT TAGAGGAATG GGATTTGACT TTAAGCGTGT AGGTAAAGCG
TTTGACAATA TGGAGAACGC CAAAATAACA GCTGGATAA
 
Protein sequence
MQTYGNPNVT YDWYAGNSGV TNRSGKFIAA HAAHAGLMMF WAGAFTLFEL ARYDSSIPMG 
NQNLICLPHL AGLGIGGVSN GVITEPYGCT VVAVLHLIFS GVLGAGGLLH SMRYEGDLGN
YPDSARAKKF DFEWDDPDRL TFILGHHLIF LGLGNIQFVE WARIHGIYDA AKGITRTVDY
NLDLGMVWNH QADFLSISSL EDVMGGHAFL AFFLITGGAF HIATKQFGEY TEFKGKGLLS
AESILSYSLA GVAYCAFVAA FWCSTNTTVY PTDLYGEVLK LQFDFAPYFA DTASDLPADV
HTARAWLANS HFFLGFFYLQ GHLWHALRGM GFDFKRVGKA FDNMENAKIT AG