Gene P9211_03221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_03221 
Symbol 
ID5731535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp304956 
End bp305975 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content42% 
IMG OID641284669 
ProductYcf48-like protein 
Protein accessionYP_001550207 
Protein GI159902863 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCGAT TATTCAAGTT TTCTGCAAAT CTTGCTTTAT TTATTTGTCT TTTATTCGTC 
TTAAGCGGTT GTGTATCAAC AACGCGCCTC CCAATTGCAA GCTCTACTCC TTGGGAGCAA
ATTCAACTTG CTAGTGATGA CAACCCGCTA GATATAGCTT TTGTGGATAA AAATCATGGT
TTTTTGGTTG GGGCAAATCG TTTAATTCGT GAAACGAGTG ATGGAGGAGC CACTTGGCAA
AATAGGGAGT TAGACCTTGG TAGTGAAGAA AATTTCCGCT TAATCAGTAT TGACTTCTTA
GGAAATGAAG GCTGGATAGT AGGTCAGCCA GGACTAGTAA TGCATAGCAA TGATGGGGGA
CAAAGTTGGG TTCGTTTGGT CCTTGGCAAT AAATTGCCTG GAGATCCTTA CTTGATAACC
ACTTTGGGGA ATGATTCTGC TGAATTGGCA ACAACCGCAG GTGCTGTATA TCGTACTAAT
GATGGAGGAT CTAATTGGGA GTCCAAAGTG GCTGAAGCTT CAGGGGGAGC TAGAGATCTA
CGAAGAAGCA ATGATGGTGA TTATGTAAGC GTGAGTAGCC TCGGTAATTT TTTCAATACG
CTTGAACAAG GTCAGGGAAA TTGGCAACCT CATCAAAGAG CTAGTAGTAA ACGGGTCCAG
ACATTGGGTT ATCAACCCAA CGGCTCTTTA TGGATGTTGT CTAGAGGAGC TGAGATTCGT
TTTAATGACA GTACTGGTAA CTATGAGAGC TGGTCTAAGC CAAAAGTCCC AATCGTTAAT
GGATACAATT ATTTGGATAT GGCTTGGGAT CCAAATGGGG ATATTTGGGC AGGAGGAGGG
AATGGAACCC TTTTAGTTAG CAAAGATGGT GGGGAAAATT GGGAAAAAGA TCCGATTGGT
GAATTAATTC CAACAAATTT TATTCGCATC CTTTTTATTG ATAATGAGAT GTCTGATCAG
CCAAAAGGCT TTGCAATTGG TGAAAGAGGG CATCTTTTGA GGTGGGTTGG ATACGCTTAA
 
Protein sequence
MNRLFKFSAN LALFICLLFV LSGCVSTTRL PIASSTPWEQ IQLASDDNPL DIAFVDKNHG 
FLVGANRLIR ETSDGGATWQ NRELDLGSEE NFRLISIDFL GNEGWIVGQP GLVMHSNDGG
QSWVRLVLGN KLPGDPYLIT TLGNDSAELA TTAGAVYRTN DGGSNWESKV AEASGGARDL
RRSNDGDYVS VSSLGNFFNT LEQGQGNWQP HQRASSKRVQ TLGYQPNGSL WMLSRGAEIR
FNDSTGNYES WSKPKVPIVN GYNYLDMAWD PNGDIWAGGG NGTLLVSKDG GENWEKDPIG
ELIPTNFIRI LFIDNEMSDQ PKGFAIGERG HLLRWVGYA