Gene P9211_05441 details


Gene Information

Locus tag: P9211_05441
Symbol:
ID: 5731152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 509372
End bp: 510388
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 42%
IMG OID: 641284903
Product: protochlorophyllide oxidoreductase
Protein accession: YP_001550429
Protein GI: 159903085
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID: [TIGR01289] light-dependent protochlorophyllide reductase
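
The gene length and protein length in this record follow from the start and end coordinates. The short Python sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 509372
END_BP = 510388


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1017 338
```

Both values agree with the Gene Length (1017 bp) and Protein Length (338 aa) fields.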


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCAT CTCAAGCTGC TCCAGGGACT GTTTTGATTA CAGGTACTAC TTCTGGCGTT 
GGTCTATATG CAACCAAGGC CTTGTTGGAA CTTGGTTGGC GAGTTGTTAC CGCTAATAGA
TCCCCTTTGA GATCTGAAGC GGCAGCTGTC AAGCTAGGTT TGCCATTTGG GAGCCCCCGC
CAGCTTCAGC ATATTTATAT GGATCTTGGT GACTTAGATA GTGTTCGAAA TGGTGTCGAA
AACCTTTTGA ACACGCTTGA AAAACCTTTA GATGCTTTGG TTTGTAATGC AGCTGTTTAT
ATGCCCCGAC TTGCTAAACC CAAAAGATCT GCTCAAGGAT ATGAACTTTC TATGGCAACT
AATCATTTCG GACATTTTTT GCTCATACAA CTTTTATTGG AACATTTAAG TGGATCCAAA
AGACCTGTTT GGCAAGGTAG ATCTTGGGGG TTTGAAGCCC CAAGATTGGT AATGTTGGGC
ACGGTTACGG CAAATTATAA AGAATTAGGC GGTAAAATTC CTATACCCGC TCCAGCAGAT
TTAGGAGATT TATCTGGATT TGAGCAAGGA TTTAGAGATC CTATAAGCAT GGCAAGTGGA
AAACGTTTTA AGCCTGGCAA AGCATATAAA GACAGCAAGC TTTGCAATAT GGTTACTATT
CAAGAATTAC ATAGACGCTA TAAAGACTCT CCTATCCTTT TTAGTTCGCT CTATCCAGGC
TGCGTTGCTA ATACAAAGCT TTTTAGAAGC ACACCCAAGA TATTCCAATG GCTTTTCCCC
TGGTTCCAGA AGTTGATTAC AGGGGGGTTT GTTAGTGAGG ATTTAGCTGG AAAAAGAGTC
GCTCAAGTAG TTTCTGACCC TGAATTTGGC GTTTCAGGTG TTCATTGGAG TTGGGGAAAT
AGGCAACGGA AAAATCGGCA ACAATTCTCC CAGCAATTAT CTGATCGAAT TACTGACCCC
AAAACATCTC AGAATGTTTG GGATTTATCC ATGAGACTTG TTGGATTAAG TTCCTAA
 
Protein sequence
MASSQAAPGT VLITGTTSGV GLYATKALLE LGWRVVTANR SPLRSEAAAV KLGLPFGSPR 
QLQHIYMDLG DLDSVRNGVE NLLNTLEKPL DALVCNAAVY MPRLAKPKRS AQGYELSMAT
NHFGHFLLIQ LLLEHLSGSK RPVWQGRSWG FEAPRLVMLG TVTANYKELG GKIPIPAPAD
LGDLSGFEQG FRDPISMASG KRFKPGKAYK DSKLCNMVTI QELHRRYKDS PILFSSLYPG
CVANTKLFRS TPKIFQWLFP WFQKLITGGF VSEDLAGKRV AQVVSDPEFG VSGVHWSWGN
RQRKNRQQFS QQLSDRITDP KTSQNVWDLS MRLVGLSS