Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_05441 |
Symbol | |
ID | 5731152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 509372 |
End bp | 510388 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641284903 |
Product | protochlorophyllide oxidoreductase |
Protein accession | YP_001550429 |
Protein GI | 159903085 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | [TIGR01289] light-dependent protochlorophyllide reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCAT CTCAAGCTGC TCCAGGGACT GTTTTGATTA CAGGTACTAC TTCTGGCGTT GGTCTATATG CAACCAAGGC CTTGTTGGAA CTTGGTTGGC GAGTTGTTAC CGCTAATAGA TCCCCTTTGA GATCTGAAGC GGCAGCTGTC AAGCTAGGTT TGCCATTTGG GAGCCCCCGC CAGCTTCAGC ATATTTATAT GGATCTTGGT GACTTAGATA GTGTTCGAAA TGGTGTCGAA AACCTTTTGA ACACGCTTGA AAAACCTTTA GATGCTTTGG TTTGTAATGC AGCTGTTTAT ATGCCCCGAC TTGCTAAACC CAAAAGATCT GCTCAAGGAT ATGAACTTTC TATGGCAACT AATCATTTCG GACATTTTTT GCTCATACAA CTTTTATTGG AACATTTAAG TGGATCCAAA AGACCTGTTT GGCAAGGTAG ATCTTGGGGG TTTGAAGCCC CAAGATTGGT AATGTTGGGC ACGGTTACGG CAAATTATAA AGAATTAGGC GGTAAAATTC CTATACCCGC TCCAGCAGAT TTAGGAGATT TATCTGGATT TGAGCAAGGA TTTAGAGATC CTATAAGCAT GGCAAGTGGA AAACGTTTTA AGCCTGGCAA AGCATATAAA GACAGCAAGC TTTGCAATAT GGTTACTATT CAAGAATTAC ATAGACGCTA TAAAGACTCT CCTATCCTTT TTAGTTCGCT CTATCCAGGC TGCGTTGCTA ATACAAAGCT TTTTAGAAGC ACACCCAAGA TATTCCAATG GCTTTTCCCC TGGTTCCAGA AGTTGATTAC AGGGGGGTTT GTTAGTGAGG ATTTAGCTGG AAAAAGAGTC GCTCAAGTAG TTTCTGACCC TGAATTTGGC GTTTCAGGTG TTCATTGGAG TTGGGGAAAT AGGCAACGGA AAAATCGGCA ACAATTCTCC CAGCAATTAT CTGATCGAAT TACTGACCCC AAAACATCTC AGAATGTTTG GGATTTATCC ATGAGACTTG TTGGATTAAG TTCCTAA
|
Protein sequence | MASSQAAPGT VLITGTTSGV GLYATKALLE LGWRVVTANR SPLRSEAAAV KLGLPFGSPR QLQHIYMDLG DLDSVRNGVE NLLNTLEKPL DALVCNAAVY MPRLAKPKRS AQGYELSMAT NHFGHFLLIQ LLLEHLSGSK RPVWQGRSWG FEAPRLVMLG TVTANYKELG GKIPIPAPAD LGDLSGFEQG FRDPISMASG KRFKPGKAYK DSKLCNMVTI QELHRRYKDS PILFSSLYPG CVANTKLFRS TPKIFQWLFP WFQKLITGGF VSEDLAGKRV AQVVSDPEFG VSGVHWSWGN RQRKNRQQFS QQLSDRITDP KTSQNVWDLS MRLVGLSS
|
| |