Gene P9303_04491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_04491 
Symbol 
ID4777411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp451586 
End bp452806 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content55% 
IMG OID640085953 
Productaldo/keto reductase family protein 
Protein accessionYP_001016466 
Protein GI124022159 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGT CTTTGGAGCA GAGCTCTCTG CCTTGCCGCC GCTTTGGCCG CACCGGTTTA 
TCAATGCCTG TGTTGTCTTT AGGGGGGATG CGCTTTCAGC AGAGCTGGAC AGATTTGGAG
GCGGAGGTCA TTACCTCTGA GTCGCAGCAA CTGCTGCAGG ACATTTTGGA GCGAGCGGTG
GCCTGTGGCT TCCATCATGT GGAGACGGCA CGCCATTACG GCAGCTCTGA GCGGCAGTTG
GGATGGGCGC TGCGTGATGT CTTGGATCCA GAGCGGCTGT TGCAGAGCAA AGTTCCTCCT
CGTGAGGATC CCAAAGTCTT TGAGGCTGAG TTGGCACTCA GCTTTGAACG ATTGGGATGT
GAACGATTGG ATCTAGTTGC CATCCATGGC CTCAACCTTT CGGAGCATCT GGAGCAGACC
TTGCGACCAG GAGGTTGCAT GGATGTGTTG CGTCGTTGGC AGGGTGATGG ACGCATCGGC
CATGTGGGTT TTTCCACCCA TGGCCCCACA GACCTAATCG TGCAGGCGAT CGAGACGGAT
GCCTTTGATT ATGTGAACCT GCACTGGTAT TTCATTTATC AAGACAATGA TCCTGCACTG
GATGCAGCTG CTCGTCATGA CCTAGGCGTT TTCATCATTA GCCCGACAGA TAAGGGTGGC
CATCTGCATA GTCCCTCGTC TCAACTTCTG GAACTCTGCG CTCCACTTCA TCCAATTGTG
TTCAACGATC TGTTCTGCTT GCAAGACCCA AGGGTTCATA CGATCAGCGT TGGCGCAGCG
CGACCCAGTG ATCTCGATCG GCATCTCGAG GCGGTGGATC TCTTGCAGAG TGCCGCTGAG
TTGCTGCCAC CAGTTCAGCA GCGACTCGTT GATGCGGCAC AGTTGGCTTT AGGTGAGGCT
TGGTTGACCA GTTGGCATAG GGGCTTGCCG CCCTGGCAGG AGTCTCCAGG CGAGATCAAT
CTTCCGATCT TGCTTTGGCT TCATAATCTT GTAGAGGCTT GGGGAATGGA GGGTTATGCA
AAAGCCCGCT ACGGCTTACT TGGCAGTGGC AGCCACTGGT TCCCTGGAGC GAATGCCGAA
GCACTGGATG CAGATGTGAG TGAGGCGGCC CTCAGGGAGG TGTTGGTGAA CAGCCCCTGG
TGTGATCAGA TCCCAGGCTT GCTGCGTAGG TTGCGCAACC GTCTTGGTGG TCATCCTCAG
CAACGACTGA CCAGTGTTTA A
 
Protein sequence
MKASLEQSSL PCRRFGRTGL SMPVLSLGGM RFQQSWTDLE AEVITSESQQ LLQDILERAV 
ACGFHHVETA RHYGSSERQL GWALRDVLDP ERLLQSKVPP REDPKVFEAE LALSFERLGC
ERLDLVAIHG LNLSEHLEQT LRPGGCMDVL RRWQGDGRIG HVGFSTHGPT DLIVQAIETD
AFDYVNLHWY FIYQDNDPAL DAAARHDLGV FIISPTDKGG HLHSPSSQLL ELCAPLHPIV
FNDLFCLQDP RVHTISVGAA RPSDLDRHLE AVDLLQSAAE LLPPVQQRLV DAAQLALGEA
WLTSWHRGLP PWQESPGEIN LPILLWLHNL VEAWGMEGYA KARYGLLGSG SHWFPGANAE
ALDADVSEAA LREVLVNSPW CDQIPGLLRR LRNRLGGHPQ QRLTSV