Gene A9601_04961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_04961 
Symbol 
ID4717194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp431676 
End bp432773 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content31% 
IMG OID640078208 
Productaldo/keto reductase 
Protein accessionYP_001008891 
Protein GI123968033 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.583752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCTAT TCACTTTAGG GACAATGCGA GGAACTGAAA GTCTTGAAAA AATGTATAGC 
ATAATAAAAA ATGCATATTA TGTAGGAATT AATCACATAG AAACAGCACC CTCTTATGGT
GATGCTGAAT CACTTATTGG AAATTCAATA AAAAAATTAG CAATAGAAGA GAATATAAAA
GAAAAAAATT GGGTAATTAC TTCCAAAGTT TTACCAAAGG GTGATTTTGA CTTTTTAAAA
AATAATTTTA AAAAGTCTCT TAAAAATTTA AATCGCGAGA AAATTAATAA TCTTGCAATT
CACGGACTCA ACTTAAAACA ACATCTAGAT TGGGCTCTTG TTGGAGAGGG TAAGAAATTC
ATATCTTGGA TACTTGAAAA GGAACTAGTT GATCAAGTTG GTTTTAGTTC TCACGGAAGT
TATTCACTAA TTAAAGATGC AATTAACTGT GAAGTTTTTA ATTTTTGTAG TCTTCACTTA
CATTATTTAG ATCAATCTAA GATTTCTTTA GCGGAGGAAG CTATAAAAAA AGGTATGGGA
GTTTTAGCAA TATCACCTGC TGATAAAGGT GGTAAATTGT ATTCTCCAAG TGATATTTTA
ATAGAGGCCT CTAAGCCTTT TCATCCATTA GAATTAGCGT ATCGATTTCT GCTCGCAAAA
GGCGTTACAA CTTTATCCTT GGGGGCGGCA AACAAAAAAG ATTTTGAATT TGCATATAAA
CTTAGAAATT CATTCGATAA GCTTACAAAA CTTGAAAAAA GCGCCCTTAA TAAAATTGAG
GAAGTTTCTA ATGAAAGATT AAACTCAACC AAATGTGAAC AATGTAGATC TTGTCTTCCA
TGTCCAAATG AAGTGCCTAT TCCAGAAATA CTTCGTTTAA GAAATATATC TATTGGTTAT
GGCCAAATAG AATTTTCAAA AGAAAGATAC AATTTAATAG GAAAAGCTGG CCACTGGTGG
GAAGAAAAAA ATTCCTCATT TTGTCAAGAA TGTAATGAAT GTGTTCCTAA ATGTCCTAGT
AAATTAGACA TACCAAATTT ATTAACGGAA GCCCATAACT TATTAACTGA AAATCCTACA
AAAAGATTAT GGGGATAA
 
Protein sequence
MSLFTLGTMR GTESLEKMYS IIKNAYYVGI NHIETAPSYG DAESLIGNSI KKLAIEENIK 
EKNWVITSKV LPKGDFDFLK NNFKKSLKNL NREKINNLAI HGLNLKQHLD WALVGEGKKF
ISWILEKELV DQVGFSSHGS YSLIKDAINC EVFNFCSLHL HYLDQSKISL AEEAIKKGMG
VLAISPADKG GKLYSPSDIL IEASKPFHPL ELAYRFLLAK GVTTLSLGAA NKKDFEFAYK
LRNSFDKLTK LEKSALNKIE EVSNERLNST KCEQCRSCLP CPNEVPIPEI LRLRNISIGY
GQIEFSKERY NLIGKAGHWW EEKNSSFCQE CNECVPKCPS KLDIPNLLTE AHNLLTENPT
KRLWG