Gene P9211_04381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04381 
Symbol 
ID5731208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp414197 
End bp415333 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content41% 
IMG OID641284795 
Productaldo/keto reductase 
Protein accessionYP_001550323 
Protein GI159902979 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG TTCGAAGACC TTTTGGAAAA GAAGTGGAAG TCAGCCTGTT CACATTAGGG 
ACCATGAGAG CCCTTGAGTC TTCTGAAGCA ATGTATGCGG TAGTTAAAGA GGCTTGCTTG
GCTGGGATCA ATCACATAGA AACTTCTCCT TCATATGGAC CTGCTCAAAA GTTCCTAGGC
GAATCTTTAC AAAAACTTAG ATTTCATAAA ATCAATCCTC AGAATGGCTG GGTAGTCACA
AGCAAGATCC TTCCAGGCAT TACCTTCTCA GAAGGCCAAA GGCAACTACA GCAAACTTTA
GTAGACATTG GGATTCCAAA GATTGACAAT CTTGCAGTTC ATGGTCTTAA TCTTCCTGAA
CATTTAACAT GGGCCCTACA TGGAGACGGG ATTAAACTGA TTCAGTGGGC TAAAGAAAAA
AATCTTATTG CCCAGTTCGG GTTTACCTCT CATGGTGATC AATCCCTTAT AGAGAAAGCT
ATAAAAAGTC GCCAATTTAA TTTTTGTAGT CTACATTTAC ATCTTCTCGA CCAAGGTAGG
CTCCATCTTA GCAAACTTGC TTTGAATCAA GACATGGGGG TAATGGCTAT TTCACCAGCA
GACAAAGGTG GTCACTTGCA TACTCCAAGT CAAACCTTAA TTAAAGATTG CTCTCCAATA
TCTCCTATCG AATTGGCATA TAGATTTCTA TTAGCTCAAG GGGTCAGTAC ATTAACACTA
GGAGCCAATA AGCCAGAAGA GCTCTCTATA GCTAAAAAGC TAGTAGCAGC AAATGGGCAA
TTAACCAAAG CAGAGGAAGC CTCTATGAAT CGTCTATATC AAGAGGGGAA GCGTCGGCTA
GGGGATACTT TATGTGGGCA ATGTCGAGAA TGTATCCCAT GTCCAAACAA TGTTCCAATA
CCTGAGATAT TGCGATTGCG AAACTTATCT ATCGGACATG ATCTAACTTC CTTCTCAAAA
GAAAGATATA ACCTCATAGG GAAAGCAGGG CATTGGTGGG AAGAGGTTGA TGCTAGTGCT
TGCAAGAAGT GTGGGGATTG TCTACCACGT TGTCCAAATC ATCTAAAAAT ACCGGACTTA
CTTGAACAAA CACATCATCA CTTATTAGAT AGACCTAAAA GAAGATTATG GGGTTGA
 
Protein sequence
MKIVRRPFGK EVEVSLFTLG TMRALESSEA MYAVVKEACL AGINHIETSP SYGPAQKFLG 
ESLQKLRFHK INPQNGWVVT SKILPGITFS EGQRQLQQTL VDIGIPKIDN LAVHGLNLPE
HLTWALHGDG IKLIQWAKEK NLIAQFGFTS HGDQSLIEKA IKSRQFNFCS LHLHLLDQGR
LHLSKLALNQ DMGVMAISPA DKGGHLHTPS QTLIKDCSPI SPIELAYRFL LAQGVSTLTL
GANKPEELSI AKKLVAANGQ LTKAEEASMN RLYQEGKRRL GDTLCGQCRE CIPCPNNVPI
PEILRLRNLS IGHDLTSFSK ERYNLIGKAG HWWEEVDASA CKKCGDCLPR CPNHLKIPDL
LEQTHHHLLD RPKRRLWG