Gene P9303_17961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_17961 
Symbol 
ID4776227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1567116 
End bp1568018 
Gene Length903 bp 
Protein Length300 aa 
Translation table11 
GC content49% 
IMG OID640087304 
ProductShort-chain dehydrogenase/reductase (SDR) superfamily protein 
Protein accessionYP_001017803 
Protein GI124023496 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.140714 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTGGA AAGTTGCAGA TATCCCTGAT CAGAAAGGAA GGGTTGCATT AGTAACAGGT 
GCCAATAGTG GCCTTGGTCT TGAGACGGCC AAGGCATTGC TTAACAAGGG CGCCAGAGTC
ATTATGGCTT GTCGCTCACG ACCCAAAGGT GAAGCTGTAC GCCAAATTAT TCTCGAGAGC
AATGATTCCA CAAAGCTTGA TCTGATTGAG TTAGATCTCG CTGATTTAGC GAGCGTTAGA
CGTGCAGCTG AGCAGGTAGA AAGACAATAC AGCCGAGTGG ACCTGCTAAT TAATAATGCT
GGAGTAATGG CTACTCCACA GACCCTCAGC AAACAGGGTC TGGAGCTTCA GTTCGCCGTT
AATCATCTTG GCCACATGGC ATTGACCCTA AAGCTCCTAC CCTTACTCGC AAAGCAACAT
GGAGCAAGGG TTGTTACCGT CACTTCTGGC GCACAGTACA TGGGTCGAAT CGCATGGGAG
GATCTTCAAG GGATCAAGCA CTATGACCGC TGGGCTGCCT ACTCACAAAG CAAACTTGCT
AATGTGATGT TCGCCTTAGA GCTCGACAAG CGAGTGCGCA ATGCAGCAAG TGGGATTGCA
TCATTGTTGG CACATCCAGG TCTTGCACGC ACTAATTTGC AGCCTAAATC TGTTGCTGCG
AACAAATCCT GGCAGGAAGG CCTTGCTTAT CGGTTAATGG ATCCCATGTT TCAGAGCGCT
GCAATGGGAT CCCTACCGCA ATTACATGCT GCAACCGCGC CAACTGCTCA AGGCGGCGAA
CAATACGGAC CTAGATTTAA CTTTAGGGGT TACCCCAAGC TTTGCCGCAT TGCCCCGTTG
GCACTTAGGG AGGAAGACCG TCAAAGGCTT TGGAGCATCA GTGAAAAACT TTTAGAGATT
TGA
 
Protein sequence
MSWKVADIPD QKGRVALVTG ANSGLGLETA KALLNKGARV IMACRSRPKG EAVRQIILES 
NDSTKLDLIE LDLADLASVR RAAEQVERQY SRVDLLINNA GVMATPQTLS KQGLELQFAV
NHLGHMALTL KLLPLLAKQH GARVVTVTSG AQYMGRIAWE DLQGIKHYDR WAAYSQSKLA
NVMFALELDK RVRNAASGIA SLLAHPGLAR TNLQPKSVAA NKSWQEGLAY RLMDPMFQSA
AMGSLPQLHA ATAPTAQGGE QYGPRFNFRG YPKLCRIAPL ALREEDRQRL WSISEKLLEI