Gene Information |
Locus tag | P9303_17961 |
Symbol | |
ID | 4776227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1567116 |
End bp | 1568018 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640087304 |
Product | Short-chain dehydrogenase/reductase (SDR) superfamily protein |
Protein accession | YP_001017803 |
Protein GI | 124023496 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
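
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the variable names are illustrative only and do not come from the database.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
start_bp = 1567116
end_bp = 1568018

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop and is not counted in the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result matches the Gene Length (903 bp) and Protein Length (300 aa) fields of the record.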
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.140714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTGGA AAGTTGCAGA TATCCCTGAT CAGAAAGGAA GGGTTGCATT AGTAACAGGT GCCAATAGTG GCCTTGGTCT TGAGACGGCC AAGGCATTGC TTAACAAGGG CGCCAGAGTC ATTATGGCTT GTCGCTCACG ACCCAAAGGT GAAGCTGTAC GCCAAATTAT TCTCGAGAGC AATGATTCCA CAAAGCTTGA TCTGATTGAG TTAGATCTCG CTGATTTAGC GAGCGTTAGA CGTGCAGCTG AGCAGGTAGA AAGACAATAC AGCCGAGTGG ACCTGCTAAT TAATAATGCT GGAGTAATGG CTACTCCACA GACCCTCAGC AAACAGGGTC TGGAGCTTCA GTTCGCCGTT AATCATCTTG GCCACATGGC ATTGACCCTA AAGCTCCTAC CCTTACTCGC AAAGCAACAT GGAGCAAGGG TTGTTACCGT CACTTCTGGC GCACAGTACA TGGGTCGAAT CGCATGGGAG GATCTTCAAG GGATCAAGCA CTATGACCGC TGGGCTGCCT ACTCACAAAG CAAACTTGCT AATGTGATGT TCGCCTTAGA GCTCGACAAG CGAGTGCGCA ATGCAGCAAG TGGGATTGCA TCATTGTTGG CACATCCAGG TCTTGCACGC ACTAATTTGC AGCCTAAATC TGTTGCTGCG AACAAATCCT GGCAGGAAGG CCTTGCTTAT CGGTTAATGG ATCCCATGTT TCAGAGCGCT GCAATGGGAT CCCTACCGCA ATTACATGCT GCAACCGCGC CAACTGCTCA AGGCGGCGAA CAATACGGAC CTAGATTTAA CTTTAGGGGT TACCCCAAGC TTTGCCGCAT TGCCCCGTTG GCACTTAGGG AGGAAGACCG TCAAAGGCTT TGGAGCATCA GTGAAAAACT TTTAGAGATT TGA
Protein sequence | MSWKVADIPD QKGRVALVTG ANSGLGLETA KALLNKGARV IMACRSRPKG EAVRQIILES NDSTKLDLIE LDLADLASVR RAAEQVERQY SRVDLLINNA GVMATPQTLS KQGLELQFAV NHLGHMALTL KLLPLLAKQH GARVVTVTSG AQYMGRIAWE DLQGIKHYDR WAAYSQSKLA NVMFALELDK RVRNAASGIA SLLAHPGLAR TNLQPKSVAA NKSWQEGLAY RLMDPMFQSA AMGSLPQLHA ATAPTAQGGE QYGPRFNFRG YPKLCRIAPL ALREEDRQRL WSISEKLLEI
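
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons). The function name `translate` and the constants are hypothetical helpers written for this illustration, not part of any database tooling, and only the first 30 and last 9 bases of the gene sequence are used to keep the example short.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Amino acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; stop codons are shown as '*'."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks and the last 9 bases of the gene sequence.
gene_start = "ATGAGTTGGA AAGTTGCAGA TATCCCTGAT"
gene_end = "GAGATT TGA"

print(translate(gene_start))  # MSWKVADIPD
print(translate(gene_end))    # EI*
```

The first output matches the first ten residues of the protein sequence, and the second matches its last two residues followed by the stop codon.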