Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_04781 |
Symbol | |
ID | 5730601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 448300 |
End bp | 449073 |
Gene Length | 774 bp |
Protein Length | 257 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641284837 |
Product | short chain dehydrogenase |
Protein accession | YP_001550363 |
Protein GI | 159903019 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.864402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAATG AATCTCAAGA CTCCTCCTCC AATAAAACCC ATCCAATTAA TCCATGGGCT AATCGCCGAA TAGGAATCAC TGGAGCAAGA GGAAGTCTTG GAATTGCCTT AACAAAAAAA TTTAGATCGC AAGGGGCTTT TGTTATAGGA CTAAGTCACA GATCTATATC ACATCAAAAG AATTCAATTA CCTCTCCCAA TGAATGGGTT CAATGGGAAT GTGGACAGGA AGTAAAGCTT GACAAAGTGC TGGCAAGTCT AGACATTCTT ATTCTGAACC ATGGAATAAA CCCTGGTGGA AGTTGTGAGT CAAAAGATAT CAATGAATCA CTTGAAATCA ATGCACTTAG TTCATGGAGA CTATTTCAGC GCTTTGAAAA TATCTGTCTG AATAACAATC ATTCGTCTAA GAAAAACGAA ATTTGGATCA ATACTTCTGA AGCAGAGATT CAGCCTGCAC TAAGTCCTGT ATATGAAATT ACCAAGCGAT TAATTGGCCA ACTTGTTAGT TTAAAGGGGA GTTCAATAAC TAAAACGCAA AGATCTAATT TAAGAATTCG CAAACTTATC CTTGGCCCTT TTCGTTCCGA ATTAAATCCT TTAGGTTTAA TGAGTCCAGA CTGGGTGGCA GGCCAGATAA TCAACCAAGC AAGTCTTAGT TTGAATCTAA TTATTGTTAC GCCAAACCCT GTAACTTATT TTTTGGTACC TCTAAACGAA TTCTTTAGAG CAATGTATTT TAATTTATTT AAGAATAAAG GCGATATTAA TTAA
|
Protein sequence | MINESQDSSS NKTHPINPWA NRRIGITGAR GSLGIALTKK FRSQGAFVIG LSHRSISHQK NSITSPNEWV QWECGQEVKL DKVLASLDIL ILNHGINPGG SCESKDINES LEINALSSWR LFQRFENICL NNNHSSKKNE IWINTSEAEI QPALSPVYEI TKRLIGQLVS LKGSSITKTQ RSNLRIRKLI LGPFRSELNP LGLMSPDWVA GQIINQASLS LNLIIVTPNP VTYFLVPLNE FFRAMYFNLF KNKGDIN
|
| |