Gene Information |
Locus tag | Sare_4049 |
Symbol | |
ID | 5706312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4605724 |
End bp | 4606479 |
Gene Length | 756 bp |
Protein Length | 251 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273475 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001538830 |
Protein GI | 159039577 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.203044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.111569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGAGC AGCGGGTCGC GATCGTGACC GGTGCGGCGC GGGGCATCGG GGCGGCCACC GCCCGGCGGT TGGCAGCCGA TGGGCTGGCC GTCGCCGTGG TGGACATCGA CGAGCCCGCG ACCGGGGAGA CCGTCGCCGC CATCACCGCC GCCGGTGGGC GGGCGGTCGG CGTCGGCGCG GACGTGTCGG ACCGGGCCCA GGTGGAGGCG GCCGTCGAGC GGGTCGCGAC CGATCTCGGG GCGCCGACCG TGCTGGTGAA CAATGCCGGC GTGCTGCGGG ACAATCTGCT GTTCAAGATG TCGGACGCGG ACTGGGACAC GGTGCTGGGC GTGCACCTGC GGGGCGCGTT CCTGTTCAGC CAGGTCACAC AGCAGCATAT GGTCGAGCAG CAATGGGGCC GGATCGTCAA CCTCTCCAGC ACCTCCGCGC TGGGCAATCG TGGCCAGGCG AACTACTCCG CCGCCAAGGC CGGCCTGCAG GGATTCACGA AGACCCTCGC GATCGAGTTG GGTCCGTTCG GGGTGACGGT CAACGCCGTT GCTCCGGGCT TCATCGTCAC CGACATGACC GCGGCGACCG CCGCCCGGAT GAAGGTTGAC TTCGAGACGC TGCAGAAGCA CGCCGAGGCG GAGATTCCGG TCCGTCGGGT GGGCCGCCCC GAGGACGTCG CGCACACCAT CTCGTTTCTG GCCAGCTCGG GTGCGTCGTT CGTCTCCGGG CAGGTCATCT ACGTCGCCGG CGGGCCGAAG GACTGA
Protein sequence | MSEQRVAIVT GAARGIGAAT ARRLAADGLA VAVVDIDEPA TGETVAAITA AGGRAVGVGA DVSDRAQVEA AVERVATDLG APTVLVNNAG VLRDNLLFKM SDADWDTVLG VHLRGAFLFS QVTQQHMVEQ QWGRIVNLSS TSALGNRGQA NYSAAKAGLQ GFTKTLAIEL GPFGVTVNAV APGFIVTDMT AATAARMKVD FETLQKHAEA EIPVRRVGRP EDVAHTISFL ASSGASFVSG QVIYVAGGPK D
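The lengths reported in this record can be cross-checked against its coordinates. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record: Start bp 4605724, End bp 4606479.
length = gene_length(4605724, 4606479)
print(length)                  # 756
print(protein_length(length))  # 251
```

Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the reported GC content; the spaces separating the 10-base groups do not need to be removed first.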