Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3609 |
Symbol | |
ID | 5077758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 232528 |
End bp | 233319 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640481333 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001165995 |
Protein GI | 146275835 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.544133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGAA CTCTGAAAGG GCGCAGCGCG CTCGTCACCG GTGGCGGTCA GGGCGTGGGC CAGGGCATTG CCCGCGCGCT CGCGGCCGAA GGCGCGGACG TGATGATCGC CCAGCGCGGG CTTGAGGCCG CCGAAGCCGA AGCCGTGCAC CTGCGCGCGA CGTACGGCGT CAACGCCATC GCCCGTCAGG TCGACGTGAC CGTGCGCGGC GAGGTAGACG CCATGGTCGA TGCCTGCGCC GCCGCGTTCG GACGGCTCGA CATTCTCGTC AACAATGCCG GCGGCAGCTT CCCCAAGCGG CTGGAGAACC ATACCGACGA GGACATGGAA GGGTCCTTCC TGCTCAACTA CTGGTCCGCG TTCTGGTCCA TGCGCGCGGC CTTTCCGCTA ATGAAGGCGC AGAAGTACGG GCGGATCGTC AACCTCGGCT CGCTCAATGG CGTAAACGCG CACATGTTCA CCGCCGCCTA CAATGCCAGC AAGGAAGCGG TCCGCGCGCT GACCCGCACC GCGGCGGTCG AATGGGGCGG CCACGGCATT ACCGCCAACG TCATCTGCCC TTCCGCGCTC AGCCCGGCCG CGCGCGATTA TTTCGATGCC AACCCGGAAA TGGCGCAGGC CATCCTCGGC CAGGTTCCGG TGGGCCGCTT CGGGGAGTCG GCGGGCGATA TCGGGCCGGT CGCGGTGTTC CTTGCCGGCG AGGCGTCGAG CTACATGACC GGGAACACGC TCTACGTCGA TGGCGGCGGG CACATCAACG GCGTCGCCTG GCGCCCCGAA GTCGAGGACT GA
|
Protein sequence | MAGTLKGRSA LVTGGGQGVG QGIARALAAE GADVMIAQRG LEAAEAEAVH LRATYGVNAI ARQVDVTVRG EVDAMVDACA AAFGRLDILV NNAGGSFPKR LENHTDEDME GSFLLNYWSA FWSMRAAFPL MKAQKYGRIV NLGSLNGVNA HMFTAAYNAS KEAVRALTRT AAVEWGGHGI TANVICPSAL SPAARDYFDA NPEMAQAILG QVPVGRFGES AGDIGPVAVF LAGEASSYMT GNTLYVDGGG HINGVAWRPE VED
|
| |