Gene Information |
Locus tag | Saro_3768 |
Symbol | |
ID | 5077916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 405900 |
End bp | 406670 |
Gene Length | 771 bp |
Protein Length | 256 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640481491 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001166153 |
Protein GI | 146275993 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
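The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 405900
END_BP = 406670


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "771 256"
```

Both results agree with the Gene Length (771 bp) and Protein Length (256 aa) fields listed in this record.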
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0560932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGGGA CATTCGAAGG CAAGGTCGCA CTGGTGACCG GAGCGGCATC GGGCATCGGC CGGGCCGCCG CGATCCGCTT TGCAGAGGAA GGCGCCAGGG TTTTCTGCGC GGACCTCAAC CTTGCCGGCG CCGAAGCAGT CGCGGCAGGC ATCGGCAAGG GCGCGAGCGC GGTGCAGGTC GATGTCGCGA GCTATGCCAG CAACCAGGCG ATGGTCGACG CGGTCATGGC CAACGCAGGC CAGCTTGACG TGGCATTCCT CAACGCCGGC TTCTACGGCG CCGCGGAAGG GCTCGACACG GTGGACGAAG CGCTGTTCGA CCGCCTCGTC GCGATCAACC TCAAGGGCGC GTTCAACGGC ATCAAGGCGG TGCAGGCCGT GATTGCGACA GGTGGCGCCG TGGTGGTCAC CGCTTCGGCG GCGGGCATCG TGGGGCATCC GGCAAACCCG GCTTACAGCG CGGCCAAGCA CGGCGTGGTC GGCCTGGTGA AATCCTGCGT CGATGCATTT GCTGCGCGCG GCGCGCGGAT CAACGCGCTC TGCCCCGGCG GCGTGGAAAC GCCGCTGATC GGCGCGCCCG ACGTTGCCAT CGTCCCGGCC GCGGATCTGC CGCGCGTGCC TGCGCGCGGA ATGGGCCGGG CGCAGCATGT GGCCGAAGTG GCGCTGTGGC TGTCGAGCCC CGCTGCCGGC TTCATCACCG GCCAGGCGCA GGTTCTCGAC GCGGGCCTGC TTTCGACATT CGCGCCGGTC ATGCTGCCGG TCGGTCCCTG A
|
Protein sequence | MAGTFEGKVA LVTGAASGIG RAAAIRFAEE GARVFCADLN LAGAEAVAAG IGKGASAVQV DVASYASNQA MVDAVMANAG QLDVAFLNAG FYGAAEGLDT VDEALFDRLV AINLKGAFNG IKAVQAVIAT GGAVVVTASA AGIVGHPANP AYSAAKHGVV GLVKSCVDAF AARGARINAL CPGGVETPLI GAPDVAIVPA ADLPRVPARG MGRAQHVAEV ALWLSSPAAG FITGQAQVLD AGLLSTFAPV MLPVGP
|
| |
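The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that for a sequence written in space-separated blocks of ten bases, as in this record. The helper name `gc_fraction` is hypothetical, and the example string holds only the first two blocks of the gene sequence, so its result is not the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, for illustration only.
first_blocks = "ATGGCCGGGA CATTCGAAGG"
print(f"{gc_fraction(first_blocks):.0%}")
```

Passing the full gene sequence string instead of `first_blocks` would give the whole-gene figure to compare against the GC content field.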