Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3763 |
Symbol | |
ID | 5077911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 401490 |
End bp | 402275 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481486 |
Product | short chain dehydrogenase |
Protein accession | YP_001166148 |
Protein GI | 146275988 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.566112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTTT CCGTTCCTCC CTATCCCACT CCGCTCGGCA TGCTGAAGGG CAAGACCGTG GTGGTTACCG CAGCGGCGGG CACTGGCATC GGCTTTGCCG TTGCCAAGCG CGCCGCCGAA GAAGGCGCGC GCCTGCTCAT CAGCGACTTC CATGAACGCC GGCTCGGTGA AGCGGCGGAC CGCATCGCGG CTGAAGTGGG TTGCGAGCGC CCGGCGACCG TTGTCTGCGA CGTGACCAAC GAAGCACAGG TGCAGGACCT GCGCGACGCG GCGCTGGAGA AGCTCGGCAA GGTCGACGTC CTCATCAACA ATGCGGGGCT GGGCGGCGAG GTCGATGTCG TGGACATGAC CGACGACCAG TGGAGCCGCG TGATCGACGT GACGCTGACA AGCCTGTTCC GGATGACGCG GGCGTTCCTG CCCGCGATGT ATGCCAACAA GTCCGGCGTC ATGGTCAACA ACGCTTCGGT GCTGGGTTGG CGGGCACAGA AGGGGCAAGC CCACTATGCT GCGGCAAAGG CCGGCGTGAT GGCCTTCACC CGCTGCGCCG CGCTGGAAGC GGCGGACCAT GGCGTGCGGA TCAATGCTGT CGCGCCCAGC CTTGCCATGC ATCCCTTCCT GGCAAAGGTG ACGACGGAGG AGCGCCTGGC CGAACTGGTC AAGACCGAAG CCTATGGGCG TCCGGCTGAA GTGTGGGAAG TCGCGAACGT CATGCTGTTC CTGGCCAGCG ACCTGTCTTC CTACATGACC GGCGAGATCG TTTCCGTCTC TAGCCAGAGG GCCTGA
|
Protein sequence | MSVSVPPYPT PLGMLKGKTV VVTAAAGTGI GFAVAKRAAE EGARLLISDF HERRLGEAAD RIAAEVGCER PATVVCDVTN EAQVQDLRDA ALEKLGKVDV LINNAGLGGE VDVVDMTDDQ WSRVIDVTLT SLFRMTRAFL PAMYANKSGV MVNNASVLGW RAQKGQAHYA AAKAGVMAFT RCAALEAADH GVRINAVAPS LAMHPFLAKV TTEERLAELV KTEAYGRPAE VWEVANVMLF LASDLSSYMT GEIVSVSSQR A
|
| |