Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3607 |
Symbol | |
ID | 5077756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 230820 |
End bp | 231605 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640481331 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001165993 |
Protein GI | 146275833 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.197864 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCAGG ACATGACTGG CAAGGTCGCC ATCGTCACCG GCGGAAGCGA CGGCATCGGC CTCGCCACGG CTCGCCTTCT GGCCGCGCGT GGCGCCACCG TTGTCATCTG CGCCCGCCGC GAGGACAAGC TCGAGGCCGC CCGTGCCGAG ATCGCCAAGG TCGGCAAGGT CGAGGCGGTG AAGCTCGACG TCTCCGACGA AGCGGCCTTC ACCGCGCTGG TCGAAGATGT GGCCGCGCGC CACGGTCGGC TCGACATGCT GGTCAACAAC GCCATGTCGG TGCACTACGC ACCCATCGCC AAGCTGCGGC TCGACCATTG GCGCAAGGAC TTCGCGGTCA ATGCCGATGC CGTGTTCGTC GGCACCAAGG CGGCGATGAA GGTCATGGCC GCGCAGGAAC AGCGGGGCCG CCAGCGCGGC GCCATCGTCA ACATCGCATC GACCTGCGGC ATCCGCGCCG CGCCCAACAT GGCAAGCTAT TCGGCCTCGA AGGCAGCCAT GGTCCACTTC ACCGCCGCAG CGGCAATGGA AGGCGCGCCG CTTGGCATCC GCGTCAACGC CATCGTGCCC GGCCAGGTGA TGACCGCGGC CACCCAGGAA TTCGCCGACC GCGCGCCCGA AGTCGCCGCG CGCACCACCG GCGCGATCCC TATGCAGCGC GGCGGCGAGC CGGAAGAACT GGCCGAAGCC ATCGTCTTCA TGCTGTCCGA AGCCGCCAGC TACGTCACCG GCACTGCGCT GCCGGTCGAT GGCGGCAAGG CCGCGCAGCT CTACATGCCG GGTTGA
|
Protein sequence | MSQDMTGKVA IVTGGSDGIG LATARLLAAR GATVVICARR EDKLEAARAE IAKVGKVEAV KLDVSDEAAF TALVEDVAAR HGRLDMLVNN AMSVHYAPIA KLRLDHWRKD FAVNADAVFV GTKAAMKVMA AQEQRGRQRG AIVNIASTCG IRAAPNMASY SASKAAMVHF TAAAAMEGAP LGIRVNAIVP GQVMTAATQE FADRAPEVAA RTTGAIPMQR GGEPEELAEA IVFMLSEAAS YVTGTALPVD GGKAAQLYMP G
|
| |