Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0998 |
Symbol | |
ID | 3915780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1041882 |
End bp | 1042730 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640443732 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_496277 |
Protein GI | 87199020 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0692729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTCG ACGGAAAGAC CGTGGTGGTA ACGGGCGCCG CGTCGGGCAT CGGGCGGGCG GCGGCGCTGG CCTTCGCTGC GGAAGGCGCG AAAGTCTATG CGGCGGACAT CGACGAGGCG GGGCTGGCAG AGACGGCTGC GCAATCGAAC GGCGCGATCC GGACCGCGCG CTGCGACGTT ACCCGCTGCG AGGACATCAA GGCCCTGATG GATCGCGCAG GCACCGAGAC GGGCGGCATC GACACCGTGT TCAACAATGC CGGCGCAGGC GGTGACCGCG CGCCCATCGA CGAGATCGAG CCCGAAGGCT GGGACCGGAC GATGGACCTG CTGCTGCGGT CGGTCGCTTT CGGCATTCGC TATGCCGTGC CGCACATGAA GGGCCGCCAT GGCGCATCGT TCGTGAACAC GTCCAGCGTA GCGGCGGTCG GCCCCGGTTA TTCGCCCACG GCCTATGCAG TTGCAAAGGC GGGCGTGCTT CACCTGACGA AGGTCGCTGC GGCGGACCTT GCGAAGCACC AGATCCGCGT GAACGCGGTG CAGCCCGGCT TCATCAACAC CAACATCTTC ACCAGCTCGC TCGAAATGCC GGAGGAACTG GAAGCACAGG CCAAGGGCGC GATCGCGGCG ATGTCGCAAC AGGCCCAGCC GGTAGCGCGG GGCGGACAGC CAGAGGATAT CGCGCAAGCC GTGCTTTTCC TCGCGAGCGA GGCGGCGGGC TTCGTGACGG GGACTTCGCT CATCGTCGAT GGCGGCATCA CCGTGGGCCC GCGCCATAGC TGGGACCCGA ACATGCCCGG CCTGTTCGAT GCGCTGCAGC AGATGAAGGA AAGCGGCAAG GTTTCATGA
|
Protein sequence | MRFDGKTVVV TGAASGIGRA AALAFAAEGA KVYAADIDEA GLAETAAQSN GAIRTARCDV TRCEDIKALM DRAGTETGGI DTVFNNAGAG GDRAPIDEIE PEGWDRTMDL LLRSVAFGIR YAVPHMKGRH GASFVNTSSV AAVGPGYSPT AYAVAKAGVL HLTKVAAADL AKHQIRVNAV QPGFINTNIF TSSLEMPEEL EAQAKGAIAA MSQQAQPVAR GGQPEDIAQA VLFLASEAAG FVTGTSLIVD GGITVGPRHS WDPNMPGLFD ALQQMKESGK VS
|
| |