Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3470 |
Symbol | |
ID | 5077619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 71196 |
End bp | 72206 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481194 |
Product | alcohol dehydrogenase |
Protein accession | YP_001165856 |
Protein GI | 146275696 |
COG category | [R] General function prediction only |
COG ID | [COG2130] Putative NADP-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAA ACCGCTTCTG GCGCCTTGAT CGACACCCCG AAGGCTCAGA CTTCGCATCG GCGCTGAGCC TTCAGACCGA GACTCTTCCG GCGCTTGGCG AAGGCGAAGT GCGGGTGAGG GTCGGATGGC TATCGATGGA CGCAGGCACA CGCATGTGGA TGAGCCCGCG AACCGACGGC TACCAGCCCC CGTTGCCGCT GGGTTCGAAG ATGGCGGGTC TTGTGCTGGG CCGGGTGAGC GAAAGCCGCG ACCCGGCATT TCCGGCCGGC ACCCTGGTGC GCGGGTTTGG CCAATGGGCG GATTACGCCA CTGTCGTTCC GGCGCTTTCC GGTCTCGAGG CGGTGGACGA CGGAATCGCG GACATTCGCC AGCACTTCGG TGCGCTGGGC ATGAATGCCT GGACCGCCTT TGTCGGCGTG CGCGAAGTGG CCGCGATCAG CCCCGGCGAA TGGCTGGTCG TCTCGGCGGC GGCGGGCGCG ACCGGGAGCA TGGCCTGCCA GGTGGGGCGC AACCTTGGCG CCAAGGTCGT GGGCATTGCC GGCGGCGCGG AGAAGTGCCG CTACCTTGTC GAGGAATTGG GCGTCGACAT TGCCATCGAC TACAAGAACG AGGACGTGGC CGCGCGCCTT GCCGAAGTGC CTGGCGGCGT TAACGCCTAT TTCGACAACG TCGGCGGACC GATGCTCGAC GCCGTGCTGC CGAACATGGC GCACTACGGC CGCGTCGCGG TGTGCGGCAT GGTGGCGGCC TACGACAATG ACGCCCCCCT CCCCGGACCC GCGCGGTTCG ACCAGGTGCT GATGCGGCGC CTGCGGATCG AGGGCTTCTT CATCCCCGAC TTCCTCCATC GCGGGGCCGA GTTCATGCCA ATCCTCCGCG AATGGGTCGA TGCGGGCAAG CTGACGGTAC GCCTCAACGA GACGGTGGGG CTGGAGAACG TGCTGGAAGG CTACGAACGG ATGCTCTCCG GCAAGGCGAT CGGCAAGGTC ATCGTCAAGG TCGGCGACTG A
|
Protein sequence | MTENRFWRLD RHPEGSDFAS ALSLQTETLP ALGEGEVRVR VGWLSMDAGT RMWMSPRTDG YQPPLPLGSK MAGLVLGRVS ESRDPAFPAG TLVRGFGQWA DYATVVPALS GLEAVDDGIA DIRQHFGALG MNAWTAFVGV REVAAISPGE WLVVSAAAGA TGSMACQVGR NLGAKVVGIA GGAEKCRYLV EELGVDIAID YKNEDVAARL AEVPGGVNAY FDNVGGPMLD AVLPNMAHYG RVAVCGMVAA YDNDAPLPGP ARFDQVLMRR LRIEGFFIPD FLHRGAEFMP ILREWVDAGK LTVRLNETVG LENVLEGYER MLSGKAIGKV IVKVGD
|
| |