Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3811 |
Symbol | |
ID | 5077959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 465785 |
End bp | 467293 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481534 |
Product | aldehyde dehydrogenase (acceptor) |
Protein accession | YP_001166196 |
Protein GI | 146276036 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.505255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGTT CCGAAGGTCA GTTGTCGAAG CTGGGTGAAG CGGCCCGAAC ATTTCTTGCC ACGCGGCCGG GCCTGCTGAT CGCCGGACGT AACACGCCCG CCGCCAGCGG CGCGCGGCGC GACGTGCTCG ATCCATCGAC CGGGCTTGTG GTCACCGACG TGGCCGAAGG CGGAGCGGCG GACGTCGACG CGGCGGTGGC AGCTGCCCGC GCGACCTTCC AGAGCGCCGA GTGGCGCGAC ATGCGTCCCC TGGCGCGCGA ACGCCTGCTT CACCGCCTGG CCGACCTGAT CGAACGCGAC GCCGATATCC TTGCGGAACT CGAAGTGATC GACACCGGCA AGATCCTGCC GATGGCGCGT CATGGCGATC TCGTCATCGC TCTGGATTCC CTGCGCTACA TGGCAGGCTG GACCACCAAG ATCGAAGGGA CGACGATCGA TCCCTCGTTC TCCTACATTC CCCCGATCCG CTTTTCTGCC CGCACCGTGC GCCAGCCGGT CGGCGTTGTC GGGCAGATCA TCCCGTGGAA CTTCCCGCTG GTCATGGCCA TCTGGAAAAT CGCGCCGGCG CTTGCCGCAG GCTGCACCGT CGTGCTGAAG CCCGCCGAGG ACACCCCGCT CACCGCGCTC TACCTTGGCC GCCTGATCGC GGAAGCCGGG TTCCCGGCGG GAGCCGTCAA CATCGTGACC GGCGGGCGCG AGGTCGGCAT GGCGATCGTC GAGCATCCCG GCATCGACAA GATTGCCTTT ACCGGATCGA CCGCCGCCGG CCAGGACATC CAGCGTAGGG CCGCCGCCAC GATGAAGCGC CTCAGCCTGG AACTCGGTGG CAAGAGCCCG GTCGTTATCC TCGAGGATTG CCCGGTGCCG ATGGCGGTGG AAGGCGCGGC GGGCGCGATC TTCTTCAACC ACGGCCAGGT CTGCACCGCC GGTTCACGCC TGCTGGTCCA CCGCAGCATC TACGAGGACG TCGTGCAGGG GCTAGCCCAT GCGGCGAATG GCATGGTGCT GGGCGAAGGG ATGGACCCGG CCGGCCAGAT GGGGCCCTTG ATCTCCGCCC GCCAGCGGGA CCGTGTTGCC GGTTACGTGC AGGGTGCGCT CGATCAGGGC GCGCGGCTGC TGGCGGGCGG TGAAGCGCCG GACCGCGACG GGTTCTTCTA CCGGCCGACA GTGCTTGCCG ATGGCACGCC TTCCATGACC ATCTTCCAGG AGGAAGTGTT CGGTCCGGTG GTCATTGCCA TGCCCTTCGA TACGGAAGAG GAAGCGCTCG CGCTTGCCAA CGATTCCTGC TTCGCGCTGG GCGCCAGCGT CTGGACGCAG AACCTTGCGG CGGCCAACCG CTTCGCCGGC GCGCTGCGCT CGGGCAACGT CTGGATCAAC GCGCACAACA TCCTCGACCC GGCGGTGCCG TTCGGTGGGT GGAAGATGTC GGGATATGGC CGCGAACTGG GACACAGCGC GGTCGAACTC TATACCGAGG CCAAGTCCAT CACGATGCCG CTCCTCTGA
|
Protein sequence | MASSEGQLSK LGEAARTFLA TRPGLLIAGR NTPAASGARR DVLDPSTGLV VTDVAEGGAA DVDAAVAAAR ATFQSAEWRD MRPLARERLL HRLADLIERD ADILAELEVI DTGKILPMAR HGDLVIALDS LRYMAGWTTK IEGTTIDPSF SYIPPIRFSA RTVRQPVGVV GQIIPWNFPL VMAIWKIAPA LAAGCTVVLK PAEDTPLTAL YLGRLIAEAG FPAGAVNIVT GGREVGMAIV EHPGIDKIAF TGSTAAGQDI QRRAAATMKR LSLELGGKSP VVILEDCPVP MAVEGAAGAI FFNHGQVCTA GSRLLVHRSI YEDVVQGLAH AANGMVLGEG MDPAGQMGPL ISARQRDRVA GYVQGALDQG ARLLAGGEAP DRDGFFYRPT VLADGTPSMT IFQEEVFGPV VIAMPFDTEE EALALANDSC FALGASVWTQ NLAAANRFAG ALRSGNVWIN AHNILDPAVP FGGWKMSGYG RELGHSAVEL YTEAKSITMP LL
|
| |