Gene Information |
Locus tag | Saro_3700 |
Symbol | |
ID | 5077848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 334883 |
End bp | 336337 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481423 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001166085 |
Protein GI | 146275925 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
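The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The record does not state either convention, but both assumptions reproduce the reported 1455 bp and 484 aa.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(334883, 336337)
print(length)                           # 1455
print(expected_protein_length(length))  # 484
```

The gene lies on the minus strand, so the listed start and end describe the span on the replicon rather than the direction of transcription; the length calculation is the same either way.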
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCAG GACCGATGGA AATGAACGCC TGCGAGCGCC TTTACATCGA TGGCGATTGG GTGGAACCCG CCCGCCCCGC CCCGCCGATC CCGGTAATCA ATCCTGCAAC CGAAATGGCG TGCGGCTCAG TCAGGCCCGG AAGCGCGCTC GATGTCGATC GCGCGGTGGC AGCGGGGCGC CGGGCATTTG CGTCATTCTC CGTGAGCCCT GCGCAGGAGC GTATTGCCCT GCTCGGCAGA ATTCTCTCCC TGATGGAGGA AAGGGCGGAA GCTCTCGCTC AGGCCGTCAC GCTGGAAATG GGGTGCGCAA TCTCGTTTTC TCGTACGGCT CAGGTACCTT TCGGTGTCGC CCATGTTCGC GCAGCACTCA AAGTGCTGCA TGACTTCCCG TTCCTCACTC CACAGGGTTC CACCGCGATC CTGCGTGAAC CCATCGGCGT CTGCGGGCTC ATCACGCCGT GGAACTGGCC GCTGTACCAG ATCACGGCGA AGCTTGCGCC GGCGCTTGCC GCAGGGTGCA CCGTTGTGTT GAAGCCCAGC GAGCTGTCGC CGTTCAGCGC GGCGCTGCTG GCGCAGATCG TCCACGATGC CGGCACACCG GCTGGCGTAT TCAACATGGT GACCGGCACC GGCCCCGGGG TGGGAGAGGC CATCGCTGCG CATGGCGACA TCGACATGGT CTCGATAACC GGCTCGACGC GTGCGGGTGT GCTTGTCGCC CAGGCTGCCG CGACCACGGT AAAGCGGGTA ACGCAGGAGC TAGGTGGCAA GTCGCCGAAC ATCCTGCTCG ACGATGCCGA TTTCGAGCGG GTCGTTCCCC TAGGCATCGC GGCGGGCATG CGCAACGTCG GCCAGTCCTG CAGCGCGCCG ACGCGCATGC TGGTGCCGCG CCATCGCCTG ACCGAAGTGG AGCAGCTTGC CGCGAATGCC GCCAACGCGC TCGTTGTCGG GGATCCGCTG GCCGAAGCGA CGGACCTCGG ACCGGTTGCC AACAGCCGCC AGTTCGACAA GGTCCAACAG ATGATCGCCG TCGGCCAGGC CGAAGGCGCC CGTCTGCTGT GTGGCGGCCC CGGCAGGCCC GAAGGGCTGG ATCGCGGCTT CTTCTGCAGG CCGACGGTAT TCACCTGCGA TGATCCGGCG ATGCGCGTAG CGCAGGAGGA GATCTTCGGG CCGGTGATCT GCGTGATCCC CTACGACGAT GACGACCACG CCGTCCGCAT TGCCAATGAC ACGATCTACG GGCTCGGCTC GCACGTCCAG TCCGCAGATA TCGACCGCGC ACGAACTGTG GCGGCAAGGA TCCGGGCGGG GCAGGTCCAC ATCAACCATC CCGCCTGGGA CGGGCATGCA GCCTTCGGCG GTTACAAGCA GTCTGGAAAC GGGCGTGAAT ATGGGCGTTT CGGCCTGGAG GAATACCTCG AGACCAAGGC CGTTCTGGGA TACTTCCCCC AGTGA
|
Protein sequence | MKAGPMEMNA CERLYIDGDW VEPARPAPPI PVINPATEMA CGSVRPGSAL DVDRAVAAGR RAFASFSVSP AQERIALLGR ILSLMEERAE ALAQAVTLEM GCAISFSRTA QVPFGVAHVR AALKVLHDFP FLTPQGSTAI LREPIGVCGL ITPWNWPLYQ ITAKLAPALA AGCTVVLKPS ELSPFSAALL AQIVHDAGTP AGVFNMVTGT GPGVGEAIAA HGDIDMVSIT GSTRAGVLVA QAAATTVKRV TQELGGKSPN ILLDDADFER VVPLGIAAGM RNVGQSCSAP TRMLVPRHRL TEVEQLAANA ANALVVGDPL AEATDLGPVA NSRQFDKVQQ MIAVGQAEGA RLLCGGPGRP EGLDRGFFCR PTVFTCDDPA MRVAQEEIFG PVICVIPYDD DDHAVRIAND TIYGLGSHVQ SADIDRARTV AARIRAGQVH INHPAWDGHA AFGGYKQSGN GREYGRFGLE EYLETKAVLG YFPQ
|
| |
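The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how the gene sequence relates to the GC content and protein sequence fields, the Python below strips the spacing, computes a GC percentage, and translates codons using the standard codon assignments, which translation table 11 shares for elongation codons. The helper names are hypothetical, and only the first 30 bases of the gene are used in the example to keep it short.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# also used by translation table 11 for non-start positions).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = clean_sequence("ATGAAGGCAG GACCGATGGA AATGAACGCC")
print(translate(prefix))  # MKAGPMEMNA
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same check against the GC content and Protein Length fields.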