Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3848 |
Symbol | |
ID | 5077459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 17608 |
End bp | 19113 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640480957 |
Product | aldehyde dehydrogenase (acceptor) |
Protein accession | YP_001165619 |
Protein GI | 146275458 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0362745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACGC AGTTGAGAAG TGCAGAAAAC GAATACGGGA TCAAGTCCGA GTACGGCCAC TATATCGGCG GCGAGTGGAT CGCCGGGGAC AGCGGCAAGA CCATCGATCT GCTCAATCCC TCGACCGGCA AGGTGCTGAC CAAGATTCAG GCCGGCAACG CCAAGGATAT CGAACGCGCG ATTGCCGCCG CCAAGGCGGC GTTTCCCAAG TGGTCGCAGA GCCTGCCCGG CGAGCGCCAA GAAATCCTGA TCGAGGTTGC GCGTCGTCTG AAGGCACGCC ATTCGCACTA TGCCACCCTC GAAACGCTCA ACAACGGCAA GCCGATGCGC GAATCGATGT ATTTCGATAT GCCGCAGACG ATCGGGCAGT TTGAGCTGTT CGCCGGTGCC GCCTATGGCC TGCACGGCCA GACGCTCGAT TATCCCGACG CGATCGGCAT CGTCCACCGC GAACCGCTCG GCGTCTGCGC GCAGATTATC CCATGGAACG TGCCGATGTT GATGATGGCG TGCAAGATCG CGCCCGCGCT GGCCTCGGGC AACACTGTCG TTCTGAAGCC GGCCGAAACG GTCTGCCTTT CGGTGATTGA ATTCTTCGTG GAAATGGCTG ATCTGTTGCC GCCGGGTGTG ATCAACGTCG TCACCGGCTA TGGCGCGGAC GTGGGCGAGG CGCTGGTCAC CAGCCCCGAT GTCGCCAAGG TGGCCTTCAC CGGTTCGATC GCCACCGCGC GCCGGATCAT CCAGTATGCC TCGGCCAACA TCATCCCCCA GACGCTCGAG TTGGGCGGCA AGTCGGCGCA CATCGTGTGT GGCGATGCCG ACATCGACGC GGCGGTGGAA AGCGCGACTA TGTCGACCGT GCTCAACAAG GGCGAAGTCT GTCTGGCCGG TTCGCGCCTG TTCCTGCACC AGTCGATCCA GGACGAGTTC CTGGCCAAGT TCAAGACCGC GCTTGAAGGC ATCCGCCAGG GCGACCCGCT CGACATGGCG ACCCAGCTTG GCGCCCAGGC ATCGAAGATG CAGTTTGACA AGGTGCAAAG CTACCTGCGC CTGGCCACCG AGGAAGGGGC CGAGGTCCTG ACCGGCGGCA GCCGCTCGGA TGCCGCAGAT CTGGCCGATG GCAATTTTAT CAAGCCGACC GTGTTCACCA ACGTCAACAA CTCCATGCGG ATCGCGCAGG AAGAGATCTT CGGACCGGTT ACCAGCGTCA TCACCTGGAG CGACGAAGAC GACATGATGA AGCAGGCCAA CAATACAACT TACGGCCTCG CTGGCGGCGT CTGGACCAAG GACATCGCCC GAGCCCACCG GATTGCGCGC AAGCTCGAAA CTGGCACGGT CTGGATCAAT CGCTACTACA ACCTGAAGGC CAACATGCCG CTGGGCGGTT ACAAGCAAAG CGGCTTCGGG CGTGAATTCA GCCATGAAGT GCTGAATCAC TACACCCAGA CCAAGTCGGT GGTGGTCAAC CTCCAGGAAG GTCGCACCGG AATGTTCGAT CAGTGA
|
Protein sequence | MATQLRSAEN EYGIKSEYGH YIGGEWIAGD SGKTIDLLNP STGKVLTKIQ AGNAKDIERA IAAAKAAFPK WSQSLPGERQ EILIEVARRL KARHSHYATL ETLNNGKPMR ESMYFDMPQT IGQFELFAGA AYGLHGQTLD YPDAIGIVHR EPLGVCAQII PWNVPMLMMA CKIAPALASG NTVVLKPAET VCLSVIEFFV EMADLLPPGV INVVTGYGAD VGEALVTSPD VAKVAFTGSI ATARRIIQYA SANIIPQTLE LGGKSAHIVC GDADIDAAVE SATMSTVLNK GEVCLAGSRL FLHQSIQDEF LAKFKTALEG IRQGDPLDMA TQLGAQASKM QFDKVQSYLR LATEEGAEVL TGGSRSDAAD LADGNFIKPT VFTNVNNSMR IAQEEIFGPV TSVITWSDED DMMKQANNTT YGLAGGVWTK DIARAHRIAR KLETGTVWIN RYYNLKANMP LGGYKQSGFG REFSHEVLNH YTQTKSVVVN LQEGRTGMFD Q
|
| |