Gene Information |
Locus tag | Saro_1104 |
Symbol | |
ID | 3916400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1150672 |
End bp | 1152096 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443839 |
Product | aldehyde dehydrogenase |
Protein accession | YP_496383 |
Protein GI | 87199126 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
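The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon that the protein length does not. The record states neither convention explicitly.

```python
# Values copied from the record above.
start_bp = 1150672
end_bp = 1152096
gene_length_bp = 1425
protein_length_aa = 474

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# which is not counted in the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span == gene_length_bp)              # coordinates agree with gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with protein length
```

Under those assumptions all three checks print `True` for this record.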
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.304588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGAAC GGCTACAGCA ATACATTGAT GGCAAGTGGG TAGACAGCGA GGGTGGCAAG CGCCACGAGG TCATCAATCC GACGACCGAG GAACCCTGCT GCGTCATCAC GCTGGGCACG CAGGCCGATG TCGACAAGGC AGTGGCCGCG GCCCAGCGCG CCTTCAAGAC CTTCAGCAAG ACGACGCGCG AGGAGCGACT CGCGCTGCTT GAACGCATCG TCGAGGAATA CAAGAAGCGC GTCCCCGATC TCGCCGCCGC GATGGCCGAG GAAATGGGCG CTCCGGTAAG CTTCGCCAGC ACCGCGCAGG TCGGCGCCGG CATCGGCGCC TTCCTCGGCA CCATGGCCGC GCTCCGCAAC TTCTCCTTCG TCGAGGACAA CGGTGCGTTC AAGGTCGCCT ACGAACCGAT CGGCGTCGTC GGCATGATCA CGCCATGGAA CTGGCCCCTC AACCAGATCG CGCTCAAGGT CGCACCGGCG CTGGCCGCGG GCAACACCAT GATCCTCAAG CCGTCCGAGG AATGCCCCAC CAACGCCGCG ATCTTTACCG AGATCCTCGA TGCCGCCGGC GTCCCGCCAG GCGTCTTCAA CCTCATCCAG GGCGATGGTC CCGGCGTCGG CACTGCGATC AGCTCGCACC CGGGCATCGA CATGGTCAGC TTCACCGGCT CGACCCGCGC GGGCATCCTC GTGGCGAAGG CTGCGGCCGA TACCGTCAAG CGCGTCCATC AGGAGCTTGG CGGCAAGTCG CCCAACGTCG TCCTGCCCGA TGCAGACTTC GCCAAGTACC TGCCGTCGAC CGCGTCCGGC CCGTTGGTCA ACAGCGGCCA GAGCTGCATT TCGCCCACCC GCATTCTCGT ACCCCGCGAA CGCGAAGCCG AAGCCGCGGC GTTCGTTTCG GCGATGTACT CGGCAACCCC GGTCGGCGAT CCGATGCAGG AAGGTGCGCA CATCGGCCCG GTGGTCAACA AGGCGCAGTT CGACAAGATC CGCGGCCTGA TCCAGTCGGC GATCGACGAA GGCGCGAAGC TCGAGACCGG CGGCCCCGAC CTCCCGGCCA ACGTCAACCG CGGCTACTAC ATCAAGCCCA CGGTCTTCTC CGGCGTCACG CCCGACATGC GCATTGCGCA GGAGGAAATC TTCGGCCCGG TCGCGACGAT CATGGCGTAC GACAGCCTCG AGGAGGCCAT CGAGATCGCC AACGACACCG CCTATGGCCT GTCGGCCTGC ATCACCGGCG ATCCGGCGAA GGCGGCTGAA GTCGCGCCCG AGCTTCGCGC CGGCATGGTC GCGATCAACA ACTGGGGCCC CACCCCGGGC GCGCCGTTCG GCGGCTACAA GCAGTCCGGC AACGGCCGCG AGGGCGGGCT CTATGGCCTC AAGGACTTCA TGGAAATGAA GGCGATCAGC GGCCTGCCTG CCTGA
|
Protein sequence | MRERLQQYID GKWVDSEGGK RHEVINPTTE EPCCVITLGT QADVDKAVAA AQRAFKTFSK TTREERLALL ERIVEEYKKR VPDLAAAMAE EMGAPVSFAS TAQVGAGIGA FLGTMAALRN FSFVEDNGAF KVAYEPIGVV GMITPWNWPL NQIALKVAPA LAAGNTMILK PSEECPTNAA IFTEILDAAG VPPGVFNLIQ GDGPGVGTAI SSHPGIDMVS FTGSTRAGIL VAKAAADTVK RVHQELGGKS PNVVLPDADF AKYLPSTASG PLVNSGQSCI SPTRILVPRE REAEAAAFVS AMYSATPVGD PMQEGAHIGP VVNKAQFDKI RGLIQSAIDE GAKLETGGPD LPANVNRGYY IKPTVFSGVT PDMRIAQEEI FGPVATIMAY DSLEEAIEIA NDTAYGLSAC ITGDPAKAAE VAPELRAGMV AINNWGPTPG APFGGYKQSG NGREGGLYGL KDFMEMKAIS GLPA
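The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch in Python. This is an illustrative example, not the database's own method. The function name `translate` and the variable names are hypothetical. The codon assignments are those of NCBI translation table 11, which uses the same codon-to-amino-acid mapping as the standard code and differs only in the permitted start codons. Only the first 30 bases of the gene sequence are used here. They begin with ATG and translate to the first ten residues of the listed protein, which suggests that the sequence is given in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

# Codon table in NCBI order: bases enumerated as T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
gene_prefix = "ATGCGCGAAC GGCTACAGCA ATACATTGAT"
print(translate(gene_prefix))  # prints MRERLQQYID
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the complete protein sequence.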