Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2801 |
Symbol | |
ID | 3916961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3024456 |
End bp | 3025925 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445580 |
Product | aldehyde dehydrogenase |
Protein accession | YP_498071 |
Protein GI | 87200814 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000306882 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACCCG AAGGCGTGAA CGTAGCCCAT CCCGACAAGC TCTACATCGC CGGCCGCTGG GTGCCGGCGC ATTCGGGGCG GATGATCGAA CTGGTCTCGC CCAACACCGA GCTGGTGGTG GGCCGGGTTG CCGAAGCCGA CGAGACGGAC ATGGACGCCG CCGTTGCCGC CGCGCGTGAT GCCTTCGATC ACGGGCCATG GCCAACCACG CCTCCGGCCG AGCGCATCGC CGCATTCCGG CGAATGGTCG CGCACCTTGA AACGCGCGTG CCCGAACTGG CGAAGGCGTG GACCGCGCAG ATGGGCGGGC TCGCCTCGTT CGCGGGGCCG ATGCACGGCG GCAGCGTGAT GGCGCTGGGC CAGATTGCGG GGTTTGCGGA AAAGTTCGAG TTCGTCGAGC GCCGCCCGAG CATGGCCGCC GATACCGCGC TGATCGCCTA TGAGCCGGTT GGGGTCGTGG CAGCGATCGC GCCGTGGAAC GGGCCGTTCG GGATCATGGC GAACAAGGTC GCCTATGCCC TGCTGACCGG CTGCACGGTG ATCATGAAGC CGTCGCCCGA AACGCCGCTG GAAGCCTATA TCATCGCCGA GGCGGCCGAG GCGGCTGGTA TTCCGGCCGG CGTCGTCAAT CTTGTGACGG GACACCGCGA GGCGAGCGAT CACCTTGTCT GTAACGAAGG CGTCGACAAG GTGAGTTTCA CCGGATCGAC CGTGGCCGGC AAGCGCATCG CCAGCGTTTG CGGTTCGCGC ATGGCGCGCT GCACGCTGGA ACTGGGCGGC AAGTCGGCAG CCATCGTGCG GGACGATTTC CCCATCGATG CCGCCGCGAA GATGCTGGCG GGGACGATCA CGATGATGTC GGGCCAGGTC TGTGCGATGC TCAGCCGGGT CATCGTTCCG CGCCGCCGTC ACGATGAACT CGCCGAGGCG ATCGCGGCGG AAATGAAGAA GGTGCGCATA GGCAATTCCG ACGATCCCGA GACCCAGCTC GGCCCGGTCG CGATGAAGCG GCAGCTCGAA CGGATAGAGA TGTATATCGA GGAGGGCCGG AAGACGGCCG ATCTCGTCAC CGGCGGCGGA CGCCCGGCCC ATCTGAATCG CGGCTACTTC ATCGAGCCCA CGCTTTTCGC CAACGTCAGG AACGAGAGCC GCATCGCGCA GGAGGAGATT TTCGGCCCTG TCCTCAGCCT GATCCCGGTC GAGGACGAGG AGGACGCCAT CCGCACCGCC AATGCCAGCG CATATGGCCT CAACGGATCG GTCCTGACCA ACGATGCCGA AGCCGCCTAT CGCATCGCCC GGCGCGTGCG CACCGGGGGC TTCGGTCAGA ACGGGCTGCG CATGGACTTC GGCCTGCCCT TCGGCGGGTT CAAGCAATCC GGCATCGGCC GCGAAGGCGG GCCGGAAGGG CTCTTCCCCT ATCTCGAAAT GAAGACCATC CTGATTGATG GCATGCCTTC GGCGCTTTGA
|
Protein sequence | MTPEGVNVAH PDKLYIAGRW VPAHSGRMIE LVSPNTELVV GRVAEADETD MDAAVAAARD AFDHGPWPTT PPAERIAAFR RMVAHLETRV PELAKAWTAQ MGGLASFAGP MHGGSVMALG QIAGFAEKFE FVERRPSMAA DTALIAYEPV GVVAAIAPWN GPFGIMANKV AYALLTGCTV IMKPSPETPL EAYIIAEAAE AAGIPAGVVN LVTGHREASD HLVCNEGVDK VSFTGSTVAG KRIASVCGSR MARCTLELGG KSAAIVRDDF PIDAAAKMLA GTITMMSGQV CAMLSRVIVP RRRHDELAEA IAAEMKKVRI GNSDDPETQL GPVAMKRQLE RIEMYIEEGR KTADLVTGGG RPAHLNRGYF IEPTLFANVR NESRIAQEEI FGPVLSLIPV EDEEDAIRTA NASAYGLNGS VLTNDAEAAY RIARRVRTGG FGQNGLRMDF GLPFGGFKQS GIGREGGPEG LFPYLEMKTI LIDGMPSAL
|
| |