Gene Information |
Locus tag | Saro_3855 |
Symbol | |
ID | 5077466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 23208 |
End bp | 24692 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640480964 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001165626 |
Protein GI | 146275465 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
|
|
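The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon; the helper names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(23208, 24692)
print(length, protein_length(length))  # 1485 494
```

Both values agree with the Gene Length and Protein Length rows of the record.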
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.146546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTCAA CTTCCTCCTC TCCGGTCACC GAAACTATCC TGAACTTCAT TGACGGTTCC TATCGCAAAG GTAGCGAGGG TAAGTCGTTT CCCAACGTCA ATCCGGCCAC CGGCACCGCG ATCGGGGTTG TGCACGAAGC AAGCCAGGCC GACGTCGCAG ACGCCGTGGC TGCGGCCAAG GCAGCGCTTA CCGGGCCATG GGGCAAGATG ACCACGGCCG AGCGGGTCAA GCTGATCGCC GCCGTGGCGA CCGAGATCGA ACGCCGAGCG GATGATTTCC TGGCTGCCGA AGTGGCCGAC ACCGGCAAGC CGCGCCATGT CGCGTCCCAT ATCGACATTC CGCGCGGGGC CGCCAACTTC CGCATGTTCG CGGATGTCGT CTCGACAATG CCGGGCGAAA GCTTCAACAC ATCGACCCCC GATGGCGGCC AGGCGCTCAA CTATACCGTG CGCAAGCCCA AGGGTGTGGT CGCGGTGGTC TGCCCATGGA ACTTCCCGCT GCTGCTGATG ACCTGGAAGG TTGGCCCGGC GCTGGCCTGC GGCAATACCG TGGTGGTCAA GCCGTCCGAG GAAACGCCTC GGACTGCTGC CCTGCTGGGT GAAGTAATGA ACGCGGTGGG CATGCCCAAG GGTGTCTACA ACGTCGTCCA CGGATTCGGT CCGGGTTCGG CCGGCGAATT CCTCACGTCC AACCCCGATG TCGATGCCAT CACCTTCACC GGCGAGACCG GCACCGGACA GGCGATCATG CAGAAGGCCG CGATCGGCGT TCGCGACATT TCGTTCGAAC TCGGTGGCAA GAACCCGGCG ATTGTGTTCG CCGATGCCGA CCTCGACAAG GCGGTCGAGG GTCTGTCGCG CTCGGTCTTC CTGAACACCG GGCAGGTCTG CCTCGGAACC GAGCGGGTCT ATGTCGAACG ACCGATCTTC GACGCCTTCG TGGCGCGGAT GGCGGCGGCG GCGCAGGGCT TCAAGCCGGG CGTGACCGGT GATCGCGCCT ATCTCGGCCC GCTGATCAGC GCCGAGCACC GCGAGAAAGT GCTGGGTTAC TATCGCCGTG CGGTCGAGGA CGGGGCCACC GTGGTCACCG GCGGCGGCGT TCCCGAAATC TCGGGTGCGG AAGCCGGTGG CTTCTTCGTG GAACCGACGC TGTGGATCGA CGTCGCCCAC GGCGATACCG TGATGCGCGA GGAAATCTTC GGACCGTGCT GCGGTATCGT GCCGTTCGAC AGCGAGGACG AGGTGATCGC ACTTGCAAAC GATACGGTTT ACGGCCTGTG CGCCTCGATC TGGACCGAAA ACATGTCCCG CGGACACCGC GTGGCGGCGG CGATGGACGT GGGGGTGTGC TGGGTCAATT CCTGGTTCCT GCGCGATCTG CGCACGGCTT TCGGCGGGTC CGGCCATTCC GGCATCGGCC GGGAAGGCGG GGTGCACAGC CTCGAATTCT ACACCGAGAT CACCAACATT TGCGTGAAGC TGTAA
|
Protein sequence | MTSTSSSPVT ETILNFIDGS YRKGSEGKSF PNVNPATGTA IGVVHEASQA DVADAVAAAK AALTGPWGKM TTAERVKLIA AVATEIERRA DDFLAAEVAD TGKPRHVASH IDIPRGAANF RMFADVVSTM PGESFNTSTP DGGQALNYTV RKPKGVVAVV CPWNFPLLLM TWKVGPALAC GNTVVVKPSE ETPRTAALLG EVMNAVGMPK GVYNVVHGFG PGSAGEFLTS NPDVDAITFT GETGTGQAIM QKAAIGVRDI SFELGGKNPA IVFADADLDK AVEGLSRSVF LNTGQVCLGT ERVYVERPIF DAFVARMAAA AQGFKPGVTG DRAYLGPLIS AEHREKVLGY YRRAVEDGAT VVTGGGVPEI SGAEAGGFFV EPTLWIDVAH GDTVMREEIF GPCCGIVPFD SEDEVIALAN DTVYGLCASI WTENMSRGHR VAAAMDVGVC WVNSWFLRDL RTAFGGSGHS GIGREGGVHS LEFYTEITNI CVKL
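The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA as shown, even though the gene lies on the minus strand) and uses the codon assignments of translation table 11, which match the standard code for elongation codons. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; one letter per codon,
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGACCTCAA CTTCCTCCTC TCCGGTCACC"))  # MTSTSSSPVT
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` should likewise reproduce the full protein, though that is left for the reader to run.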