Gene Information |
Locus tag | Saro_3876 |
Symbol | |
ID | 5077487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 45672 |
End bp | 47111 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640480985 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001165647 |
Protein GI | 146275486 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
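The coordinate and length fields in the record above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the numbers shown here but are not stated by the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record for Saro_3876.
length = gene_length_bp(45672, 47111)
protein = expected_protein_length(length)
print(length, protein)  # prints "1440 479"
```

Both results agree with the Gene Length (1440 bp) and Protein Length (479 aa) fields, which suggests the inclusive-coordinate reading is the right one for this record.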
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.104084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAA TTGAGCACTG GATTGGCGGC ACCGCCGTGG CTGCAACGGG CGGGGCCTAT TTCGACGATC TTAACCCGGT CGATGATTCG GTCTATTCCC GGGTTCCCGC CGGCACCGCG GAAGACGTGA ACCGCGCCGT GGAGAACGCG CATCAGGCCT ATCTCCAGCA TCGCGATCTG CCCGCCGCAG TGCGCGAAGG GTGGATCGCG AAAGCCGCCG AAATCATGGA GCGCGATACC GCCAAATTCG CCGACGTGCT GGTCGACGAA ATCGGTTCGC CGATCGCCAA GGCCGGGTTC GAAACCCGCT TCGCGGTCAG CTTCCTGCGC GCGGCGATCG GCGTGCCGCG CCGGATTCGG GGGGAAACGA TCCCTTCGGA CACCCCTGGG CGGTTCAGCA TGAGCCTGCG CCAGCCGGTC GGTGTGGTCG CCGGGATTAC CCCGTTCAAC GTGCCGCTGA TCAAGGGGAT CAAACAGTCG GCCATGGCCC TGGCCACAGG CAACGCCTTC GTGCTGCTAC CCTCGGAAGC CGCGCCAATG ATCGCCGATC TGCTCGCTAA ATTGTGGAAG GAGGCCGGCG TTCCCGACGG CTTGTTCAAC GTGGTCTACG GCAATGGCGC GGAGATCGGC GACGTGCTGA CCGGCCATCC CAAGGTCGCA TCGATAACCT TCACCGGGTC CTCGCGCGTC GGCAAGCACA TCGCCGAAAT CGCGGCCCGC AACCTCAAGA AGTACACGCT GGAGCTGGGC GGCAAGAGCC CGCTGGTGAT CTGCGCCGAT GCCGATCTGG ACAAGGCGGT CAACGCCGCG CTGTTCAGCA TCTTCATGTA CCAGGGCCAG GTCTGCATGG GGGCTTCGCG GATCTATGTG GAACGGTCGA TCTTCGACCA GTTCACCAAG GCCTTCGCCG CCGCGACAGG CCGCGCCAAC AGCGGCGACC TGCGCGATCC TACCACCATG CTGGGCCCGA TAATTTCCGA ACGCCAGCGC GACCGCGTCC GCCGCCACAT CGACGATGCC CGGTCCAAGG GCGCCGCGGT GCTGGCCGGC GGCGAGTGGA GCGGAAACAG CTGCGCGGCC ACCATCCTGA GTGGCGTTAC CGCGGAAATG ACCGTGTTCG AGGAAGAGAC CTTCGGCCCG GTCACCTCGC TCTTTCCGTT CGACACACTT GAGGAAGCCC TCGAACTCGC CAACAACACC GAATACGGCC TCAGCGCCTC TATCTTCACC CGCGATCTCG ACAAGGCGCT GGCCTTTGCC CAGCGGGCCG AAGCGGGGAT GGTCCACATC AACGCGCCGA CCCTGCACGA CGAGCCGCAC GTTCCGTTCG GCGGAACGAA GGCCTCTGGG TTCGGGCGCG AAGGCACCGA AGCCGATCTC GAGATCATGA CCGAATGGAA ATGGGTCACG ATCCAGTCGG CGACTGAGGG CGGACACTGA
|
Protein sequence | MTQIEHWIGG TAVAATGGAY FDDLNPVDDS VYSRVPAGTA EDVNRAVENA HQAYLQHRDL PAAVREGWIA KAAEIMERDT AKFADVLVDE IGSPIAKAGF ETRFAVSFLR AAIGVPRRIR GETIPSDTPG RFSMSLRQPV GVVAGITPFN VPLIKGIKQS AMALATGNAF VLLPSEAAPM IADLLAKLWK EAGVPDGLFN VVYGNGAEIG DVLTGHPKVA SITFTGSSRV GKHIAEIAAR NLKKYTLELG GKSPLVICAD ADLDKAVNAA LFSIFMYQGQ VCMGASRIYV ERSIFDQFTK AFAAATGRAN SGDLRDPTTM LGPIISERQR DRVRRHIDDA RSKGAAVLAG GEWSGNSCAA TILSGVTAEM TVFEEETFGP VTSLFPFDTL EEALELANNT EYGLSASIFT RDLDKALAFA QRAEAGMVHI NAPTLHDEPH VPFGGTKASG FGREGTEADL EIMTEWKWVT IQSATEGGH
|
| |
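The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence, assuming GC content is defined as the percentage of G and C bases over the full sequence length. The helper names are hypothetical, and only the first two blocks of the gene sequence are used in the example call.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(grouped.split()).upper()


def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence from the record.
prefix = "ATGACGCAAA TTGAGCACTG"
print(clean_sequence(prefix))      # prints "ATGACGCAAATTGAGCACTG"
print(gc_content_percent(prefix))  # prints "45.0"
```

Passing the full Gene sequence value to `gc_content_percent` would give a figure to compare against the 65% reported in the GC content field.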