Gene Information |
Locus tag | Saro_2304 |
Symbol | |
ID | 3915649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2443979 |
End bp | 2445409 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640445060 |
Product | aldehyde dehydrogenase |
Protein accession | YP_497575 |
Protein GI | 87200318 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
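The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are consistent with the reported 1431 bp and 476 aa but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2443979
END_BP = 2445409


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1431 476
```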
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGACA CGACGCGGGG CGAGATGCTG GCGCAGCTCG AGCGGCAGAA GGCGGCATTC ACCCAGGCGC GCCCCGAACC CCTGTCCACG CGCAACGACA GGCTGGAGCG ATGCGCACGG CTGTTGCTCC AGCACGGCGA AGACTTCGCC CGGGCGATGA GCACCGATTT CGGCCACCGT AGCCACGAGC AGTCGATGCT GACCGACATC ATGCCGGCGT TGAGCCTGGT GCGCTATTCG CAGAAGCGGA TGAAGGCCTG GTCGAAGCCG GAGAAGCGCC ACGTCAACTT CCCGCTGGGC CTGCTGGGGG CAAGGGCCGA GGTGCGGTAC GAACCCAAGG GCGTGATAGG AATCGTGGCG CCCTGGAACT TTCCGGTCGG CCTGACGCTG GCGCCACTGG CGCAGGCCTT CGCGGCCGGC AACCGCGCCA TGCTGAAGCC CAGCGAGTTC ACCGAGCGGA CTTCGGAACT GATGGCTGAA CTTTTCCCGA AGTATTTCGG GGAGGACGAG GTTGCCGTCG TGCTGGGCGG GCCGCAGGCG GGGCAGGATT TCTGCTCGTT GCCGTTCGAC CATCTGCTGT TCACCGGCGC CACCTCGATC GGCAAGCACG TGCTTCATGC CGCGGCGGAC AACCTCGTGC CGGTGACCCT TGAACTTGGG GGCAAGTCGC CGACCATCCT CGGGCGGAGC GCCAATGTAG AACAGGCGGC CCAGCGCATC GCACTGGGCA AGATGATGAA CGCCGGGCAG ATCTGCCTTG CGCCCGACTA CATGCTGGTG CCCGAGGACA TGGAAGAGCG GGCGATTGGC GCGGTCAGCG CGAGCGTCGC GCAGATGTAC CCGACCTTGC TCGCGAACGA CGACTATACT TCCGTTATCA ACCGGCGGCA CCGCGACCGG CTGGTCGGGC TTGTCGACGA TGCCGTGGCG AAGGGCGCGG AGGCCATCGT CGTCAATCCG GGCGGGGAAA ACTTCGAGGG ATCGAACGGC AACAAGCTGC CGCTCACGAT ACTGCGCAAC GTCGACGACG GCATGAAGGT GATGCAGGAC GAAATCTTCG GGCCGGTGCT GCCGGTGAAG ACCTATCGCG GGATCGACGA GGCGATCGAC TACATCAATG CCCATGACCG TCCGCTGGGG CTCTACTATT TCGGCGAGGA CGCGGGCGAG CGGGAGCGGT TGCTGACGCG GACGATATCG GGCGGGGTTA CCGTGAACGA CGTGATCTTC CACGTATCCG CCGACGATCT GCCGTTTGGC GGGGTCGGGC CTTCGGGCAT GGGCAGCTAC CACGGCATCG AAGGATTCCG CAGCTTCAGC CACGCGCGCG CGGTCTATCG GCAACCCAAG GTGAACGTGG CCAAGCTTGC CGGATTGTTG CCGCCCTATG GCGCGGCGAC CGCTCGCACG CTCAAGATGC AGCTAAAGTA A
|
Protein sequence | MKDTTRGEML AQLERQKAAF TQARPEPLST RNDRLERCAR LLLQHGEDFA RAMSTDFGHR SHEQSMLTDI MPALSLVRYS QKRMKAWSKP EKRHVNFPLG LLGARAEVRY EPKGVIGIVA PWNFPVGLTL APLAQAFAAG NRAMLKPSEF TERTSELMAE LFPKYFGEDE VAVVLGGPQA GQDFCSLPFD HLLFTGATSI GKHVLHAAAD NLVPVTLELG GKSPTILGRS ANVEQAAQRI ALGKMMNAGQ ICLAPDYMLV PEDMEERAIG AVSASVAQMY PTLLANDDYT SVINRRHRDR LVGLVDDAVA KGAEAIVVNP GGENFEGSNG NKLPLTILRN VDDGMKVMQD EIFGPVLPVK TYRGIDEAID YINAHDRPLG LYYFGEDAGE RERLLTRTIS GGVTVNDVIF HVSADDLPFG GVGPSGMGSY HGIEGFRSFS HARAVYRQPK VNVAKLAGLL PPYGAATART LKMQLK
|
| |
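The gene sequence above begins with ATG and ends with TAA, so it appears to be listed in coding orientation even though the gene lies on the minus strand. Under that assumption, the protein sequence and GC content can be recomputed from it with a sketch like the following. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; this sketch does not handle alternative starts such as GTG or TTG, which is harmless here because the gene starts with ATG. The helper names are hypothetical.

```python
# Codon-to-amino-acid assignments shared by translation tables 1 and 11,
# in the conventional T, C, A, G ordering of each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-orientation DNA string up to the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_30 = "ATGAAGGACA CGACGCGGGG CGAGATGCTG"
print(translate(first_30))  # MKDTTRGEML
```

Pasting the full gene sequence into `translate` and `gc_percent` should give values comparable to the Protein sequence and GC content fields of this record.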