Gene Saro_2304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2304 
Symbol 
ID3915649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2443979 
End bp2445409 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content65% 
IMG OID640445060 
Productaldehyde dehydrogenase 
Protein accessionYP_497575 
Protein GI87200318 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGACA CGACGCGGGG CGAGATGCTG GCGCAGCTCG AGCGGCAGAA GGCGGCATTC 
ACCCAGGCGC GCCCCGAACC CCTGTCCACG CGCAACGACA GGCTGGAGCG ATGCGCACGG
CTGTTGCTCC AGCACGGCGA AGACTTCGCC CGGGCGATGA GCACCGATTT CGGCCACCGT
AGCCACGAGC AGTCGATGCT GACCGACATC ATGCCGGCGT TGAGCCTGGT GCGCTATTCG
CAGAAGCGGA TGAAGGCCTG GTCGAAGCCG GAGAAGCGCC ACGTCAACTT CCCGCTGGGC
CTGCTGGGGG CAAGGGCCGA GGTGCGGTAC GAACCCAAGG GCGTGATAGG AATCGTGGCG
CCCTGGAACT TTCCGGTCGG CCTGACGCTG GCGCCACTGG CGCAGGCCTT CGCGGCCGGC
AACCGCGCCA TGCTGAAGCC CAGCGAGTTC ACCGAGCGGA CTTCGGAACT GATGGCTGAA
CTTTTCCCGA AGTATTTCGG GGAGGACGAG GTTGCCGTCG TGCTGGGCGG GCCGCAGGCG
GGGCAGGATT TCTGCTCGTT GCCGTTCGAC CATCTGCTGT TCACCGGCGC CACCTCGATC
GGCAAGCACG TGCTTCATGC CGCGGCGGAC AACCTCGTGC CGGTGACCCT TGAACTTGGG
GGCAAGTCGC CGACCATCCT CGGGCGGAGC GCCAATGTAG AACAGGCGGC CCAGCGCATC
GCACTGGGCA AGATGATGAA CGCCGGGCAG ATCTGCCTTG CGCCCGACTA CATGCTGGTG
CCCGAGGACA TGGAAGAGCG GGCGATTGGC GCGGTCAGCG CGAGCGTCGC GCAGATGTAC
CCGACCTTGC TCGCGAACGA CGACTATACT TCCGTTATCA ACCGGCGGCA CCGCGACCGG
CTGGTCGGGC TTGTCGACGA TGCCGTGGCG AAGGGCGCGG AGGCCATCGT CGTCAATCCG
GGCGGGGAAA ACTTCGAGGG ATCGAACGGC AACAAGCTGC CGCTCACGAT ACTGCGCAAC
GTCGACGACG GCATGAAGGT GATGCAGGAC GAAATCTTCG GGCCGGTGCT GCCGGTGAAG
ACCTATCGCG GGATCGACGA GGCGATCGAC TACATCAATG CCCATGACCG TCCGCTGGGG
CTCTACTATT TCGGCGAGGA CGCGGGCGAG CGGGAGCGGT TGCTGACGCG GACGATATCG
GGCGGGGTTA CCGTGAACGA CGTGATCTTC CACGTATCCG CCGACGATCT GCCGTTTGGC
GGGGTCGGGC CTTCGGGCAT GGGCAGCTAC CACGGCATCG AAGGATTCCG CAGCTTCAGC
CACGCGCGCG CGGTCTATCG GCAACCCAAG GTGAACGTGG CCAAGCTTGC CGGATTGTTG
CCGCCCTATG GCGCGGCGAC CGCTCGCACG CTCAAGATGC AGCTAAAGTA A
 
Protein sequence
MKDTTRGEML AQLERQKAAF TQARPEPLST RNDRLERCAR LLLQHGEDFA RAMSTDFGHR 
SHEQSMLTDI MPALSLVRYS QKRMKAWSKP EKRHVNFPLG LLGARAEVRY EPKGVIGIVA
PWNFPVGLTL APLAQAFAAG NRAMLKPSEF TERTSELMAE LFPKYFGEDE VAVVLGGPQA
GQDFCSLPFD HLLFTGATSI GKHVLHAAAD NLVPVTLELG GKSPTILGRS ANVEQAAQRI
ALGKMMNAGQ ICLAPDYMLV PEDMEERAIG AVSASVAQMY PTLLANDDYT SVINRRHRDR
LVGLVDDAVA KGAEAIVVNP GGENFEGSNG NKLPLTILRN VDDGMKVMQD EIFGPVLPVK
TYRGIDEAID YINAHDRPLG LYYFGEDAGE RERLLTRTIS GGVTVNDVIF HVSADDLPFG
GVGPSGMGSY HGIEGFRSFS HARAVYRQPK VNVAKLAGLL PPYGAATART LKMQLK