Gene Saro_3688 details

Gene Information

Locus tag: Saro_3688
Symbol:
ID: 5077836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 322619
End bp: 323554
Gene Length: 936 bp
Protein Length: 311 aa
Translation table: 11
GC content: 65%
IMG OID: 640481411
Product: acetaldehyde dehydrogenase
Protein accession: YP_001166073
Protein GI: 146275913
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4569] Acetaldehyde dehydrogenase (acetylating)
TIGRFAM ID: [TIGR03215] acetaldehyde dehydrogenase (acetylating)
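
The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either assumption, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Saro_3688.
gene_bp, protein_aa = cds_lengths(322619, 323554)
print(gene_bp, protein_aa)
```

Under these assumptions the result agrees with the listed Gene Length (936 bp) and Protein Length (311 aa).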


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCGCG TGAAGGCCGC GATCATCGGT TCGGGCAATA TCGGCACCGA CCTGATGATG 
AAGATGATCA AGTACCCGCA GAACATGGAG CTGGCCATCG TCGTCGGTAT CGACGAGAAG
TCGGAAGGCC TCGCGATGGC CCGCGAACAC GGCATCGCAA CGACACACGA AGGCCTCGAA
GGCCTGCGTC GCCATCCGCT CTACAAGGAG ATCGGCATCG CCTTCGACGC GACCAGCGCA
TATGCCCACA AGGTCCACGA CGAAGCCCTG CGCGCCGACG GCATCCAGGT CGTCGACCTG
ACGCCCGCAG CCATCGGCCC CTTCACTGTG CCGCCGGTCA ACATGAGCCA GCATCTCGAC
CAGCCCAACG TGAACATGGT CACCTGCGGC GGCCAGGCGA CGATCCCGAT GGTCGCCGCC
GTCGCCCGCG TGTCGGACAA GGTTCACTAC GCCGAGATCG TCGCTTCGGT CTCCTCGCGT
TCGGCGGGCC CCGGCACGCG CGCCAACATC GACGAGTTCA CCCGCACCAC CGCCCGCGCG
ATCGAAGTCG TCGGCGGCGC GACGCGCGGC AAGGCGATCA TCATCCTCAA CCCGGCCGAA
CCGCCGATGA TCATGCGCGA CACGGTGTTC ACCCTGTCCG AAGGCGCTGA CGAGGACCAG
ATCCGCCGCT CGGTCGCCGA CATGGTTGCC GAAGTGCAGA AGTACGTGCC CGGCTACCGG
CTCAAGCAGG AAGTCCAGTT CGAACGCTTC GGCGACAACA ACAAGCTGAA GATCCCCGGC
CAGGGCGAAT TCACCGGCAT CAAGTCGATG ATCATGCTCG AGGTCGAGGG CGCCGGCGAC
TACCTGCCCA GCTATTCCGG CAACCTCGAC ATCATGACCG CCGCCGCCAA GGCCACCGGC
GAGCTTCTCG CTGCGCGCCG CATGGCCGCC GCCTGA
 
Protein sequence
MTRVKAAIIG SGNIGTDLMM KMIKYPQNME LAIVVGIDEK SEGLAMAREH GIATTHEGLE 
GLRRHPLYKE IGIAFDATSA YAHKVHDEAL RADGIQVVDL TPAAIGPFTV PPVNMSQHLD
QPNVNMVTCG GQATIPMVAA VARVSDKVHY AEIVASVSSR SAGPGTRANI DEFTRTTARA
IEVVGGATRG KAIIILNPAE PPMIMRDTVF TLSEGADEDQ IRRSVADMVA EVQKYVPGYR
LKQEVQFERF GDNNKLKIPG QGEFTGIKSM IMLEVEGAGD YLPSYSGNLD IMTAAAKATG
ELLAARRMAA A
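
The gene and protein sequences above are printed in space-separated blocks of 10. As a minimal sketch, the code below strips that layout, computes a GC fraction, and translates the DNA. It uses the standard amino acid assignments, which translation table 11 shares with table 1. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table built in the usual TCAG ordering. Table 11 differs from the
# standard table only in its permitted start codons, which does not matter
# for a gene that starts with ATG (an assumption that holds here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
fragment = clean(
    "ATGACCCGCG TGAAGGCCGC GATCATCGGT TCGGGCAATA TCGGCACCGA CCTGATGATG"
)
print(translate(fragment))
print(gc_fraction(fragment))
```

The translated fragment corresponds to the first 20 residues of the protein sequence. Running the same functions on the full gene sequence would allow a comparison with the listed GC content and protein length.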