Gene Saro_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2801 
Symbol 
ID3916961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3024456 
End bp3025925 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content66% 
IMG OID640445580 
Productaldehyde dehydrogenase 
Protein accessionYP_498071 
Protein GI87200814 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000306882 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACCCG AAGGCGTGAA CGTAGCCCAT CCCGACAAGC TCTACATCGC CGGCCGCTGG 
GTGCCGGCGC ATTCGGGGCG GATGATCGAA CTGGTCTCGC CCAACACCGA GCTGGTGGTG
GGCCGGGTTG CCGAAGCCGA CGAGACGGAC ATGGACGCCG CCGTTGCCGC CGCGCGTGAT
GCCTTCGATC ACGGGCCATG GCCAACCACG CCTCCGGCCG AGCGCATCGC CGCATTCCGG
CGAATGGTCG CGCACCTTGA AACGCGCGTG CCCGAACTGG CGAAGGCGTG GACCGCGCAG
ATGGGCGGGC TCGCCTCGTT CGCGGGGCCG ATGCACGGCG GCAGCGTGAT GGCGCTGGGC
CAGATTGCGG GGTTTGCGGA AAAGTTCGAG TTCGTCGAGC GCCGCCCGAG CATGGCCGCC
GATACCGCGC TGATCGCCTA TGAGCCGGTT GGGGTCGTGG CAGCGATCGC GCCGTGGAAC
GGGCCGTTCG GGATCATGGC GAACAAGGTC GCCTATGCCC TGCTGACCGG CTGCACGGTG
ATCATGAAGC CGTCGCCCGA AACGCCGCTG GAAGCCTATA TCATCGCCGA GGCGGCCGAG
GCGGCTGGTA TTCCGGCCGG CGTCGTCAAT CTTGTGACGG GACACCGCGA GGCGAGCGAT
CACCTTGTCT GTAACGAAGG CGTCGACAAG GTGAGTTTCA CCGGATCGAC CGTGGCCGGC
AAGCGCATCG CCAGCGTTTG CGGTTCGCGC ATGGCGCGCT GCACGCTGGA ACTGGGCGGC
AAGTCGGCAG CCATCGTGCG GGACGATTTC CCCATCGATG CCGCCGCGAA GATGCTGGCG
GGGACGATCA CGATGATGTC GGGCCAGGTC TGTGCGATGC TCAGCCGGGT CATCGTTCCG
CGCCGCCGTC ACGATGAACT CGCCGAGGCG ATCGCGGCGG AAATGAAGAA GGTGCGCATA
GGCAATTCCG ACGATCCCGA GACCCAGCTC GGCCCGGTCG CGATGAAGCG GCAGCTCGAA
CGGATAGAGA TGTATATCGA GGAGGGCCGG AAGACGGCCG ATCTCGTCAC CGGCGGCGGA
CGCCCGGCCC ATCTGAATCG CGGCTACTTC ATCGAGCCCA CGCTTTTCGC CAACGTCAGG
AACGAGAGCC GCATCGCGCA GGAGGAGATT TTCGGCCCTG TCCTCAGCCT GATCCCGGTC
GAGGACGAGG AGGACGCCAT CCGCACCGCC AATGCCAGCG CATATGGCCT CAACGGATCG
GTCCTGACCA ACGATGCCGA AGCCGCCTAT CGCATCGCCC GGCGCGTGCG CACCGGGGGC
TTCGGTCAGA ACGGGCTGCG CATGGACTTC GGCCTGCCCT TCGGCGGGTT CAAGCAATCC
GGCATCGGCC GCGAAGGCGG GCCGGAAGGG CTCTTCCCCT ATCTCGAAAT GAAGACCATC
CTGATTGATG GCATGCCTTC GGCGCTTTGA
 
Protein sequence
MTPEGVNVAH PDKLYIAGRW VPAHSGRMIE LVSPNTELVV GRVAEADETD MDAAVAAARD 
AFDHGPWPTT PPAERIAAFR RMVAHLETRV PELAKAWTAQ MGGLASFAGP MHGGSVMALG
QIAGFAEKFE FVERRPSMAA DTALIAYEPV GVVAAIAPWN GPFGIMANKV AYALLTGCTV
IMKPSPETPL EAYIIAEAAE AAGIPAGVVN LVTGHREASD HLVCNEGVDK VSFTGSTVAG
KRIASVCGSR MARCTLELGG KSAAIVRDDF PIDAAAKMLA GTITMMSGQV CAMLSRVIVP
RRRHDELAEA IAAEMKKVRI GNSDDPETQL GPVAMKRQLE RIEMYIEEGR KTADLVTGGG
RPAHLNRGYF IEPTLFANVR NESRIAQEEI FGPVLSLIPV EDEEDAIRTA NASAYGLNGS
VLTNDAEAAY RIARRVRTGG FGQNGLRMDF GLPFGGFKQS GIGREGGPEG LFPYLEMKTI
LIDGMPSAL