Gene Saro_1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1197 
Symbol 
ID3916494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1245211 
End bp1246557 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content65% 
IMG OID640443933 
Productaldehyde dehydrogenase 
Protein accessionYP_496476 
Protein GI87199219 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.238764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCCC CGACCGCCGC CGACCTTTCC GCCGACATCG CACGCGTCTT CGCACTCCAG 
CAGGCGCACA TGTGGGAGGC CAAGGCCTCC ACCGCGGCCG AGCGCAAGGA AAAGCTCGCG
CGCCTCAAGG CCGCCGTCGA AGCCCACGCC GACGACATCG TCGCCGCCGT CCTCGAAGAC
ACGCGCAAGC CGGTTGGCGA AATCCGCGTG ACCGAAGTCC TCAACGTCAC CGCCAACATC
CAGCGCAACA TCGACAATCT CGATGAATGG ATGAAGCCGG TCGAGGTCGC CACCTCGCTC
AATCCCGCCG ACCGCGCGCA GATCATCCAC GAAGCGCGCG GCGTCTGCCT GATCCTTGGC
CCCTGGAACT TCCCCCTCGG CCTCGCGCTC GGTCCGGTCG CCGCTGCCAT CGCCGCAGGC
AACACCTGCA TCGTGAAGCT CACCGACCTC TGCCCCGCCA CCGCAAGGGT GGCCTCGGTG
ATCGTCAGGG AAGCGTTCGA CGAAAAGGAT GTGGCTCTGT TCGAAGGCGA CGTCTCGGTC
GCCACCGCGC TCCTCGATCT GCCGTTCAAC CACGTCTTCT TCACCGGCTC GCCCCGCGTC
GGCAAGATCG TGATGGCCGC TGCCGCAAAG CACCTCACCA GCGTCACGCT CGAACTTGGC
GGGAAGTCGC CCGTCATCGT CGACGATAGC GCCGACATCG ATCAGGTCGC CGCCCAGCTC
GCCGCGGCCA AGCAGTTCAA CGGCGGGCAG GCCTGCATCA GCCCGGACTA CGTCTTCGTG
AAGGAAGACA AGAAGGCCGC GCTGGTCGAA GGCTTCCGGG CCAACGTGCA GAAGAACCTC
TATGACGATG CCGGCAACCT GAAGAAGGAC AGCATCGCCC AGGTGGTCAA CAAGGCGAAC
TTCGACCGCG TGAAGGCCAT GTTCGACGAT GCCGTCGCCA AGGGCGCGAC CGTCGCCGCC
GGCGGAACGT TCGAAGCCGA TGACCTCACC ATCCATCCGA CCATGCTGAC CGGCGTCACC
CCGCAGATGA CCATCCTCCA GGACGAAATC TTCGCCCCCG TCATCCCGGT GATGACCTAC
GACACGCTCG ACCAGGCGAT CGGCTACATC GAAGCCCGCG ACAAGCCGCT CGCACTCTAT
GTCTACAGCA AGGACGAAGC GAACGTCGAA AAGGTCCTCG CCCGCACCTC GTCGGGCGGT
GTCACGGTGA ATGGCGTGTT CTCGCACTAC CTGGAAAACA ACCTGCCGTT CGGCGGCGTC
AACACCAGCG GCATGGGCAG CTACCACGGC GTGTTCGGCT TCAAGTGCTT CAGCCACGAA
CGGGCTGTCT ACCGCCACCA GCAGTAA
 
Protein sequence
MTAPTAADLS ADIARVFALQ QAHMWEAKAS TAAERKEKLA RLKAAVEAHA DDIVAAVLED 
TRKPVGEIRV TEVLNVTANI QRNIDNLDEW MKPVEVATSL NPADRAQIIH EARGVCLILG
PWNFPLGLAL GPVAAAIAAG NTCIVKLTDL CPATARVASV IVREAFDEKD VALFEGDVSV
ATALLDLPFN HVFFTGSPRV GKIVMAAAAK HLTSVTLELG GKSPVIVDDS ADIDQVAAQL
AAAKQFNGGQ ACISPDYVFV KEDKKAALVE GFRANVQKNL YDDAGNLKKD SIAQVVNKAN
FDRVKAMFDD AVAKGATVAA GGTFEADDLT IHPTMLTGVT PQMTILQDEI FAPVIPVMTY
DTLDQAIGYI EARDKPLALY VYSKDEANVE KVLARTSSGG VTVNGVFSHY LENNLPFGGV
NTSGMGSYHG VFGFKCFSHE RAVYRHQQ