Gene Saro_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3872 
Symbol 
ID5077483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp41255 
End bp42727 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content64% 
IMG OID640480981 
Productsuccinic semialdehyde dehydrogenase 
Protein accessionYP_001165643 
Protein GI146275482 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.928472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGCGTTA CGATCCCCCC GCTCGAGCGC AGCGATTTCC TGACCGGCCA ACCACTTGTC 
GACGGCACCG AGATCTGTGC CGGCTCTACC TACGCGGTCA CTAACCCGGC AACCGGCAAA
ACGCTGGCGA ATGTCGTCAA GCTGGGCGCT GCCCAGACGC GTCGGGCAAT TGAAAGCAAT
TCCCGCGCCT TGATCGATTG GCGCAAGCGG CCCGCTCAGG AACGCGCGCG CCTGCTGCAT
GACTGGCTGG CGCTGGTCCG GCTCCATCGC CACGATCTTG GTTTGCTGAT GACCGCCGAA
CAGGGCAAAC CGCTGGCCGA AGCGCTGGGC GAGGTCGATT ACGCCGCAAG TTTCATCCAG
TGGTTCGCTG AAGAGGCGCG CCGTGTCTAT GGCGAGGTCA TTCCGGCCAG CGCTGACCGG
CGGATCGTCG TGATCCGTCA GCCGGTCGGC GTGGCGGGCG CCATTACGCC GTGGAACTTT
CCCGCCGCGA TGATCACCCG CAAGGTCGCG CCTGCACTGG CGGCGGGCTG CACCGTGACC
CTCAAACCGT CCGAACTCAC CCCCATGACC GCTTTCGCTC TGGCGAAGCT CGCTCGCGAA
GCGGGCGTTC CGCCCGGCGT GTTCAACGTG GTCTGCGGGG ATGCGCCCGA AATCGGATCG
GTCCTGACCA GCCATCCCGA TGTTACGAAG TTCACCTTCA CCGGTTCGAC CGCGATTGGC
AAGCTGTTGA CCGCTCAATG CGCTGCCACG CTCAAGCGGG TTTCGATGGA ACTGGGCGGC
AATGCCCCGC TGCTGGTGTT CGACGATGCC GATCTCGATC AGGCTGTCGA GGGGGCGATC
GCCTCGAAAT TCCGTAACAC CGGACAGACC TGCGTTTGCG CCAACCGGAT CCTCGTGCAA
AGCGGCATTC ATGACCGCTT CGTCGAGGCG CTGGCCGCCA GGGTCTCCGC GTTCCGGGTC
GGAAACGGCC TTGAAGGTGC AACCGACCAG GGACCGTTAA TCACCGCATC AGCCTTGGCC
AAGGTTCAGG GACATGTCGC CGATGCCGTG GCGCAAGGTG CCCAGCTGGT CACCGGCGGC
AAGAGACACG AGGCCGGAGA ACTGTTCTTC CAACCAACTG TGCTGACTGG AGCGAGACCG
GCGATGCGGC TGGCGGACGA GGAGACCTTC GGGCCGGTGG CCCCGGTGTT CCGTTTCGAA
ACCGAGGCCG AAGCGCTGGC GCTTGCCAAC GCCACGCACT CGGGGCTGGC GGCCTATGCC
TTCACCCGCG ACATTGACCG CGCCTGGCGG GTTTCCGAAG GCCTTGAGAC CGGGATGGTC
GGCTTGAACA GCGGCATCGT CTCGACCGAG ACCGCCCCGT TTGGCGGCAT CAAGGAATCG
GGACTGGGCC GAGAAGGTTC GCGACACGGT ATTGAAGAAT TCCTGGAGAT GAAGACCATC
AGTGTTGGGG TCCGGCCCGA GAGTCCGGTG TAA
 
Protein sequence
MGVTIPPLER SDFLTGQPLV DGTEICAGST YAVTNPATGK TLANVVKLGA AQTRRAIESN 
SRALIDWRKR PAQERARLLH DWLALVRLHR HDLGLLMTAE QGKPLAEALG EVDYAASFIQ
WFAEEARRVY GEVIPASADR RIVVIRQPVG VAGAITPWNF PAAMITRKVA PALAAGCTVT
LKPSELTPMT AFALAKLARE AGVPPGVFNV VCGDAPEIGS VLTSHPDVTK FTFTGSTAIG
KLLTAQCAAT LKRVSMELGG NAPLLVFDDA DLDQAVEGAI ASKFRNTGQT CVCANRILVQ
SGIHDRFVEA LAARVSAFRV GNGLEGATDQ GPLITASALA KVQGHVADAV AQGAQLVTGG
KRHEAGELFF QPTVLTGARP AMRLADEETF GPVAPVFRFE TEAEALALAN ATHSGLAAYA
FTRDIDRAWR VSEGLETGMV GLNSGIVSTE TAPFGGIKES GLGREGSRHG IEEFLEMKTI
SVGVRPESPV