Gene Saro_0874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0874 
Symbol 
ID3917959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp929439 
End bp930590 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content65% 
IMG OID640443607 
Productalcohol dehydrogenase 
Protein accessionYP_496153 
Protein GI87198896 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.101876 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTATCGG ACCGCCACGT CAAAGGGAGA CCGCACGAAA TGAAGACCCG CGCCGCAGTT 
GCGTTCGCGC CCAAGCAGCC GCTCGAGATC GTCGAACTGG ACCTCGAAGG CCCCAAGGCT
GGCGAAGTGC TGGTCGAGAT CATGGCGACC GGCGTGTGCC ACACCGATGC CTACACGCTC
GACGGGTTCG ACAGCGAAGG CATCTTCCCC AGCGTGCTGG GCCACGAAGG CGCCGGTATC
GTGCGCGAGG TGGGCCCTGG GGTCACTTCG GTGAAGCCCG GCGATCACGT GATCCCGCTC
TACACGCCGG AATGCCGCCA GTGCAAATCG TGCCTCTCGG GCAAGACCAA CCTGTGCACC
GCGATCCGCG CCACGCAAGG GCAGGGCCTG ATGCCCGACG GCACCAGCCG CTTTTCGTAC
AAGGGCCAGA CCGTGTTCCA CTACATGGGC TGCTCGACCT TCTCTAACTT CACCGTCCTG
CCCGAGATCG CGGTTGCCAA GATCCGCGAG GACGCGCCGT TCAAGACCTC GTGCTATATC
GGCTGCGGCG TGACGACGGG CGTCGGCGCG GTGATCAACA CCGCCAAGGT CCAGGTCGGT
GACAACGTCG TGGTCTTCGG CCTCGGCGGC ATCGGCCTCA ACGTGATCCA GGGCGCGCGG
CTTGCCGGTG CCGGCAAGAT CATCGGCGTC GACATCAACC CCGACCGCGA GGAATGGGGC
CGCAAGTTCG GCATGACCGA CTTCCTCAAC AGCAAGGGCA TGAGCCGCGA GGACGTCGTC
GCCAAGGTCG TCGCCATGAC CGACGGCGGC GCGGACTACA CCTTCGACGC CACCGGCAAC
ACCGAAGTGA TGCGCACGGC GCTTGAAGCC TGCCATCGCG GCTGGGGCAC CTCCATCATC
ATCGGCGTGG CCGAGGCGGG CAAGGAAATC AGCACGCGTC CGTTCCAGCT CGTCACCGGC
CGCAACTGGC GCGGCACGGC CTTCGGCGGC GCCAAGGGCC GCACCGACGT GCCCAAGATC
GTCGACATGT ACATGACCGG CAAGATCGAG ATCGACCCGA TGATCACCCA TGTCATGGGC
CTGGAAGAGA TCAACACCGC CTTCGACCTG ATGCACGCCG GCAAGTCGAT CCGTTCAGTC
GTGGTGTTCT GA
 
Protein sequence
MLSDRHVKGR PHEMKTRAAV AFAPKQPLEI VELDLEGPKA GEVLVEIMAT GVCHTDAYTL 
DGFDSEGIFP SVLGHEGAGI VREVGPGVTS VKPGDHVIPL YTPECRQCKS CLSGKTNLCT
AIRATQGQGL MPDGTSRFSY KGQTVFHYMG CSTFSNFTVL PEIAVAKIRE DAPFKTSCYI
GCGVTTGVGA VINTAKVQVG DNVVVFGLGG IGLNVIQGAR LAGAGKIIGV DINPDREEWG
RKFGMTDFLN SKGMSREDVV AKVVAMTDGG ADYTFDATGN TEVMRTALEA CHRGWGTSII
IGVAEAGKEI STRPFQLVTG RNWRGTAFGG AKGRTDVPKI VDMYMTGKIE IDPMITHVMG
LEEINTAFDL MHAGKSIRSV VVF