Gene Saro_0995 details


Gene Information

Locus tag: Saro_0995
Symbol:
ID: 3915777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1037964
End bp: 1039052
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 67%
IMG OID: 640443729
Product: alcohol dehydrogenase, zinc-containing
Protein accession: YP_496274
Protein GI: 87199017
COG category: [C] Energy production and conversion
COG ID: [COG1062] Zn-dependent alcohol dehydrogenases, class III
TIGRFAM ID:
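
The gene and protein lengths in this record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon (the function name `cds_lengths` is hypothetical and is not part of any database tool):

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and yields no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(1037964, 1039052))  # (1089, 362)
```

Both values agree with the Gene Length and Protein Length fields above.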


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0675195
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCCG CCGTACTCGT CGAACCGGGC AAGCCGCTGG ATATTCAGCA TCTCAGCGTG 
TCCAAGCCCG GCCCGCATGA AGTCCTTATC CGCACCGCAG CCTGCGGGCT GTGCCATTCG
GACTTGCACT TCATCGAAGG TGCCTATCCC CATCCGCTGC CCGCGGTGCC GGGGCACGAG
GCGGCGGGGA TCGTCGAGGC GGTCGGCTCG GAAGTGCGCA CGGTCAAGGT GGGTGACGCG
GTCGTCACCT GCCTGTCCGC GTTCTGCGGT CATTGCGAGT TCTGCGTGAC CGGCCGGATG
TCGCTGTGCC TTGGCGGCGA CACCCGGCGC GGCGCGGGCG AGGCACCTCG CCTTACCCGC
ACCGACGACG GCAGCGCCGT GAACCAGATG CTCAACCTCT CGGCCTTTGC CGAACAGATG
CTGGTGCACG AACATGCCTG CGTGGCGATC AATCCCGAGA TGCCGCTCGA CCGCGCGGCG
GTGATCGGCT GCGCGGTCAC CACTGGCGCG GGTGCGGTGT TCAACGCGGC GAAGCTGACC
CCGGGCGAGA CGGTCTGCGT GGTCGGCTGT GGCGGCGTCG GCCTTGCCAC GGTCAACGCC
GCGAAGATCG CCGGCGCAGG CCGGATCATC GCGGTGGACC CGATGCCGGA AAAGCGCGAA
CTGGCCATGA AGCTGGGCGC GACCGATGTG ATGGACGCGG GACCCGATGC GGCGGCACAG
ATCGTCGAGA TGACGAAAGG CGGCGTCCAC CATGCGATCG AGGCCGTGGG GCGTCCGGCA
TCGGGCGACC TTGCGGTCGC GACGCTGCGC CGCGGCGGCA CCGCCACGAT CCTTGGCATG
ATGCCGCTGG CACACAAGGT CGGACTTTCC GCGATGGACC TGCTGTCGGA CAAGAAGCTG
CAGGGCGCCA TCATGGGCCG CAACCACTTC CCGGTGGACC TGCCGCGCCT GGTCGACTTC
TACATGCGCG GCTTGCTCGA TCTCGACACG ATCATTGCCG AACGCATCCC GCTCGAAGGG
ATCAACGATG GCTTCGAGAA GATGAAGCAG GGCCATTCCG CCCGCTCTGT CATCGTGTTC
GACCAATGA
 
Protein sequence
MKAAVLVEPG KPLDIQHLSV SKPGPHEVLI RTAACGLCHS DLHFIEGAYP HPLPAVPGHE 
AAGIVEAVGS EVRTVKVGDA VVTCLSAFCG HCEFCVTGRM SLCLGGDTRR GAGEAPRLTR
TDDGSAVNQM LNLSAFAEQM LVHEHACVAI NPEMPLDRAA VIGCAVTTGA GAVFNAAKLT
PGETVCVVGC GGVGLATVNA AKIAGAGRII AVDPMPEKRE LAMKLGATDV MDAGPDAAAQ
IVEMTKGGVH HAIEAVGRPA SGDLAVATLR RGGTATILGM MPLAHKVGLS AMDLLSDKKL
QGAIMGRNHF PVDLPRLVDF YMRGLLDLDT IIAERIPLEG INDGFEKMKQ GHSARSVIVF
DQ
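
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a gene sequence could be cleaned, checked for GC content, and translated, using only the Python standard library. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares; alternative start codons permitted by table 11 are not handled, which does not matter for a gene that starts with ATG.

```python
# Standard codon table, built in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGAAAGCCG CCGTACTCGT CGAACCGGGC"))  # MKAAVLVEPG
print(translate("GACCAATGA"))                          # DQ
```

The two fragments translate to the first ten residues and the last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.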