Gene Saro_2450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2450 
Symbol 
ID3916769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2639757 
End bp2640728 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content67% 
IMG OID640445205 
Productalcohol dehydrogenase 
Protein accessionYP_497720 
Protein GI87200463 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAGT TCGCGCTTGC CGAACTGGTG GGGGCGGCCA GCGGCCGTAT CGTGGATACC 
GCCCTGCCGC AGCCCGCATC GGGCGAGGTC CGAGTGCGGG TCGAGGCGGT TGGTCTCGGC
TTTGTCGATG ATCTCATTGT TTCAGGCCGC TACCAATGGA AGCCGCAATT GCCGTTCGTG
CCGGGGGGCG AAATCGTCGG TACGGTTGAG GCAGTTGGGG AAGGCGTACA GGGCCTGACC
ACTGGTACCC GCGTGGCGGC ATGGCGCATG GGCGGTGGGC TTTGCGAATA TTGCACGCTC
CCGGCCGGAT ATCTCGTCCC CGTTCCGGCC CCGCTTGCGT CGGCAGATGC AGCCGGGATG
GTGCTCGACT ATGCCACGGC GGACTATGCC CTCATCGGCA GGGGCCAATT GCGCAAGGGC
GACACCGTCT TCGTGCTAGG GGCAACGGGC GGCGTGGGCG GCGCGGCAGT CCGCATCGCG
AGAGCGGCAG GCGCAGAAGT CATCGCTGGC GTCTCGAACC TGTCGCAGGG CGATAAAGTG
CTCGCCGATG GCGCTAGTGC AGTCGTCGAC TGCTCGGCTC CGGACTGGCG CGACCAACTG
CGTGGACACC CGCTCGATCT GGTTTTCGAT CCCTTGGGTG GCGCATTCAC CGAACCCGCC
TTCCGTTCGC TCGGAAAGCT GGGGCGGCAT CTCGTGGTGG GCTTCGCTGC GGGCGGTATC
CCGGCATTGC CGGTCAATCT TCCCCTCCTC AAGAGCGCAT CGCTCGTCGG CGTCGATGTG
CGCTTTTTTG CCGAGTCTGA TCCGGAGGGG TTCCGGCAGC GGCTCGCGCT GGTCTTCGAT
CAGGCGGCAC GCGGGAGCCT GCGGCCCCCG GAAACGCTCT GCTTCAGCCT TGGCGAGGCC
GCCGCCGCCT TTGCCGCACT CACCCGACGC GGCCGGGGCG GCAAAGTGGT CGTCTGCCCG
CAATTGTCCT GA
 
Protein sequence
MKQFALAELV GAASGRIVDT ALPQPASGEV RVRVEAVGLG FVDDLIVSGR YQWKPQLPFV 
PGGEIVGTVE AVGEGVQGLT TGTRVAAWRM GGGLCEYCTL PAGYLVPVPA PLASADAAGM
VLDYATADYA LIGRGQLRKG DTVFVLGATG GVGGAAVRIA RAAGAEVIAG VSNLSQGDKV
LADGASAVVD CSAPDWRDQL RGHPLDLVFD PLGGAFTEPA FRSLGKLGRH LVVGFAAGGI
PALPVNLPLL KSASLVGVDV RFFAESDPEG FRQRLALVFD QAARGSLRPP ETLCFSLGEA
AAAFAALTRR GRGGKVVVCP QLS