Gene Saro_3899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3899 
Symbol 
ID5077383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp68869 
End bp69966 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content61% 
IMG OID640481006 
Productalcohol dehydrogenase 
Protein accessionYP_001165668 
Protein GI146275507 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.323018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCAT ACGCGGCAAT TATCGAGCGT CAAGGCGGCG AATTCGTTCT GGATAACGTC 
TCTATCGAGG ATCCGCGCGA CGGCGAAGTG CTGGTCAAGG TTGCCGCAGC TGGCATGTGT
CATACCGACC TGACGGTTCG CGATCAATAT TACCCGACGC CGCTGCCGGC GGTGCTGGGC
CATGAAGGTT CGGGCGTTGT CGAAAAGGTC GGACGTGGCG TCACCACTGT CAAGCCAGGC
GACAAGGTCG TGCTCTCCTT CAGCTATTGC GGCACCTGTC CATCGTGCCT CAAGGGGCAT
CAGGCCTATT GTCCGAGCCT GTTCCCGCTC AATTTCATGG GCCGCCGCCT GGATGGTTCG
ACGCCGATTA CCCGCAACGG CCAAGAGGTC AACGCCTGCT TCTTCGGGCA ATCCTCGTTC
GCGACCTATT CGATCGCGTC GGAAAACAAC TGCGTCAAGG TTGCCGACGA CGCACAGATC
GAACTTTTGG GCCCACTGGG CTGCGGCATC CAGACCGGGG CGGGCAGCAT CCTCAATGCG
CTTTGTCCCG AACCTGGCTC CTCGATCGCG ATCTTCGGGG TCGGGTCGGT CGGCCTCAGC
GCCGTGATGG CCGCCAAGGC CTCGGGCTGC CTCAAGATCA TCGCGGTTGA CCGCAACGCA
GGCCGCTTGG AACTGGCGCG TGAACTGGGC GCCACCGATG TGATCGACGC CAACACGGTC
AACGCTCAGG AAGCGATCGT CGCGATGACC GGTGGCGGCG CCGACTATGC CATGGATACC
ACCGCCATTC CAGCGGTGCT GCGCTCGGCG GTGGACAGCA CGCACAACAT GGGTGAAACC
GCAGTGGTCG GCGGGGCGAA GCTGGGCACC GAGTTTTCGC TAGACATGAA CAACATGCTG
TTTGGCCGCA AGTTGCGCGG CGTAGTCGAA GGATCGAGCA CCCCGCAGGT CTTCATCCCG
CAACTGATTG CGATGCAGAA GGCCGGGCTG TTCCCGTTCG AGAAGCTCTG CACCTTCTAT
GATCTCGACC AGATCAACCA GGCCGTCGAG GATACCGAAA AGACCGGCAA GGCGATCAAG
GCCATTCTCA AAATGTAG
 
Protein sequence
MDAYAAIIER QGGEFVLDNV SIEDPRDGEV LVKVAAAGMC HTDLTVRDQY YPTPLPAVLG 
HEGSGVVEKV GRGVTTVKPG DKVVLSFSYC GTCPSCLKGH QAYCPSLFPL NFMGRRLDGS
TPITRNGQEV NACFFGQSSF ATYSIASENN CVKVADDAQI ELLGPLGCGI QTGAGSILNA
LCPEPGSSIA IFGVGSVGLS AVMAAKASGC LKIIAVDRNA GRLELARELG ATDVIDANTV
NAQEAIVAMT GGGADYAMDT TAIPAVLRSA VDSTHNMGET AVVGGAKLGT EFSLDMNNML
FGRKLRGVVE GSSTPQVFIP QLIAMQKAGL FPFEKLCTFY DLDQINQAVE DTEKTGKAIK
AILKM