Gene Saro_1864 details

Gene Information

Locus tag: Saro_1864
Symbol:
ID: 3917085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1964998
End bp: 1966011
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 67%
IMG OID: 640444608
Product: alcohol dehydrogenase
Protein accession: YP_497138
Protein GI: 87199881
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
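
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1964998
END_BP = 1966011
GENE_LENGTH_BP = 1014
PROTEIN_LENGTH_AA = 337


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 1014
print(coding_length_aa(GENE_LENGTH_BP))  # 337
```

Both results agree with the Gene Length and Protein Length fields.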


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.410641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCAAT GGCTTGTTGC GAAGGGGTCG ACCTCGCTCG ACGATCTGAG AATGGGCGAC 
GTGCCGGTCC CGCAACCGGG TGCGGGTGAG GTGCTGGTAC GGGTTCATGC CTGCTCGCTG
AACTATCGCG ACCAGATCAT TCCCCTGGGC TTCTACATGG GCGGCGTGGT GCAGCATGAC
ACTGTGCCGC TTTCCGATGG CGCGGGCGAA ATCGTAGCGG TGGGTGAGGG TGTTTCCTCG
TTCAAGGTCG GCGACCGGGT GGCCGGGCTC TTCTTCCAGA ACTGGAACGA CGGTCCGCCG
AACCCCGGCG TGGGCCCCGC GCTGGGCGCG CCGCCGGCGC AGGGGATGCT TCAGGATTAC
GTCGTGCTGC CCGAGCACGG TGTCGTGCGC CTTGCCGCGA CGCTGGACTA TACCGAGGCG
GCATGCCTGC CCTGCGCCGG CGTCACTGCC TGGAACGCGC TGATGGAAGG CCCGCGTCCT
GTGAAGGCAG GCGACAGCGT GCTGGTGCTG GGCACCGGCG GCGTGTCGCT GCTGGCCTTG
CAGATCGCCA AGGCCGCAGG AGCGACGGTG ATCGCGACGT CTTCGTCGGA CGAGAAGCTG
GAGCGGGTCA AGGCGCTCGG CGCGGACCAT GTGATCAATT ACCGCACGAC GCCCGAATGG
GGCGCGGAAG CGGCCCGGCT TGCCGGCGGC GGGGTGGACA AGGTCGTCGA GGTTGGCGGG
GCGGGCACGC TTTCGCAGTC GATCGCGGCG GTCGGCTTCG CCGGCGAGAT CGCGCTGATC
GGCGTGCTGA CGCGCGAGGG TGACACCAAC CCGCACGGGC TGATGTTCAA GGGCGCATCG
ATCCGCGGGA TCTTCGTCGG CTCGAAGGGC ATGGCCGAAC GTCTCAACGC CTTCATCGAC
GCGCACGGCA TCAAGCCGGT CGTCGACCGG GCGTTCCCCA TCGAGCAGGC AATGGATGCC
TATTCCTATC AATCTTCGCC GGGGCTCTTC GGGAAGGTCG CAATAACCCT TTGA
 
Protein sequence
MRQWLVAKGS TSLDDLRMGD VPVPQPGAGE VLVRVHACSL NYRDQIIPLG FYMGGVVQHD 
TVPLSDGAGE IVAVGEGVSS FKVGDRVAGL FFQNWNDGPP NPGVGPALGA PPAQGMLQDY
VVLPEHGVVR LAATLDYTEA ACLPCAGVTA WNALMEGPRP VKAGDSVLVL GTGGVSLLAL
QIAKAAGATV IATSSSDEKL ERVKALGADH VINYRTTPEW GAEAARLAGG GVDKVVEVGG
AGTLSQSIAA VGFAGEIALI GVLTREGDTN PHGLMFKGAS IRGIFVGSKG MAERLNAFID
AHGIKPVVDR AFPIEQAMDA YSYQSSPGLF GKVAITL
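
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the database's own tooling: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons), strips the 10-base grouping used in this listing, and translates the first and last lines of the gene sequence. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
FIRST_LINE = "ATGCGCCAAT GGCTTGTTGC GAAGGGGTCG ACCTCGCTCG ACGATCTGAG AATGGGCGAC"
LAST_LINE = "TATTCCTATC AATCTTCGCC GGGGCTCTTC GGGAAGGTCG CAATAACCCT TTGA"

print(translate(FIRST_LINE))  # MRQWLVAKGSTSLDDLRMGD
print(translate(LAST_LINE))   # YSYQSSPGLFGKVAITL*
```

Each full line of the listing holds 60 bases (20 codons), so every line starts in frame and can be translated on its own. Passing the complete gene sequence to `translate` and `gc_fraction` allows a comparison with the full protein sequence and with the GC content field of the record.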