Gene Saro_0475 details


Gene Information

Locus tag: Saro_0475
Symbol:
ID: 3918604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 518616
End bp: 519611
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 67%
IMG OID: 640443205
Product: aldo/keto reductase
Protein accession: YP_495757
Protein GI: 87198500
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
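
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts a single terminal stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 518616
end_bp = 519611
gene_length_bp = 996
protein_length_aa = 331

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon,
# which does not contribute a residue to the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks print True for this record.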


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGACCC GCACTCTCGG CCAGTCCGGC CTCGCCGTTT CCGCGCTCGG CTATGGCTGC 
ATGGGCATCG ATTTCGGCTA CCGCAACAAG CTGAGCCGCG AGGATGGCAT CTCCATCATC
CGCGCCGCCG CAGAGCGCGG CGTCACCTTC TTCGACACCG CCGAAGTCTA CGGGCCCTGG
ACCAACGAGG ACATGGTCGG TGAAGCACTG GAACCCTTCA AGGACAAGGT CGTCATCGCC
ACCAAGTTCG GCTTCGACAT CGATCCCGAC ACCGGCGCCA TCAGCGGAAC CGACAGTCGT
CCCGAACGCA TCCGCAAGGT CTGCGAAGCA TCGCTGAAGC GCCTGCGGGT CGACGCCATC
GACCTGTTCT ACCAGCACCG CGTCGATCCC AAGGTGCCGA TCGAGGACGT CGCCGGAACC
GTGCGCGACC TCATCGCCGA GGGCAAGGTC AAGCACTTCG GCCTGTCCGA ACCGGGAGCC
CCCACGGTGC GCCGGGCCCA TGCAGTCCAG CCCGTGGCAG CCCTGCAAAA CGAGTACTCG
CTGTGGACGC GCCAGGTCGA AAGCAACGGC ATTCTCGACA CCTGCCGCGA ACTCGGGATC
GGCCTCGTGC CCTATTCTCC GCTCGGCAAG GGCTTCCTTG CCGGGGGCGT GACCAGCGCC
GAACAGGTCG CCAATGGCGA CTTCCGCGGC ACCCTGCCCC GCTTCCAGGC AGCCGCCTTC
GCGCACAATC TCCAGTTGCT CGACCTCGTG AAGAGGATCG CCGCAGAACG CGATGCCACG
CCCGCGCAGA TTGCGCTGGC CTGGCTACTC GCCAAGGCCC CGTTCATCGT GCCGATCCCC
GGCACGACGA AACTGCACCG CCTCGACGAA AATCTGGGCG CGGCCGACGT CGTCCTGACC
GGCACCGACC TGACCGAGAT CGAGGCCCTC CTCGCCACCG TCACCGTCGT CGGCACGCGC
TACCCTCCCG AACGCGAGGC AGCCACCGGC CTGTGA
 
Protein sequence
MQTRTLGQSG LAVSALGYGC MGIDFGYRNK LSREDGISII RAAAERGVTF FDTAEVYGPW 
TNEDMVGEAL EPFKDKVVIA TKFGFDIDPD TGAISGTDSR PERIRKVCEA SLKRLRVDAI
DLFYQHRVDP KVPIEDVAGT VRDLIAEGKV KHFGLSEPGA PTVRRAHAVQ PVAALQNEYS
LWTRQVESNG ILDTCRELGI GLVPYSPLGK GFLAGGVTSA EQVANGDFRG TLPRFQAAAF
AHNLQLLDLV KRIAAERDAT PAQIALAWLL AKAPFIVPIP GTTKLHRLDE NLGAADVVLT
GTDLTEIEAL LATVTVVGTR YPPEREAATG L
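
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a hedged sketch rather than the method used to produce the record. The function names `translate` and `gc_fraction` are illustrative. The codon table is written out by hand and uses the amino acid assignments of the standard genetic code, which translation table 11 shares. The sketch does not handle the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCAGACCC GCACTCTCGG CCAGTCCGGC"
print(translate(first_blocks))  # MQTRTLGQSG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.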