Gene Information |
Locus tag | Saro_0475 |
Symbol | |
ID | 3918604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 518616 |
End bp | 519611 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640443205 |
Product | aldo/keto reductase |
Protein accession | YP_495757 |
Protein GI | 87198500 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACCC GCACTCTCGG CCAGTCCGGC CTCGCCGTTT CCGCGCTCGG CTATGGCTGC ATGGGCATCG ATTTCGGCTA CCGCAACAAG CTGAGCCGCG AGGATGGCAT CTCCATCATC CGCGCCGCCG CAGAGCGCGG CGTCACCTTC TTCGACACCG CCGAAGTCTA CGGGCCCTGG ACCAACGAGG ACATGGTCGG TGAAGCACTG GAACCCTTCA AGGACAAGGT CGTCATCGCC ACCAAGTTCG GCTTCGACAT CGATCCCGAC ACCGGCGCCA TCAGCGGAAC CGACAGTCGT CCCGAACGCA TCCGCAAGGT CTGCGAAGCA TCGCTGAAGC GCCTGCGGGT CGACGCCATC GACCTGTTCT ACCAGCACCG CGTCGATCCC AAGGTGCCGA TCGAGGACGT CGCCGGAACC GTGCGCGACC TCATCGCCGA GGGCAAGGTC AAGCACTTCG GCCTGTCCGA ACCGGGAGCC CCCACGGTGC GCCGGGCCCA TGCAGTCCAG CCCGTGGCAG CCCTGCAAAA CGAGTACTCG CTGTGGACGC GCCAGGTCGA AAGCAACGGC ATTCTCGACA CCTGCCGCGA ACTCGGGATC GGCCTCGTGC CCTATTCTCC GCTCGGCAAG GGCTTCCTTG CCGGGGGCGT GACCAGCGCC GAACAGGTCG CCAATGGCGA CTTCCGCGGC ACCCTGCCCC GCTTCCAGGC AGCCGCCTTC GCGCACAATC TCCAGTTGCT CGACCTCGTG AAGAGGATCG CCGCAGAACG CGATGCCACG CCCGCGCAGA TTGCGCTGGC CTGGCTACTC GCCAAGGCCC CGTTCATCGT GCCGATCCCC GGCACGACGA AACTGCACCG CCTCGACGAA AATCTGGGCG CGGCCGACGT CGTCCTGACC GGCACCGACC TGACCGAGAT CGAGGCCCTC CTCGCCACCG TCACCGTCGT CGGCACGCGC TACCCTCCCG AACGCGAGGC AGCCACCGGC CTGTGA
|
Protein sequence | MQTRTLGQSG LAVSALGYGC MGIDFGYRNK LSREDGISII RAAAERGVTF FDTAEVYGPW TNEDMVGEAL EPFKDKVVIA TKFGFDIDPD TGAISGTDSR PERIRKVCEA SLKRLRVDAI DLFYQHRVDP KVPIEDVAGT VRDLIAEGKV KHFGLSEPGA PTVRRAHAVQ PVAALQNEYS LWTRQVESNG ILDTCRELGI GLVPYSPLGK GFLAGGVTSA EQVANGDFRG TLPRFQAAAF AHNLQLLDLV KRIAAERDAT PAQIALAWLL AKAPFIVPIP GTTKLHRLDE NLGAADVVLT GTDLTEIEAL LATVTVVGTR YPPEREAATG L
|
| |
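The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is not part of the source database. It assumes that Start bp and End bp are both inclusive, 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `gene_length_bp`, `protein_length_aa`, and `ungroup` are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming start and end are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def ungroup(seq: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks of 10."""
    return "".join(seq.split())


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(518616, 519611)
print(length)                     # 996, matching "Gene Length"
print(protein_length_aa(length))  # 331, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields. `ungroup` can be applied to the Gene sequence and Protein sequence fields before any downstream analysis, since both are displayed in space-separated blocks of ten characters.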