Gene Information |
Locus tag | Saro_1561 |
Symbol | |
ID | 3917236 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1618665 |
End bp | 1620215 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444301 |
Product | short chain dehydrogenase |
Protein accession | YP_496835 |
Protein GI | 87199578 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
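The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; neither convention is stated in the record, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1618665
end_bp = 1620215
gene_length_bp = 1551
protein_length_aa = 516


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(gene_len: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = coding_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints: 1551 516
```

Both computed values agree with the Gene Length and Protein Length fields, which is what one would expect for an unspliced CDS on a single replicon.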
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.636243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAGCGA AACAAAGGGA AGGGCGTGCG CGCCTCGTTC TGGTGACCGG CGCGGCCCGC GGCATCGGTC TGGCGTGTGC CCACCGCTTT GCCAGGGCCG GTGATCGCGT GGTCATGGCC GACCGCGACT TGGCGGCGTG CACTTTCGAG GCTGAAAGGC TCGGCTCGCG GCATGTCGCG CTGCAACTGG ACGTTTCGGA CGAAGCGGCG GTCGAGCACG CGATGGACGG TCTTCTGCAG CAGTTCGGCG CGTTCGATGT CGTGATCAAC AACGCCGGGG TGGTGGATCG CTTTGCCCGG CCGCTTCTTG ACGTACCGCC GGAGGATATC GACCGGCTGA TAGGCGTCAA TCTCGAAGGT CCCTATCTGG TTGTGCGCGC TGCGCTGCGG ACGATCCTTG CCGGACGGCG TGGCGCGGCA ATCGTCAACG TCGCATCGGG CGCGGCACTG CGCGCGCTGC CGGGCCGTGC CGCCTACAGC ATGACAAAGG CGGGCGTCAT CGGCATGACC CGCGCGATGG CGATAGAGCT TGGCCCCCAG GGTATCGCCG TCAACGCCGT GCTGCCGGGA TACATCGACA CCGAAATTCT CCTTGCTCTG GAGCGGGAGG GCAAGTTCGA CCGCGCCGCC GCTGCCGGCG CAATACCGAT GGGAAGGCTC GGCCGGACAG ATGAGATTGC CGAGGCGGTC CATTACCTCG CGCGCGGGGG TTATCATTGC GGCAGCCTGC TTTCGGTCGA TGGTGGGGTC GATGCGTATG GCGGTTCGGG CAAGGCCTCC ACCGCCGTCA TGCCGCACCG CCCGGTGCGC GCGGGCGACG TCGCTTGCGT GACCGGCGGG GCGAGCGGCA TCGGCGCCGT TGTGGCAGAC CGGCTTGCCG GGCTCGGCTG GCTCGTGGCG ATAATCGACA GCCGGGAAAT CGCGGACGGA CCACACCCTG CGTGGCAGGC CGACATCGCC AGCGAAGCCT CGGTCGAGAG CGCGATGGCA GGCATCGCTG GCCAGCTCGG CCCGGTGACG CTGCTGGTCA ACAATGCCGG TATCGTCGAA CCCATGGCGA AGTCTGCCGA CCAGGCGCTT GCCGACTTCC GCCGCACGAT CGACGTGAAC GTGAAGGGCA CTATCCATGC ATCGCGCGCG GCTGCGCGGC AGATGATCGG CGCGGGCGGT GGGGCCATTG TCAATCTTTC CTCCATCACG GCATCGCTCG GTTTGCCGGG GCGCAATGCC TATTGCGCGT CGAAATCTGC CGTCACCATG CTCACCCGCA GTCTCGCCTG CGAATGGGCC GCGCATGGCA TCCGGGTGAA TGCGGTCGCG CCAGGATACA TCCTGACCCC CGCAGTGCAG GCCTTGCTGG CTTCGGGAGA GCGCGACATG AACTCCGTCG TCCGGCGCAT ACCGGTGGCG CGCCTTGGTC AGCCTGACGA AGTGGCGGAC GCCATCGCGT TTCTGGCCTC GGATGCGGCA TCCTATGTTA CCGGCGCCAC GCTTCAGGTG GATGGCGGCT ATCTTGCCAG CGGGCATCCG CCCGATGGAC CGATGCCCTG A
|
Protein sequence | MEAKQREGRA RLVLVTGAAR GIGLACAHRF ARAGDRVVMA DRDLAACTFE AERLGSRHVA LQLDVSDEAA VEHAMDGLLQ QFGAFDVVIN NAGVVDRFAR PLLDVPPEDI DRLIGVNLEG PYLVVRAALR TILAGRRGAA IVNVASGAAL RALPGRAAYS MTKAGVIGMT RAMAIELGPQ GIAVNAVLPG YIDTEILLAL EREGKFDRAA AAGAIPMGRL GRTDEIAEAV HYLARGGYHC GSLLSVDGGV DAYGGSGKAS TAVMPHRPVR AGDVACVTGG ASGIGAVVAD RLAGLGWLVA IIDSREIADG PHPAWQADIA SEASVESAMA GIAGQLGPVT LLVNNAGIVE PMAKSADQAL ADFRRTIDVN VKGTIHASRA AARQMIGAGG GAIVNLSSIT ASLGLPGRNA YCASKSAVTM LTRSLACEWA AHGIRVNAVA PGYILTPAVQ ALLASGERDM NSVVRRIPVA RLGQPDEVAD AIAFLASDAA SYVTGATLQV DGGYLASGHP PDGPMP
|
| |
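The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine when it initiates translation. The sketch below illustrates this on the first 30 and last 6 bases of the gene sequence. It is a hedged illustration, not a full translator: the codon table holds only the codons that occur in these two fragments, the start-codon set is a subset of those allowed by table 11, and all names are hypothetical.

```python
# Fragments copied from the Gene sequence field; the record groups bases in
# blocks of ten, so spaces are stripped before use.
gene_start = "GTGGAAGCGA AACAAAGGGA AGGGCGTGCG".replace(" ", "")
gene_end = "CCCTGA"

# Partial codon table: only the codons present in the fragments above.
CODONS = {
    "GTG": "V", "GAA": "E", "GCG": "A", "AAA": "K", "CAA": "Q",
    "AGG": "R", "GGG": "G", "CGT": "R", "CCC": "P", "TGA": "*",
}

# Subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = False) -> str:
    """Translate codon by codon; an initiating start codon is read as M."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


n_terminus = translate(gene_start, first_is_start=True)
c_terminus = translate(gene_end)

print(n_terminus)  # prints: MEAKQREGRA
print(c_terminus)  # prints: P*
```

The first ten residues match the start of the protein sequence in the record, and the final two codons give the last proline followed by a TGA stop, so the listed gene and protein sequences are in the same reading frame at both ends.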