Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1546 |
Symbol | |
ID | 3917221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1599944 |
End bp | 1600894 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640444286 |
Product | taurine dioxygenase |
Protein accession | YP_496820 |
Protein GI | 87199563 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCCA TCACCACTTA TGCCAATGCC AAGGACCCCA ACTCTCCTCT CGACATCGTT CCGGTCACAG GAACCATCGG TGCGGAAATC CGCGGCGTGA CGCTGTCCGG CGATCTTGAT GCCGGGACCG TCCAGGCGAT CAAGGATGCC GTCGTGCGCC ACAAGGTCGT GTTCTTCCGC GGCCAGAAGG ACCTTGACGA TGCACGACAT GAAGGCTTCG CCTCGTTGTT CGGAGAGCCG GTCGCACATC CAACCGTGCC GGTCGCGGAA GGTTCGCGCT ATCTGCTCGA ACTCGACAGC AAGGAAGGCT ACGCGGCCTC GAGCTGGCAC ACCGACGTGA CCTTCGTCGA TTCCTATCCC AAGGGCTCGA TCCTACGCGC GATCACTGTT CCTGAAGCGG GCGGGGATAC CGTCTGGGCG AATGGTGAAA CCGCCTACGA AAGCCTACCG GAATCCCTTC GCCAACTGGT GAACAACCTT TGGGCGGTCC ACACCAACCT TTACGACTAC GCCGCCGTCC TCAATGCACC CAAGGGCGAC GAGACCGAGC GCGAGCGCGT GAACTTCCAC AAGAGCGTGT TCGCATCGAC CGTCTACGAG ACCGAGCACC CCGTGGTCCG CGTCCACCCC GTCTCGGGCC AGCGCAGCCT GCTGCTGGGT CATTTCGTGA AGCAATTCGT CGGGCTCAAC CAGGCGGACT CCTCGCGCCT GTTCCAGATC CTGCAGGATC ACATCACCCG GCCCGAAAAC GTCGTGCGCT GGCGCTGGCA GCCGGGTGAT GTGGCGTTCT GGGACAACCA GTCGACCCAG CACCGTGCCG TCGCCGACTT TGGTCTGCAG CGTCGCACCC TGCGCCGCGC TACTATTGCA GGTGAAGTGC CGGTCGGCAT CGACGGTCGC CAGAGCCGTA CCGTGCGGAA GGAAAAGGCC ACCGAATACC AGCCCGCCTG A
|
Protein sequence | MTAITTYANA KDPNSPLDIV PVTGTIGAEI RGVTLSGDLD AGTVQAIKDA VVRHKVVFFR GQKDLDDARH EGFASLFGEP VAHPTVPVAE GSRYLLELDS KEGYAASSWH TDVTFVDSYP KGSILRAITV PEAGGDTVWA NGETAYESLP ESLRQLVNNL WAVHTNLYDY AAVLNAPKGD ETERERVNFH KSVFASTVYE TEHPVVRVHP VSGQRSLLLG HFVKQFVGLN QADSSRLFQI LQDHITRPEN VVRWRWQPGD VAFWDNQSTQ HRAVADFGLQ RRTLRRATIA GEVPVGIDGR QSRTVRKEKA TEYQPA
|
| |