Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2312 |
Symbol | |
ID | 3915657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2452218 |
End bp | 2453627 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640445068 |
Product | 2-nitropropane dioxygenase, NPD |
Protein accession | YP_497583 |
Protein GI | 87200326 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.241055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCGT TCAAGGGGTT GAAGCCGATT CTCTATGGTG GCCGTGAAGT CTGGCCACTG GTCGAGGGCG GCAAGGGCGT GGCGGCGACG AACCACATGA GTTCCGGCGC CTGGGCGGCA GCCGGCGGCA TCGGCACGGT CAGCGCGGTC AATGCCGACA GCTACGACGC CGAAGGCAAG ATCGTTCCAC AGGTTTATCA CGCCCTCACG CGCAAGGAGC GCCACGAGGA GCTGATCAAG TACGCGATCG ACGGCGCGGT CGAGCAGGTC AAGCGAGCCT ATGACATCGC CAGCGGCAAG GGCGCGATCA ACATCAACGT GCTGTGGGAA ATGGGCGGCG CGCAGCAGGT GCTCGAGGGT GTTCTGGAAA AGACCCGCGG CCTGGTCACC GGCGTCACCT GCGGCGCCGG CATGCCGTAC AAGCTGTCCG AGATCGCGGC GCGGTTCAAC GTGAACTATC TGCCCATCGT GTCGTCGGGC CGTGCATTCC GCGCGCTGTG GAAGCGCGCC TACCACAAGG TTTCGCACCT GCTTGCCGCC GTGGTCTATG AAGACCCGTG GCTGGCGGGC GGCCACAATG GCCTGTCCAA CGCCGAAGAC CCGCGCAAGC CGGAAGACCC CTATCCGCGC GTCAAGGCGC TGCGCGACGT GATGCGCGCC GAAGGCGTTT CGGATGACGT TCCCATCGTC ATGGCGGGCG GCGTCTGGTT CCTGCGGGAA TGGAACGACT GGATCGACAA TCCCGAGCTT GGGGCGATTG CCTTCCAGTT CGGCACGCGC CCCCTGCTGA CCGAGGAAAG CCCGATCCCC CAGGGGTGGA AGGACCACCT GCGCACGCTC GAGCCGGGCG ACGTGTTGCT GCATCGCTTC TCGCCCACGG GGTTCTACTC GTCGGCGGTG CGTAATCCGT TCCTGCGCGC GCTCGAAGCG CGGTCGGAAC GCCAGATTCC CTATTCGCGG GTGGAAGCCG GCGAACACAC CGCGCAACTC GACGTCGGCG TAAGGGGCAA GAACTTCTGG GTGACGCCGA ACGACCTGGC GCGCGCGCGC GAGTGGCACG GTGCCGGTTT CGTCCACGCC CTTCGCACGC CCGACGACAC GATGGTCTTC GTGACGCCCC AGGAACGCGA TGAAATCCAG CAGGACCAGA AGGACTGCAT GGGCTGCCTT TCGCACTGCG GGTTCTCGTC GTGGAAGGAT CACGACGACT ACACGACCGG GCGGCTTGCC GATCCGCGCA GCTTCTGCAT CCAGAAGACC TTGCAGGACA TCGCGCACGG CGGCGATATC GACCAGAACC TGATGTTCGC GGGCCATGCG GCATACCGCT TCAAGCAGGA CCCGTTCTAT TCGAACAACT TCACCCCGAC GGTGAAGCAG TTGGTCGATC GCATCCTGAC CGGCGACTGA
|
Protein sequence | MSAFKGLKPI LYGGREVWPL VEGGKGVAAT NHMSSGAWAA AGGIGTVSAV NADSYDAEGK IVPQVYHALT RKERHEELIK YAIDGAVEQV KRAYDIASGK GAININVLWE MGGAQQVLEG VLEKTRGLVT GVTCGAGMPY KLSEIAARFN VNYLPIVSSG RAFRALWKRA YHKVSHLLAA VVYEDPWLAG GHNGLSNAED PRKPEDPYPR VKALRDVMRA EGVSDDVPIV MAGGVWFLRE WNDWIDNPEL GAIAFQFGTR PLLTEESPIP QGWKDHLRTL EPGDVLLHRF SPTGFYSSAV RNPFLRALEA RSERQIPYSR VEAGEHTAQL DVGVRGKNFW VTPNDLARAR EWHGAGFVHA LRTPDDTMVF VTPQERDEIQ QDQKDCMGCL SHCGFSSWKD HDDYTTGRLA DPRSFCIQKT LQDIAHGGDI DQNLMFAGHA AYRFKQDPFY SNNFTPTVKQ LVDRILTGD
|
| |