Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3901 |
Symbol | |
ID | 5077385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 70559 |
End bp | 71938 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640481008 |
Product | ring hydroxylating dioxygenase, alpha subunit |
Protein accession | YP_001165670 |
Protein GI | 146275509 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.682799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGGAT CGGCGGCACT CGTCGATAAT GCCAACGCGA GCCAGTCTCG CCGGGTGTTC TGGGATCAGG ACGTCTATCA GCTGGAGCTG GAGCGGATAT TCTCGCGATG CTGGCTCATG CTCGGACATG ATTCGCTAGT TCCCAAACCG GGCGACTTCA TTACGACGTA CATGGCAGAA GACCGTGTCA TCTTGTCGCG GCAGCCGGAC GGTTCGCTGA AGGCGTTCAT CAATTCCTGC ACTCACCGCG GCAACCAGAT CTGTCATGCC GACAGCGGCA GCGCCAAGGC GTTCGTGTGC AATTATCATG GTTGGGTTTT CGGTCAGGAT GGTTCGCTCG TCGATGTTCC GATGGAAGAG CGGTGCTATC ACAGCGATCT CGACAAATCC AAGCTGGGGC TCGCGCCGAT CCGGGTCGAA ACTTACAAGG GCTTCATCTT CGGCTGCCAT GATCCCGAAG CGCCCTCGCT TGAGGACTAT TTGGGGGACT TCTGCTGGTA CCTCGATACG ATCTGGGACG GTCCGGACGG TGGTCTGGAA CTGCTCGGGC CGCCGTTGAA GAGCACCCTC GCCTGCAATT GGAAAGTCCC GACCGAGAAC TTCGTCGGCG ATGGGTATCA CGTGGGCTGG ACGCATGCCG CCGCTCTCCA GATGATCGGG GGCGAGCTGG CTGGCCTGTC GGGCAATCGC GCCGACATGC CGTTTGACGA CCTTGGTCTG CAATTCACCA TGCGGCATGG CCACGGGTTT GGCCTGATCG ATAACGCGGC GACTGCGATC CACGTCAAGC GCGACGGGTA CGTCAAATAT CTCGAGGAGA CGCGGGGCGG AATTCGCGAA AAATTCGGGC CGGAGCGCGA ACGGCTCTAT GTCGGTCACT GGAATACGTC GATCTTCCCA AACTGTTCGT TCCTCTACGG AACCAACACC TTCAAGATCT GGCATCCGCG CGGGCCGCAT GAAATCGAGG TCTGGACCTA TACCATGGTA CCGAAGAATG CCGACACCGA AACTAAGCGG TCGATCCAGC GCGAAGCGAT CCGTTCATTT GGTACGGCGG GAACGCTCGA AAGCGACGAT GGCGAAAACA TGTCGTCGGC CACCTACAAC AACAACGGTA TCATCACCCG CAAGGGGCGG ATGAATTCGA GCATGGGCAA GGACCGCGAA GGGCCGCACC CCGTCTATCC AGGAATTGTC GGGGTCAGCT TCATCGGCGA AACCTCGTAT CGAGGCTTTT ATCGTTTCTG GCAAGAAATG CTCGATGCGC CAGATTGGGC CGCCATCCGG GCCAATGACG ATACCTGGGA TGCAATGTGG ACCAACCGTA ATTTCTGGCC TGAACGTCTG TCGGCGAAGC AAGCCGAGCC GCAAGACTGA
|
Protein sequence | MNGSAALVDN ANASQSRRVF WDQDVYQLEL ERIFSRCWLM LGHDSLVPKP GDFITTYMAE DRVILSRQPD GSLKAFINSC THRGNQICHA DSGSAKAFVC NYHGWVFGQD GSLVDVPMEE RCYHSDLDKS KLGLAPIRVE TYKGFIFGCH DPEAPSLEDY LGDFCWYLDT IWDGPDGGLE LLGPPLKSTL ACNWKVPTEN FVGDGYHVGW THAAALQMIG GELAGLSGNR ADMPFDDLGL QFTMRHGHGF GLIDNAATAI HVKRDGYVKY LEETRGGIRE KFGPERERLY VGHWNTSIFP NCSFLYGTNT FKIWHPRGPH EIEVWTYTMV PKNADTETKR SIQREAIRSF GTAGTLESDD GENMSSATYN NNGIITRKGR MNSSMGKDRE GPHPVYPGIV GVSFIGETSY RGFYRFWQEM LDAPDWAAIR ANDDTWDAMW TNRNFWPERL SAKQAEPQD
|
| |