Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2443 |
Symbol | |
ID | 3916762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2629276 |
End bp | 2630364 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445198 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_497713 |
Protein GI | 87200456 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAAC AGAATTGCGA AATCGCCTGG TTCTCGGCGC TGTGCGACGA CGACTACGAG TTCCTCGGGG TGCCGGACAA GTACCTCCAG TCGAGCTGGG AGCATTGCCG AAACATCGTG TTGCGCGCCG AGGAGGGTGG CTTCGACAAC ATTCTCCTGC CCTCGGGCTA CCAACTCGGG CTCGATACCA CCGCCTTTGC CGCGGCGGTC GCCACGCAGG TGCGCCGCAT CAAGCTGCTC TGGGCGACGC GCATGGGCGA GGACTGGCCG CCGCAGCTCG CGCGCCGCAT CGCCACGCTC GACCGCATTC TCGGCCCTAA TGCTGAAGGC ACCGGCGGGC GGCTCAACGT CAACATCATC TCGTCCGATA TGCCAGGCGA AACGATCGCC AGCGGGCCGC GCTATGCCCG CGCGACCGAG ATCATGAAGA TCGTGCGCAC CCTGCTGAAC GGCGAGCATC TCGATTTCCA GGGCGAGTTC TACAAGCTCA AGCTCGATCC GCCGCGCATT GGCACGATCT CCGGCAGGTG TCCGGCATTC TACTTCGGCG GCCTCTCCCA CGACGCGCGC GAATGCGCGG CGGAGGCGAG CGACGTCTAC CTGATGTGGC CCGACACGAT GGACAAGGTG CGCGAGACCA TCGCCGACAT GAAGGCGCGG GCCGCCAATT ATGGCCGCAC GCTCAGGTTC GGCTATCGCG TCCACGTAGT CGTGCGCGAG ACCGAGGACG AGGCTCGTGC CTATGCCGAC CGGCTCCTGT CCAAGCTCGA CGACGAGGCC GGCAAGGCGA TCCGCGAGAA GTCGCTCGAT GCCAAGAACT TCGGCGTGCA GCGCCAGCAG GAACTGCGGG GCGCGGCCGA CGGCGACGGC TTTGTCGAGG AGAACCTTTG GACCGGCATC GGCCGCGCCC GCTCCGGTTG CGGCGCAGCC ATCGTCGGCA CCCCGGACCA GGTCCTGGCG AAGCTGCGCG CGTATCAGGC CGAAGGCATC GAGGCTTTCA TCCTGTCGGG TTATCCGCAT GCGCAGGAAG CCGACATGTT CGCCCGCTAC GTCCTGCCGC ACATCAACCA CGGACCGTTG AACATCTGA
|
Protein sequence | MAEQNCEIAW FSALCDDDYE FLGVPDKYLQ SSWEHCRNIV LRAEEGGFDN ILLPSGYQLG LDTTAFAAAV ATQVRRIKLL WATRMGEDWP PQLARRIATL DRILGPNAEG TGGRLNVNII SSDMPGETIA SGPRYARATE IMKIVRTLLN GEHLDFQGEF YKLKLDPPRI GTISGRCPAF YFGGLSHDAR ECAAEASDVY LMWPDTMDKV RETIADMKAR AANYGRTLRF GYRVHVVVRE TEDEARAYAD RLLSKLDDEA GKAIREKSLD AKNFGVQRQQ ELRGAADGDG FVEENLWTGI GRARSGCGAA IVGTPDQVLA KLRAYQAEGI EAFILSGYPH AQEADMFARY VLPHINHGPL NI
|
| |