Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_43870 |
Symbol | msuD |
ID | 7763260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4434287 |
End bp | 4435420 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643807242 |
Product | Alkanesulfonate monooxygenase |
Protein accession | YP_002801483 |
Protein GI | 226946410 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTGG ACATCTTCTG GTTCCTTCCC ACTTCCGGCG ATACCCGCTA TCTCGGCAAG TCTTCCTCCG GCCGCCCGGC GACCAACGAA TACATGCGGC AGATCGCCGT GACCGCCGAA AGCCTCGGCT ACGACGGCCT GCTGATCCCC ACCGGCAGCA GTTGCCTGGA CCCCTGGGTG ACGGCGGCCA GCCTGGTGCC AGTGACCAGC CGCATCAAGC TGCTGGTGGC GTTGCGCACC TCGGTCAGCG GCCCGACCGC CACCGCGCGC CAGGCCGCCA CTCTCGACCA GGCGCTGAAG GGCCGCCTGC TGCTCAACGT GGTGCCGGGC GGCGATGCCA CCGAACTGGC CGCCGACGGT GTGTTCCTCG ACCATGACGA GCGCTACGAG GCGGCCGACG AAGTGCTCAC CGTGTGGCGC GACCTGCTGC AGGGCAAGAC CGTCGACTTC GCCGGCAAGC ACGTCACCGT CGAGGGCGCG AAGAACTTCT TCCCGCCGGT GCAGCAACCC TATCCGCCGC TGTATTTCGG CGGCTCCTCG CCGGCGGCCC ACGAACTGGC GGCCAAGCAC GTCGACGCCT ACCTGACCTG GGGCGAACCG CCGGCGGCGG TGGCCGAGAA GATCGCCGAC GTGCGCGAGC GGGCGAAGAA GTACGGGCGC AGCGTGCGCT TCGGCGTGCG CCTGCACGTG ATCGTGCGCG AGACCAACGA GGAAGCCTGG GCCGCCGCCG AGAAGCTGAT CAGCCATCTC GACGACGAGA CCATCGCCAA GGCCCAGGCC AACTACGCGG CCATGGACTC CGAGGGCCAG CGGCGCATGG CCGCGCTGCA CGGCGGGCGG CGCGACAAAC TGGAAGTCAG CCCCAACCTC TGGGCCGGGG TCGGCCTGGT GCGCGGCGGC GCCGGCACCG CGCTGGTCGG CGACCCGCAG ACCGTGGCCG CGCGCCTCAA GGAATACGCC GACCTCGGCG TCGACAGCTT CGTGCTCTCC GGCTATCCGC ACCTGGAGGA GGCCATTCGC TTCGCCGAAC TGGTGTTCCC GCTGCTGCCC GGCAAGCAGC CGGTGACCGT CGAGGAGGAA CTGACCGGCG GCGCCTTCGA CGTGCGCGCC ACCAAGAGCG AGGCCGCCGC ATGA
|
Protein sequence | MSLDIFWFLP TSGDTRYLGK SSSGRPATNE YMRQIAVTAE SLGYDGLLIP TGSSCLDPWV TAASLVPVTS RIKLLVALRT SVSGPTATAR QAATLDQALK GRLLLNVVPG GDATELAADG VFLDHDERYE AADEVLTVWR DLLQGKTVDF AGKHVTVEGA KNFFPPVQQP YPPLYFGGSS PAAHELAAKH VDAYLTWGEP PAAVAEKIAD VRERAKKYGR SVRFGVRLHV IVRETNEEAW AAAEKLISHL DDETIAKAQA NYAAMDSEGQ RRMAALHGGR RDKLEVSPNL WAGVGLVRGG AGTALVGDPQ TVAARLKEYA DLGVDSFVLS GYPHLEEAIR FAELVFPLLP GKQPVTVEEE LTGGAFDVRA TKSEAAA
|
| |