Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_31340 |
Symbol | dszA |
ID | 7762033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3240053 |
End bp | 3241378 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643806008 |
Product | dibenzothiophene sulfone monooxygenase |
Protein accession | YP_002800272 |
Protein GI | 226945199 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.885637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCC AGATCCACCT GGCCGGCTTC CTGCTCGCCG GCCCCGTCGT CCACAGCCAT GCCGCCTGGC GGCACCCGGA AACCCAGGGC AACTTCCTCG AACCCGAGTA CTACGCGCGC AGCGCGAAGG TACTGGAAGA AGGTCTGTTC GACCTGCTGT TCTTCGCCGA CCGCTTCGCC ATCGGCGACC AGTTGGGCGG CTCCCGAGAG CTGGCGCTGC GCCACGGCGC CCAGGACGCC ACCCGCCTCG ACCCGCTGCC GATCCTCTCC TACCTGGTCG GCCAGACCAG CCGGCTCGGC CTCGGCGTCA CCCGCTCCAC CACCTATTAC GAGCCGGTCC ACGTGGCCCG CTCGCTGGCC ACCCTCGACC ATCTGTCGCG CGGGCGCGCC GCCTGGAACA TCGTCACCTC GATGAACGAC AGCGAAGGCC GGCTGTTCGG CAAGGACCGC CACCTGGAGC ACGACCTGCG CTACGACCGC GCCGACGAGT TCGTCGAGGT CGCCCTGAAG CTGTGGAGCA GTTGGGCGCC GGGCGCGCTC AAGCTCGACC GCGAGCAAGG CATCCTCGCC GACCCGGCCG GGGTGCGCGA CGTCGCCCAC CGGGGCGAGT GGTTCCGCGT CGAGGGGCCG CTGAACATCC CGCGCACCCC GCAGGGCCGC CCGGTGCTGA TCCAGGCCGG CTCCAGCGGT CGCGGCCGGC GCTTCGGCGC GCGCTGGGCC GAGGCCATCT TCAGCATCCA CGGCAACCTG CCGGCGATGC GCGCCTTCCG CGACGACGTG CGCCAGCAGG TGGTCGCCCA GGGCCGCGCG GCGGAGCAGT GCAAGATCCT CACCGCGGTG ATGCCCTTCA TCGGCGGCAG CGAGGCCGAG GCGCGCGCCA AGCGCGACCG CCACAACGCC CTGGCGCACC CGCAGATCGG CCTGGCGACG CTGTCGGCAC AGCTCAACTT CGACTTCTCC GCCTTCGCCC CGGACAGCCG CCTGGAAAGC ATCGCCGCGC ACCCCGAAAC GCCGCCGGCG GTCGCCGAGA AACTGCGCTC GCTGGGCGCC TCGCTGAGCC TCGGCGAACT GGCGCAGACC TTCGCCAGCA GCGTGCGCGT GCCGCAACTG GTGGGCACCG GCGTGCAGAT CGCCGAGCAA CTGGCGGAGA TCTTCAACGC CGGCGGCTGC GACGGCTTCG TGATTTCCCC CGGCTACCTG CCGGGCTCCT TCGCCGAATT CGCCGAGAGC GTGGTGCCGC ACCTGCAGCG CCTGGGCCTC TACCGCCGCG CCTACGAGGG CGCGACCCTG CGCGAACACC TGGGCCTCGG CCCCCTGGAG GCCTGA
|
Protein sequence | MSRQIHLAGF LLAGPVVHSH AAWRHPETQG NFLEPEYYAR SAKVLEEGLF DLLFFADRFA IGDQLGGSRE LALRHGAQDA TRLDPLPILS YLVGQTSRLG LGVTRSTTYY EPVHVARSLA TLDHLSRGRA AWNIVTSMND SEGRLFGKDR HLEHDLRYDR ADEFVEVALK LWSSWAPGAL KLDREQGILA DPAGVRDVAH RGEWFRVEGP LNIPRTPQGR PVLIQAGSSG RGRRFGARWA EAIFSIHGNL PAMRAFRDDV RQQVVAQGRA AEQCKILTAV MPFIGGSEAE ARAKRDRHNA LAHPQIGLAT LSAQLNFDFS AFAPDSRLES IAAHPETPPA VAEKLRSLGA SLSLGELAQT FASSVRVPQL VGTGVQIAEQ LAEIFNAGGC DGFVISPGYL PGSFAEFAES VVPHLQRLGL YRRAYEGATL REHLGLGPLE A
|
| |