Gene Information |
Locus tag | Avin_22400 |
Symbol | dszA |
ID | 7761158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2239883 |
End bp | 2241244 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643805125 |
Product | xenobiotic compound monooxygenase, DszA family, A subunit protein |
Protein accession | YP_002799406 |
Protein GI | 226944333 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
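
The length fields above follow from the coordinates by simple arithmetic. A minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2239883
end_bp = 2241244

# Inclusive interval: both end points count toward the length.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; the last codon is assumed to be a stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1362 bp
print(protein_length, "aa")  # 453 aa
```

Both results agree with the Gene Length and Protein Length rows.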
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAC GCCAGCTTGC GCTGAACGCC TTCATCCTGC CGACCGGCCA CCACATCGCC GCCTGGCGCC GCGCGGACGT GCCGGAGCAC GCCAACCACG ATTTCGGCGA ATACGTCCGC GTGGCGCGCC TGGCCGAGGC GGCCAAGTTC GACGCGGTGT TCGTCTCCGA CTCGGTGGCG GTGAACGCGG GCAACGGTGG CAGCGGCCTG GAGGAACTGG CGCTGACCGC CCGCGGTACC CGGCTGGAAC CCTTTACCCT GCTCTCGGGG CTGGCCGCGG TGACCGAGCG CATCGGTCTG ATCGCCACCG TCAGCACCAC CTACAACGAG CCCTATCACC TGGCCCGCAA GTTCGCCTCG CTGGACTGGA TCTCCAGGGG ACGCGCCGGC TGGAACCTGG TGACCTCTTC GGCGCCGGCC GAGGCGCTGA ACTTCAACCA GAGCGAGCTG CCCGAACACG GCGAGCGCTA CCGGCGCGCC GAGGAGTTCC TCGAGGTGGT GCAGGGCCTG TGGGATAGCT GGGAAGACGA CGCCTTCGTC CGCGACAAGG CCAGCGGCCA GTTCTTCGAT CCGGCCAGGC TGCACGTGCT GGACCACCAG GGCGAGCACT ACCGGGTGCG CGGTCCGCTG AACGTGGCGC GCCCGCCGCA GGGCTATCCG GTGCTGGTGC AGGCCGGCTC CTCCGAGCCG GGCAAGGAAA TCGCCGCGCG CAGCGCCGAG GCGATCTTCA CCGCACACCA GACCCTGGAA AGCGCCCAGG CCTTCTACGC CGATGTGAAA GGCCGTCTCG CCAAGTACGG CCGCGCGCCC GAGGAGCTGA AGATCCTCCC GGGCATTTTC CCGGTGGTCG GCCGCAGCGA GGCCGAAGCC CAGGCCAAGT ACCGCGAACT GCAGGAACTG ATCGACCCGC GGGTCGGCGT CAACCTGCTC AGCGCCCTGA TCGGCAGCGA CCTGTCCGGC TTCGATGTCG ACGGCCCACT GCCCGAGCTG GCGACCACCA ACCAGAACAA GAGCCGCCAG GCCCTGCTGC TCGAACTGGC GCACCGCGAG AACCTGACCA TCCGCGAACT CTACCTGCGC ATCGCCGGCG CCCGCGGCCA CTGGACGGTG GTCGGCACCC CGACGCAGAT CGCCGACCAG ATCCAGACCT GGTTCGAGCA GGGCGCGGCC GACGGCTTCA ACGTGCTGGC GCCCTGGCTG CCCGGCGGCC TGGAGGAGTT CATCGACCAG GTGGTGCCGG AACTGCGCCG CCGCGGCCTG TTCCGCGAGG AGTACAGCGG CGCCACCCTG CGCGAGCACC TGGGCCTGGC CCGTCCGGAA AACCAGCATG CCGCCGGGCG CCAGCGGCGC GCGGCGAGCT GA
Protein sequence | MSKRQLALNA FILPTGHHIA AWRRADVPEH ANHDFGEYVR VARLAEAAKF DAVFVSDSVA VNAGNGGSGL EELALTARGT RLEPFTLLSG LAAVTERIGL IATVSTTYNE PYHLARKFAS LDWISRGRAG WNLVTSSAPA EALNFNQSEL PEHGERYRRA EEFLEVVQGL WDSWEDDAFV RDKASGQFFD PARLHVLDHQ GEHYRVRGPL NVARPPQGYP VLVQAGSSEP GKEIAARSAE AIFTAHQTLE SAQAFYADVK GRLAKYGRAP EELKILPGIF PVVGRSEAEA QAKYRELQEL IDPRVGVNLL SALIGSDLSG FDVDGPLPEL ATTNQNKSRQ ALLLELAHRE NLTIRELYLR IAGARGHWTV VGTPTQIADQ IQTWFEQGAA DGFNVLAPWL PGGLEEFIDQ VVPELRRRGL FREEYSGATL REHLGLARPE NQHAAGRQRR AAS
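
The two sequence rows can be checked against each other by translating the gene sequence. The sketch below is illustrative only: it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first 30 and last 12 bases of the gene sequence are used to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string codon by codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 12 bases of the gene sequence, copied from the record.
head = "ATGAGTAAAC GCCAGCTTGC GCTGAACGCC"
tail = "GCGGCGAGCT GA"

head_protein = translate(head)
tail_protein = translate(tail)
head_gc = gc_fraction(head)

print(head_protein)  # MSKRQLALNA
print(tail_protein)  # AAS*
```

The translated head matches the first ten residues of the protein sequence, and the tail ends in a stop codon after the final AAS. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.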