Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_21340 |
Symbol | |
ID | 7761059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2135356 |
End bp | 2136708 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643805029 |
Product | xenobiotic compound monooxygenase, DszA family, A subunit |
Protein accession | YP_002799310 |
Protein GI | 226944237 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.341615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTGG GCGCCTTTCT CTATCCCACC GGACACCATG CGGCCGCCTG GCGCCATCCC CAGGCACAGG CGGACGCCGG CATCAATTTC GAGCACTACG TCGGCCTGGC GCAGACCGCC GAGCGGGCCG GCTTCGACCT GCTGTTCCTC GCCGACAGCG CCGCCGCGCG CGGCAAGGAC TGGGTCGCCC TGTCGCGTTT CGCCACCCAT TACGTGGCGC AGTTCGAACC CCTGACGCTG ATCTCGGCGC TCGCCGCGGT GACCCGGCGG ATCGGCCTGG TGGCCACCGC CTCGACCACC TACAACGATC CCTACAGCCT GGCGCGCAAG CTCGCCTCCA TCGACCACAT CAGCCACGGT CGGGTGGGCT GGAACCTGGT CACCTCGGGC AACGAGGAGG AAGCCTTCAA CTTCGGCGCC ACGGCCTATC CGCCGCATGC CGAGCGTTAC CGCCGGGCCC GCGAGTTCGC CCAGGTGCTC AAGGGCCTGT GGGACAGTTG GGACGACGAT GCCTTCCCGC GCGACAAGGC CAGCGGTCTG TTCCTGGACG TCGACAAGAT GCATGTGCTC GGTCACCAGG GCGAGCATTT CAGCGTGCGC GGCCCGCTGA ACATCCCGCG TTCGCCCCAG GGCTGGCCGG TGCTGGTGCA GGCCGGTTCG TCCGAGGCCG GCAAGGAGCT GGCCGCGGAA ACCGCCGAGG TGGTGTTCAC CGCCCAGCGC ACGCTGGCCG ACGCGCAGGC TTTCTATGCG GACGTGAAAG GACGCATGGC GCGTTACGGG CGCCAGCCCT CGGCGCTGAA GATCATGCCG GGGATCTTCC CGGTGGTGGC CGCCACGGAG GCCGAGGCCC AGGCCAAGTT CGAGGCGCTA CAGGACTTGA TCCATCCCTC GGTGGCGCTG GCCATCCTCG AACACCGCCT GGGCGTGCCG CTGGCGCACC TGCCCCAGGA CGGGCCGCTG CCGGAAATCG CCGAGGTCGA ATCGACCCGC AGTCGCCGCG CCTTGCTGCT GGAGGTGGCC AGGCGCGAGG GCCTGAGCAT TCGCCAACTG GCGCTGCGGG TGGCCGGCGC GCGCGGCCAC TGGCAGGTGG TCGGCTCCGG CGAACAGGTC GCCGATGCCA TGCAGGAACT GTTCGAGAAC GGCGGCGCCG ACGGCTTCAA TGTCATGTCC CCTTATTTGC CGGGAGGCCT GGACGATTTC GTCGAGCATG TCCTGCCGGT ATTGCGCGAG CGCGGCCTGC TGCGCGGCGA ATACCAGGGC ACGACCCTGC GCGAGCATCT CGGCCTGCAA CGTCCGCCGT CGCCGCACCT GCGCGCGACC GCCGGCCTTT CCAGCCACGA ATTGACTGCC TGA
|
Protein sequence | MALGAFLYPT GHHAAAWRHP QAQADAGINF EHYVGLAQTA ERAGFDLLFL ADSAAARGKD WVALSRFATH YVAQFEPLTL ISALAAVTRR IGLVATASTT YNDPYSLARK LASIDHISHG RVGWNLVTSG NEEEAFNFGA TAYPPHAERY RRAREFAQVL KGLWDSWDDD AFPRDKASGL FLDVDKMHVL GHQGEHFSVR GPLNIPRSPQ GWPVLVQAGS SEAGKELAAE TAEVVFTAQR TLADAQAFYA DVKGRMARYG RQPSALKIMP GIFPVVAATE AEAQAKFEAL QDLIHPSVAL AILEHRLGVP LAHLPQDGPL PEIAEVESTR SRRALLLEVA RREGLSIRQL ALRVAGARGH WQVVGSGEQV ADAMQELFEN GGADGFNVMS PYLPGGLDDF VEHVLPVLRE RGLLRGEYQG TTLREHLGLQ RPPSPHLRAT AGLSSHELTA
|
| |