Gene Information |
Locus tag | Avin_21740 |
Symbol | |
ID | 7761094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2172716 |
End bp | 2174044 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643805062 |
Product | xenobiotic compound monooxygenase, DszA family, A subunit |
Protein accession | YP_002799343 |
Protein GI | 226944270 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
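The coordinate and length fields in the Gene Information table are mutually consistent, and a short sketch can check this. It assumes 1-based inclusive coordinates (which the reported 1329 bp length implies) and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(2172716, 2174044)
print(length, protein_length(length))  # 1329 442
```

Both values match the Gene Length (1329 bp) and Protein Length (442 aa) fields in the record.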
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.77772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCCAAT CCTCTCGTCA ACTCAGTCTC GGCGCCTTCC TCATGGCCAC CGGGCATCAT GTGGCCGCCT GGCGGCATCC GGATGTGCCG GCCGATCCCC TGGATTTTTC CGTTTACCGG CGTGCCGCGC AGGTCGCCGA AGCGGCCTGC TTCGATGCGC TGTTCGTCGC CGACAGCGTC GCCGTCTTCG ACGACAAGGT TGCCAGCCAT ACCTCGCGCT CGACCTATTT CGAGCCCCTC ACCCTGCTCT CGGCGCTCGC TGCGATCACC GAGCGGATCG GTCTGGTCGG CACGGCGACC ACCACCTATA ACGAGCCCTA CCACGTGGCG CGCAAGTTCG CCTCGCTGGA TCACCTGTCG GGCGGGCGCT GCGGCTGGAA CCTGGTGACC TCCGACGCCG CCGCCGAAGC ACAGAACTTC GGCCGCGACG AACATCTCGG GCATGCCGAA CGCTATGTCC GGGCACACGA GTTCCACCGC GTGGTCACCG GCCTGTGGGA CAGTTGGGCG GATGACGCCT TCCTGCGCGA CAAGGCCAGC GGGCGCTTCT ACGACCCGGA CAAGCTGCAT GTGCTTGATC ACCGGGGCGA GCATTTCCAG GTGCGCGGTC CGCTCAATGT GGCGCGTTCG CCTCAAGGAC GCCCGGTGGT CGTGCAGGCC GGCTCCTCCG AAGCCGGCCG CGAGCTGGCC GCCGAGACCG CCGAGATGGT GTTCACCGCA CAGGCTTCGC TGGAATCGGC GAAAACCTTT TACGCCGACC TCAAGGGACG GCTTGGCAGG TTCGGCCGCG ACGAGAACGC GCTGCGCATC ATGCCTGGCG TCTTCGTGGT CGCCGGGCAG AGCGAGAGCG AGGCCAAAGA GAAGTTCGAG GCCTTCCAGG AGTTGGTCGA GCCGCAGGTG GGCGTCGCCC TGCTCGGGCG CATGCTGGGT AACTTCGACC TGTCCGAATA CCCGCTGGAC GGCCCGCTGC CGGAACTGCC GCTCACCGAC AGCGGCCAGC GCAGCCGCCA GCAGCTTTTG ACCGAACTGG CGGGCGCAGA AAACCTGACT CTGGCCCAGT TGGGCCGGCG CATCGCCGGC GGGCGCGGCC ACTACAGCCT GATCGGCACG CCTCGGCAGA TCGCCGACGA GCTACAGGCC TGGTTCGAGG AGCGCGCTGC CGACGGTTTC AATGTGCTGG TGCCGCACCT GCCGGGCGGT CTGGAAGACT TCGCCGCACT GGTGGTACCG GAACTGCAGC GGCGCGGCCT GTTCCGGCGC GAATACCGGG GCCGCACTCT GCGCGAACAC CTCGGCTTGG CGCGCCCGAA AAGCCCCTTC TTCGCCTGA
Protein sequence | MSQSSRQLSL GAFLMATGHH VAAWRHPDVP ADPLDFSVYR RAAQVAEAAC FDALFVADSV AVFDDKVASH TSRSTYFEPL TLLSALAAIT ERIGLVGTAT TTYNEPYHVA RKFASLDHLS GGRCGWNLVT SDAAAEAQNF GRDEHLGHAE RYVRAHEFHR VVTGLWDSWA DDAFLRDKAS GRFYDPDKLH VLDHRGEHFQ VRGPLNVARS PQGRPVVVQA GSSEAGRELA AETAEMVFTA QASLESAKTF YADLKGRLGR FGRDENALRI MPGVFVVAGQ SESEAKEKFE AFQELVEPQV GVALLGRMLG NFDLSEYPLD GPLPELPLTD SGQRSRQQLL TELAGAENLT LAQLGRRIAG GRGHYSLIGT PRQIADELQA WFEERAADGF NVLVPHLPGG LEDFAALVVP ELQRRGLFRR EYRGRTLREH LGLARPKSPF FA
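The sequences in this record are printed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below strips the spacing, computes a GC fraction, and translates a coding sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). Although the record lists the strand as "-", the displayed gene sequence starts with ATG and its first ten codons translate to the start of the listed protein, so the sketch assumes it is already given in coding orientation.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAGCCAAT CCTCTCGTCA ACTCAGTCTC"
print(translate(prefix))  # MSQSSRQLSL
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick way to confirm that the two fields agree. Running `gc_fraction` on the full sequence can be compared with the reported GC content of 67%.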