Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_21890 |
Symbol | |
ID | 7761107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2185363 |
End bp | 2186694 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805074 |
Product | monooxygenase, NtaA/SnaA/SoxA family |
Protein accession | YP_002799355 |
Protein GI | 226944282 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACGG GACAACTCAA GCTGGGCACC ATGATCCATG GGGTCGGCCA TGGCTGGGGA GAATGGCGGC ACCCCGAGGC CCTGGCCGAC GCCAGCGTCA ATTTCGAGTT CTACCGGCAG CAGGCCCAGG TCGCCGAGGC GGGCAAGTTC GACTTCGTGT TCATCGCCGA CAGCCTGCAC ATCCACGAGA AATCCAGTCC GCACTACCTC AACCGCTTCG AGCCCCTGAC CATCCTCTCG GCGCTGGCGG CGGTGACCCG GCACGTCGGC CTGGTGGGCA CCGTCACGGT CAGCTACAGC GAGCCCTTCA ACGTCGCCCG CCAGTTCGCC TCGCTCGACC ACATCAGCGG CGGGCGGGCC GGCTGGAACG TGGTGACCTC CTGGCTGTCC GGCACGGCGG ACAATTTCGG CAGGGCCGAG CATCCGGCGC ATGCCGTGCG CTACCGGATC GCCAGGGAAC ATGTCGGGGT CGTGAAAGGA CTGTGGGACT CCTGGGAGGA CGATGCCTTC GTCCGCGACA AGGCGAGCGG CGAATTCTTC GCTCCCGGCA AGCTGCACGC GTTGAACCAC CAGGGCGAGT TCTTTGCCGT CAAGGGTCCC TTGAACATCG CCCGTTCGCG CCAGGGGCAG CCGGTCATCT TCCAGGCCGG CATCTCCGAG GCCGGCCGCG ATTTCGCGGC GCAGAACGCC GACGCGGTCT TCACCAACCC GGGCTCGTTC GACGAAGCCC AGGCCTTCTA CCGCGACCTC AAGGCGCGTG CCGCCGCTCA CGGCCGCGAC CCGCGGGAAC TCTCGATCCT GCCGGGCATC AGCCCGATCG TCGGACGCGA TCCCGTCGAG GTCGAGCGGC GTTACCGGCA GGCCGTCGAC CTGGTGTCCA TCGAGGACGC CCTCGTCGCC CTCGGCCGCC CGTTCGACGA CCACGATTTC TCGCGCTACC CGCTCGACGA GCCCTTCCCC GATATCGACG ACGGCGACGA CAGCCATAAA GGCAGCGCCG ACCGCATCAG GCGAGTCGCC CGCGAAGAAG GACTGAGCCT GCGCGAGGTG GCGCTGCGCT TCGCCCTGCC CGACCGGACC TTCGCCGGCA CTCCCGAGCA GGTCGCCGAC ACCCTGCAGC ACTGGTTCGA GAAGGACGCG GCGGATGGTT TCATCGTCAG GTCGCTGCTG CCGGACGGCC TGGAGCATTT CGTCGAGCTG GTCGTGCCGG TCCTGCAGGC GCGCGGCCTG TTCCGCCGGG AATACAGCGG CCGGACCCTG CGCGACAACC TGCGTCTGCC GGTGCCGGCG AACCGCTACA GCGTGCGCGA CGAAGCGGGA GCGGCGAGAT GA
|
Protein sequence | MSTGQLKLGT MIHGVGHGWG EWRHPEALAD ASVNFEFYRQ QAQVAEAGKF DFVFIADSLH IHEKSSPHYL NRFEPLTILS ALAAVTRHVG LVGTVTVSYS EPFNVARQFA SLDHISGGRA GWNVVTSWLS GTADNFGRAE HPAHAVRYRI AREHVGVVKG LWDSWEDDAF VRDKASGEFF APGKLHALNH QGEFFAVKGP LNIARSRQGQ PVIFQAGISE AGRDFAAQNA DAVFTNPGSF DEAQAFYRDL KARAAAHGRD PRELSILPGI SPIVGRDPVE VERRYRQAVD LVSIEDALVA LGRPFDDHDF SRYPLDEPFP DIDDGDDSHK GSADRIRRVA REEGLSLREV ALRFALPDRT FAGTPEQVAD TLQHWFEKDA ADGFIVRSLL PDGLEHFVEL VVPVLQARGL FRREYSGRTL RDNLRLPVPA NRYSVRDEAG AAR
|
| |