Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_43740 |
Symbol | |
ID | 7763247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4420357 |
End bp | 4421715 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643807229 |
Product | monooxygenase, NtaA/SnaA/SoxA/DszA family |
Protein accession | YP_002801470 |
Protein GI | 226946397 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.514983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGGA AACGCCTGAT TCTCAACGGC TTCGCGATGA ACACCGTGTC GCACGTCTAC CACGGACTGT GGCGCCACCC GGACAGCCAG CAGATTCATT TCAACGACCT GGAAACCTGG GTCGAGCTGG CGCAGTTGCT GGAGCGCAGC CATTTCGACG CGCTGTTTCT GGCCGACGTG ATCGGCATCG ACCCGTCCTA CCAGGGCAAC TGGGACACCT ACCTGCGCGG TGCGGTGCAG GTGCCGATCA ACGATTCCTC GACGCTGATC GCCGCGCTGA TCGGCGCCAC CCGCGACCTC GGTCTGGTCT TCACCAGTTC GATCCTCCAG GACCATCCGT TCAACTTCGC CCACCGCGCC TCGACCCTGG ATCACCTGAG CAAGGGGCGC GTCGGCTGGA ACATCGTCAC CAGCGTCAGC CACAACGCCG CGCAGAACTT CGGCTTCGAG CGCATCGTCG CCCACGACCG GCGCTACGCC TGGGCCGAGG AATACATGGA GGTGGTCTAC AAGCTGTGGG AAGGCTCCTG GGAGGAGGAT GCGGTGCGCG CCGACCGGCG CGCGGGGATC TACGCCGATC CGCTCAAGGT GCACCGCATC CACCACCAGG GCGAACGCTA CAAGGTCGCC GGACCGCATC TCAGCCAGCC TTCGCCGCAA CGCACGCCGG CGCTGTTCCA GGCCGGCGCC TCGTACGCCG GACGGGCCTT CGCCGCGCGC AACGCCGAAG CGACCTTCAT CGCCAGCCGC CACCCGGAAG GCGCGCGGCG GCTGATCGAG GACGTGCGCG GGCAGGTGCG GCGCGCCGGC CGGCGCGCCG ACGACCTGCT GTTCATCCAG GGGCTGTCGT TCGTCGTCGG CAGCAGCGAG GAAGAAGCGC AATGCAAGGC GCGGGAGCTG GACGAACTGC TCTGCGTCGA CGGGCTGGCC GCGCACATCA GCCGCGACCT CGGCATCGAC CTCGGCCTGC TGGAGCCCGA GCAGCCCATC GACGAGCTGG AGGTCGAGGG CGTGCAGGGC ATCCTGCGCT TCTTCGAGGA GGCCAATCCC GGCCAGCGCG CCACGGTGGC GGATCTGGCG CGCGCCTACG CCGGCACGCG CCTGGTCGGC TCGCCGGAGT CCATCGCCGA CGAGCTGGAG CGCTGGCAGG ACGCGGGGAT CGACGGCGTC AACGTCATCT ACCAGACCCT GCCCGGCACC TTCCGCGAGG TGGCCGAGCA ACTCATCCCC GAACTGCAGA AACGCGGCCT GGCGCAGCGC GAATACGCGC CGGGCACCCT GCGCGAGCGG CTGTTCCCCG GCCGTCCCGC GCATCTCAAC GAACGCCACC CGGCCGCCGC CCAGCGGCGC CGGGGGTGA
|
Protein sequence | MSRKRLILNG FAMNTVSHVY HGLWRHPDSQ QIHFNDLETW VELAQLLERS HFDALFLADV IGIDPSYQGN WDTYLRGAVQ VPINDSSTLI AALIGATRDL GLVFTSSILQ DHPFNFAHRA STLDHLSKGR VGWNIVTSVS HNAAQNFGFE RIVAHDRRYA WAEEYMEVVY KLWEGSWEED AVRADRRAGI YADPLKVHRI HHQGERYKVA GPHLSQPSPQ RTPALFQAGA SYAGRAFAAR NAEATFIASR HPEGARRLIE DVRGQVRRAG RRADDLLFIQ GLSFVVGSSE EEAQCKAREL DELLCVDGLA AHISRDLGID LGLLEPEQPI DELEVEGVQG ILRFFEEANP GQRATVADLA RAYAGTRLVG SPESIADELE RWQDAGIDGV NVIYQTLPGT FREVAEQLIP ELQKRGLAQR EYAPGTLRER LFPGRPAHLN ERHPAAAQRR RG
|
| |