Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_48990 |
Symbol | anfD |
ID | 7763758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4958633 |
End bp | 4960189 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643807739 |
Product | nitrogenase iron-iron protein, alpha chain |
Protein accession | YP_002801974 |
Protein GI | 226946901 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01284] nitrogenase alpha chain [TIGR01861] nitrogenase iron-iron protein, alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.546352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCATC ACGAGTTCGA GTGCAGCAAG GTTATTCCCG AGCGGAAGAA GCATGCCGTT ATCAAAGGTA AAGGCGAAAC GCTGGCCGAC GCCCTGCCTC AAGGGTATCT GAATACCATC CCTGGTTCCA TCTCCGAGCG TGGTTGTGCC TACTGTGGTG CCAAGCACGT TATCGGGACT CCCATGAAGG ATGTGATTCA CATCAGTCAT GGCCCGGTCG GCTGCACTTA CGATACCTGG CAGACCAAGC GTTATATCAG CGACAACGAC AACTTCCAGC TCAAATACAC CTATGCCACC GATGTGAAGG AAAAGCATAT CGTGTTCGGC GCCGAGAAGT TGCTGAAGCA GAACATCATC GAAGCCTTCA AGGCGTTCCC GCAGATCAAG CGGATGACCA TCTACCAGAC CTGCGCCACG GCGCTGATCG GAGACGACAT CAACGCCATC GCCGAAGAGG TGATGGAAGA GATGCCGGAG GTGGATATCT TCGTCTGCAA CTCGCCCGGT TTCGCCGGTC CGAGCCAGTC CGGTGGTCAC CACAAGATCA ACATCGCCTG GATCAACCAG AAGGTGGGTA CCGTCGAGCC GGAGATCACC GGCGACCATG TGATCAACTA TGTGGGCGAG TACAACATTC AGGGCGACCA GGAAGTGATG GTGGATTACT TCAAGCGCAT GGGTATCCAG GTGCTATCCA CTTTCACCGG CAACGGTTCC TACGACGGCC TGCGTGCCAT GCACAGAGCC CATCTGAACG TACTGGAATG TGCCCGCTCC GCCGAGTACA TCTGCAACGA ACTGCGTGTC CGTTACGGCA TTCCGCGTCT GGATATCGAC GGTTTCGGTT TCAAGCCACT GGCGGATTCG CTGCGTAAGA TCGGTATGTT CTTCGGCATC GAAGACCGTG CCAAGGCCAT CATCGACGAG GAAGTCGCCC GCTGGAAGCC GGAGTTGGAC TGGTACAAGG AGCGGCTGAT GGGCAAGAAG GTCTGCCTGT GGCCGGGCGG TTCCAAACTC TGGCACTGGG CCCATGTGAT CGAGGAAGAA ATGGGCCTCA AGGTGGTGTC GGTCTATACC AAGTTCGGCC ATCAGGGCGA CATGGAGAAA GGCATCGCCC GTTGCGGCGA AGGCACTTTG GCCATCGACG ACCCGAACGA ATTGGAAGGT CTGGAAGCCC TGGAGATGCT CAAGCCCGAC ATCATCCTGA CCGGCAAGCG TCCGGGTGAA GTGGCCAAGA AAGTCCGGGT TCCCTACCTG AACGCCCACG CCTACCACAA CGGCCCGTAC AAAGGCTTCG AAGGTTGGGT GCGTTTCGCC CGCGATATTT ACAACGCCAT CTACTCGCCG ATCCATCAGC TCTCCGGTAT CGACATCACT AAAGACAATG CACCGGAGTG GGGTAATGGT TTCCGTACTC GCCAAATGCT GTCCGATGGC AACTTGAGCG ATGCAGTACG TAACTCGGAA ACCTTGCGCC AGTACACCGG CGGCTACGAC AGCGTGAGCA AGCTGCGCGA ACGGGAATAT CCCGCCTTCG AGCGCAAGGT CGGCTGA
|
Protein sequence | MPHHEFECSK VIPERKKHAV IKGKGETLAD ALPQGYLNTI PGSISERGCA YCGAKHVIGT PMKDVIHISH GPVGCTYDTW QTKRYISDND NFQLKYTYAT DVKEKHIVFG AEKLLKQNII EAFKAFPQIK RMTIYQTCAT ALIGDDINAI AEEVMEEMPE VDIFVCNSPG FAGPSQSGGH HKINIAWINQ KVGTVEPEIT GDHVINYVGE YNIQGDQEVM VDYFKRMGIQ VLSTFTGNGS YDGLRAMHRA HLNVLECARS AEYICNELRV RYGIPRLDID GFGFKPLADS LRKIGMFFGI EDRAKAIIDE EVARWKPELD WYKERLMGKK VCLWPGGSKL WHWAHVIEEE MGLKVVSVYT KFGHQGDMEK GIARCGEGTL AIDDPNELEG LEALEMLKPD IILTGKRPGE VAKKVRVPYL NAHAYHNGPY KGFEGWVRFA RDIYNAIYSP IHQLSGIDIT KDNAPEWGNG FRTRQMLSDG NLSDAVRNSE TLRQYTGGYD SVSKLREREY PAFERKVG
|
| |