Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_02770 |
Symbol | vnfE |
ID | 7759237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 260573 |
End bp | 261982 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643803201 |
Product | nitrogenase vanadium iron cofactor biosynthesis protein vnfE |
Protein accession | YP_002797512 |
Protein GI | 226942439 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.687288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAGA CCGAAATCCA GAACCTGCTC GACGAGCCAG CCTGCACCCA CAACACGGCG GGCAAGACCG GCTGCTCGCG CTCGCGCCCG GGAGCCACCC AGGGCGGCTG CGCCTTCGAC GGCGCGCAGA TCGCCATCCT GCCGATCGCC GACGCCGCGC ACATCGTCCA CGGTCCGATC GGCTGTGCCG GCAGTTCCTG GGACCTGCGC GGCAGCAATT CCTCCGGCCC ACAGTTGTAC CGCCTGGGCA TGACCACCGA ACTGTCCGAC GTCGACGTGA TCATGGGCCG CGGCGAGAAA AAGCTGTTCC ACGCCATCCG CCGTGCAGTC GAGCGCTACC AGCCGCAGGC AGTGTTCGTC TACGGCACCT GCGTGCCGGC GATGCAGGGC GACGACATCG AAGCGGTGGC CCGCGACGCC AGCCAGCGCT GGGGCGTGCC GGTGATTCCG GTGGACGGCG CCGGCTTCTA CGGCACCAAG AGCCTGGGCA ACCGCATCGC CGGCGAAACC CTCTACCGCC ACGTCATAGG TACCCGCGAA CCGGCGCCGC TGCCGCAAGG CGCCGTCGGC CACGGCATCA CGGTGCACGA CGTCAACCTG ATCGGCGAAT ACAACATCGC CGGCGAGTTC TGGCGCGTCG CGCCGCTGTT CGACGAACTC GGCCTGCGCA TTCTCTGCAC CCTGTCCGGC GACGCGCGCT TTCGCGAGGT CCAGACCATG CACCGCGCCG AAGCCAACAT GGTGGTCTGC TCCAAGGCCA TGCTCAACGT CGCCCGCCAC CTGCGCGAGG ACTACGGCAC GCCGTTCTTC GAGGGCAGCT TCTACGGTAT CGCCGATACC TCCCAGGCCC TGCGCGACTT CGCCAAGGCG ATCGGCGACC CGTCGCTGTC GGTACGCACC GAACTGCTGA TCCTGCGCGA GGAAAACAGG GCCAGGGCGG CGCTCGAACC CTGGCGCGAG CGACTGGCCG GCAAGCGCGC GCTGATCTTC TCCGGCGGCG TGAAATCCTG GTCGGTGGTT TCGGCGCTGC AGGACCTCGG CGTCGAGGTG ATCGCCACCG GCACCGAGAA ATCCACCGAG GAAGACCGCG CGCGCATCCG CGAGCTGATG GGGCCGAACG CCCGGATGAT CGACGACAAC GACCAGAGCG CGCTGATCGC CACCTGCATC GAGAGCGGCG CCGACATCCT CATCGCTGGC GGACGCTACC TGTACGCCGC GCTCAAGGCG CGCCTGGCGT TCCTCGACAT CAACCACGAA CGCGACTTCG GCTACGCCGG CTACGGCGGT TTCGTCGAAC TGGCCCGCCA GTTGGCGCTG GCCGTGCACA GCCCGGTATG GCAGCGCGTG CGCCAGGAGC CGCGCTGGGT ACGCGCCAGC ACCCGGGCCG CGCTGCTGGA GGAGGCCTGA
|
Protein sequence | MNQTEIQNLL DEPACTHNTA GKTGCSRSRP GATQGGCAFD GAQIAILPIA DAAHIVHGPI GCAGSSWDLR GSNSSGPQLY RLGMTTELSD VDVIMGRGEK KLFHAIRRAV ERYQPQAVFV YGTCVPAMQG DDIEAVARDA SQRWGVPVIP VDGAGFYGTK SLGNRIAGET LYRHVIGTRE PAPLPQGAVG HGITVHDVNL IGEYNIAGEF WRVAPLFDEL GLRILCTLSG DARFREVQTM HRAEANMVVC SKAMLNVARH LREDYGTPFF EGSFYGIADT SQALRDFAKA IGDPSLSVRT ELLILREENR ARAALEPWRE RLAGKRALIF SGGVKSWSVV SALQDLGVEV IATGTEKSTE EDRARIRELM GPNARMIDDN DQSALIATCI ESGADILIAG GRYLYAALKA RLAFLDINHE RDFGYAGYGG FVELARQLAL AVHSPVWQRV RQEPRWVRAS TRAALLEEA
|
| |