Gene Information |
Locus tag | Avin_01450 |
Symbol | nifE |
ID | 7759110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 143285 |
End bp | 144712 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643803069 |
Product | Nitrogenase MoFe cofactor biosynthesis protein NifE |
Protein accession | YP_002797385 |
Protein GI | 226942312 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
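The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only illustrative: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS span includes the stop codon, neither of which the record states explicitly.

```python
# Values copied from the Gene Information table.
START_BP = 143285
END_BP = 144712

# Assumption: the annotated CDS span includes one terminal stop codon.
STOP_CODON_NT = 3


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_NT) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1428, matching "Gene Length"
print(protein_length(length_bp))  # 475, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1428 bp) and Protein Length (475 aa) fields.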
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCCA AGGATATTGC CGAACTGCTC GACGAGCCCG CCTGCAGTCA CAACAAGAAG GAAAAGTCCG GCTGCGCCAA GCCCAAGCCG GGCGCCACCG ACGGTGGCTG CTCCTTCGAC GGCGCGCAGA TCGCCCTGCT GCCCGTCGCC GACGTGGCGC ATATCGTTCA CGGGCCGATC GCTTGCGCCG GCAGTTCCTG GGACAACCGC GGCACCCGCT CCAGCGGGCC GGACCTGTAC CGCATCGGCA TGACCACCGA TCTCACCGAG AACGACGTGA TCATGGGGCG CGCCGAGAAG CGCCTGTTCC ATGCCATCCG CCAGGCGGTG GAAAGCTATT CGCCGCCGGC GGTGTTCGTC TACAACACCT GCGTGCCGGC GCTGATCGGC GACGACGTCG ACGCAGTGTG CAAAGCCGCC GCCGAGCGCT TCGGCACCCC GGTCATCCCG GTCGACTCGG CCGGCTTCTA CGGCACCAAG AACCTCGGCA ACCGCATCGC CGGTGAGGCC ATGCTCAAGT ACGTGATCGG CACCCGCGAG CCCGATCCGC TGCCCGTCGG CAGCGAGCGT CCGGGCATCC GCGTGCACGA CGTCAACCTG ATCGGCGAGT ACAACATCGC CGGCGAGTTC TGGCATGTCC TGCCGCTGCT CGACGAACTG GGCCTGCGGG TGCTCTGCAC CCTGGCCGGC GATGCGCGCT ACCGCGAGGT GCAGACCATG CACCGCGCCG AAGTGAACAT GATGGTCTGC TCCAAGGCCA TGCTCAATGT CGCTCGCAAG CTGCAGGAAA CCTACGGCAC GCCCTGGTTC GAGGGCAGCT TCTACGGCAT CACCGACACC TCCCAGGCGC TGCGCGACTT CGCCCGGCTG CTCGATGATC CCGACCTGAC CGCCCGCACC GAGGCGCTGA TCGCGCGCGA GGAGGCCAAG GTCCGCGCCG CCCTCGAACC CTGGCGTGCG CGTCTGGAGG GCAAGCGCGT GCTGCTCTAC ACCGGCGGCG TGAAGTCCTG GTCGGTGGTT TCCGCCCTGC AGGACCTGGG CATGAAGGTG GTCGCCACCG GCACCAAGAA GTCCACCGAG GAAGACAAGG CACGCATCCG CGAACTGATG GGCGACGACG TCAAGATGCT CGACGAGGGC AATGCGCGGG TGCTGCTGAA GACCGTCGAC GAGTACCAGG CCGACATCCT CATCGCCGGC GGACGCAACA TGTACACCGC GCTCAAGGGC CGCGTGCCCT TCCTCGACAT CAACCAGGAG CGCGAATTCG GCTATGCCGG CTACGACGGC ATGCTGGAAC TGGTGCGTCA GCTCTGCATC ACCCTGGAAT GCCCGGTGTG GGAGGCGGTG CGCCGCCCCG CGCCCTGGGA CATCCCCGCC AGCCAGGACG CCGCGCCGAG CGCGCCGGCC CGTTCGGCGA ACGCCTGA
|
Protein sequence | MKAKDIAELL DEPACSHNKK EKSGCAKPKP GATDGGCSFD GAQIALLPVA DVAHIVHGPI ACAGSSWDNR GTRSSGPDLY RIGMTTDLTE NDVIMGRAEK RLFHAIRQAV ESYSPPAVFV YNTCVPALIG DDVDAVCKAA AERFGTPVIP VDSAGFYGTK NLGNRIAGEA MLKYVIGTRE PDPLPVGSER PGIRVHDVNL IGEYNIAGEF WHVLPLLDEL GLRVLCTLAG DARYREVQTM HRAEVNMMVC SKAMLNVARK LQETYGTPWF EGSFYGITDT SQALRDFARL LDDPDLTART EALIAREEAK VRAALEPWRA RLEGKRVLLY TGGVKSWSVV SALQDLGMKV VATGTKKSTE EDKARIRELM GDDVKMLDEG NARVLLKTVD EYQADILIAG GRNMYTALKG RVPFLDINQE REFGYAGYDG MLELVRQLCI TLECPVWEAV RRPAPWDIPA SQDAAPSAPA RSANA
|
| |
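The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before use. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and translation is simplified by using the standard amino acid assignments (which translation table 11 shares with table 1) without handling alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Simplification: alternative start codons are not converted to M.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last 18 bases of the gene sequence in this record.
head = clean_sequence("ATGAAAGCCA AGGATATTGC CGAACTGCTC")
tail = clean_sequence("CGTTCGGCGA ACGCCTGA")
print(translate(head))  # MKAKDIAELL
print(translate(tail))  # RSANA
```

The two printed fragments correspond to the first ten and last five residues of the protein sequence listed in the record. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.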