Gene Avin_01450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_01450 
SymbolnifE 
ID7759110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp143285 
End bp144712 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content68% 
IMG OID643803069 
ProductNitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_002797385 
Protein GI226942312 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCCA AGGATATTGC CGAACTGCTC GACGAGCCCG CCTGCAGTCA CAACAAGAAG 
GAAAAGTCCG GCTGCGCCAA GCCCAAGCCG GGCGCCACCG ACGGTGGCTG CTCCTTCGAC
GGCGCGCAGA TCGCCCTGCT GCCCGTCGCC GACGTGGCGC ATATCGTTCA CGGGCCGATC
GCTTGCGCCG GCAGTTCCTG GGACAACCGC GGCACCCGCT CCAGCGGGCC GGACCTGTAC
CGCATCGGCA TGACCACCGA TCTCACCGAG AACGACGTGA TCATGGGGCG CGCCGAGAAG
CGCCTGTTCC ATGCCATCCG CCAGGCGGTG GAAAGCTATT CGCCGCCGGC GGTGTTCGTC
TACAACACCT GCGTGCCGGC GCTGATCGGC GACGACGTCG ACGCAGTGTG CAAAGCCGCC
GCCGAGCGCT TCGGCACCCC GGTCATCCCG GTCGACTCGG CCGGCTTCTA CGGCACCAAG
AACCTCGGCA ACCGCATCGC CGGTGAGGCC ATGCTCAAGT ACGTGATCGG CACCCGCGAG
CCCGATCCGC TGCCCGTCGG CAGCGAGCGT CCGGGCATCC GCGTGCACGA CGTCAACCTG
ATCGGCGAGT ACAACATCGC CGGCGAGTTC TGGCATGTCC TGCCGCTGCT CGACGAACTG
GGCCTGCGGG TGCTCTGCAC CCTGGCCGGC GATGCGCGCT ACCGCGAGGT GCAGACCATG
CACCGCGCCG AAGTGAACAT GATGGTCTGC TCCAAGGCCA TGCTCAATGT CGCTCGCAAG
CTGCAGGAAA CCTACGGCAC GCCCTGGTTC GAGGGCAGCT TCTACGGCAT CACCGACACC
TCCCAGGCGC TGCGCGACTT CGCCCGGCTG CTCGATGATC CCGACCTGAC CGCCCGCACC
GAGGCGCTGA TCGCGCGCGA GGAGGCCAAG GTCCGCGCCG CCCTCGAACC CTGGCGTGCG
CGTCTGGAGG GCAAGCGCGT GCTGCTCTAC ACCGGCGGCG TGAAGTCCTG GTCGGTGGTT
TCCGCCCTGC AGGACCTGGG CATGAAGGTG GTCGCCACCG GCACCAAGAA GTCCACCGAG
GAAGACAAGG CACGCATCCG CGAACTGATG GGCGACGACG TCAAGATGCT CGACGAGGGC
AATGCGCGGG TGCTGCTGAA GACCGTCGAC GAGTACCAGG CCGACATCCT CATCGCCGGC
GGACGCAACA TGTACACCGC GCTCAAGGGC CGCGTGCCCT TCCTCGACAT CAACCAGGAG
CGCGAATTCG GCTATGCCGG CTACGACGGC ATGCTGGAAC TGGTGCGTCA GCTCTGCATC
ACCCTGGAAT GCCCGGTGTG GGAGGCGGTG CGCCGCCCCG CGCCCTGGGA CATCCCCGCC
AGCCAGGACG CCGCGCCGAG CGCGCCGGCC CGTTCGGCGA ACGCCTGA
 
Protein sequence
MKAKDIAELL DEPACSHNKK EKSGCAKPKP GATDGGCSFD GAQIALLPVA DVAHIVHGPI 
ACAGSSWDNR GTRSSGPDLY RIGMTTDLTE NDVIMGRAEK RLFHAIRQAV ESYSPPAVFV
YNTCVPALIG DDVDAVCKAA AERFGTPVIP VDSAGFYGTK NLGNRIAGEA MLKYVIGTRE
PDPLPVGSER PGIRVHDVNL IGEYNIAGEF WHVLPLLDEL GLRVLCTLAG DARYREVQTM
HRAEVNMMVC SKAMLNVARK LQETYGTPWF EGSFYGITDT SQALRDFARL LDDPDLTART
EALIAREEAK VRAALEPWRA RLEGKRVLLY TGGVKSWSVV SALQDLGMKV VATGTKKSTE
EDKARIRELM GDDVKMLDEG NARVLLKTVD EYQADILIAG GRNMYTALKG RVPFLDINQE
REFGYAGYDG MLELVRQLCI TLECPVWEAV RRPAPWDIPA SQDAAPSAPA RSANA