Gene Avin_02770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_02770 
SymbolvnfE 
ID7759237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp260573 
End bp261982 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content69% 
IMG OID643803201 
Productnitrogenase vanadium iron cofactor biosynthesis protein vnfE 
Protein accessionYP_002797512 
Protein GI226942439 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.687288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAGA CCGAAATCCA GAACCTGCTC GACGAGCCAG CCTGCACCCA CAACACGGCG 
GGCAAGACCG GCTGCTCGCG CTCGCGCCCG GGAGCCACCC AGGGCGGCTG CGCCTTCGAC
GGCGCGCAGA TCGCCATCCT GCCGATCGCC GACGCCGCGC ACATCGTCCA CGGTCCGATC
GGCTGTGCCG GCAGTTCCTG GGACCTGCGC GGCAGCAATT CCTCCGGCCC ACAGTTGTAC
CGCCTGGGCA TGACCACCGA ACTGTCCGAC GTCGACGTGA TCATGGGCCG CGGCGAGAAA
AAGCTGTTCC ACGCCATCCG CCGTGCAGTC GAGCGCTACC AGCCGCAGGC AGTGTTCGTC
TACGGCACCT GCGTGCCGGC GATGCAGGGC GACGACATCG AAGCGGTGGC CCGCGACGCC
AGCCAGCGCT GGGGCGTGCC GGTGATTCCG GTGGACGGCG CCGGCTTCTA CGGCACCAAG
AGCCTGGGCA ACCGCATCGC CGGCGAAACC CTCTACCGCC ACGTCATAGG TACCCGCGAA
CCGGCGCCGC TGCCGCAAGG CGCCGTCGGC CACGGCATCA CGGTGCACGA CGTCAACCTG
ATCGGCGAAT ACAACATCGC CGGCGAGTTC TGGCGCGTCG CGCCGCTGTT CGACGAACTC
GGCCTGCGCA TTCTCTGCAC CCTGTCCGGC GACGCGCGCT TTCGCGAGGT CCAGACCATG
CACCGCGCCG AAGCCAACAT GGTGGTCTGC TCCAAGGCCA TGCTCAACGT CGCCCGCCAC
CTGCGCGAGG ACTACGGCAC GCCGTTCTTC GAGGGCAGCT TCTACGGTAT CGCCGATACC
TCCCAGGCCC TGCGCGACTT CGCCAAGGCG ATCGGCGACC CGTCGCTGTC GGTACGCACC
GAACTGCTGA TCCTGCGCGA GGAAAACAGG GCCAGGGCGG CGCTCGAACC CTGGCGCGAG
CGACTGGCCG GCAAGCGCGC GCTGATCTTC TCCGGCGGCG TGAAATCCTG GTCGGTGGTT
TCGGCGCTGC AGGACCTCGG CGTCGAGGTG ATCGCCACCG GCACCGAGAA ATCCACCGAG
GAAGACCGCG CGCGCATCCG CGAGCTGATG GGGCCGAACG CCCGGATGAT CGACGACAAC
GACCAGAGCG CGCTGATCGC CACCTGCATC GAGAGCGGCG CCGACATCCT CATCGCTGGC
GGACGCTACC TGTACGCCGC GCTCAAGGCG CGCCTGGCGT TCCTCGACAT CAACCACGAA
CGCGACTTCG GCTACGCCGG CTACGGCGGT TTCGTCGAAC TGGCCCGCCA GTTGGCGCTG
GCCGTGCACA GCCCGGTATG GCAGCGCGTG CGCCAGGAGC CGCGCTGGGT ACGCGCCAGC
ACCCGGGCCG CGCTGCTGGA GGAGGCCTGA
 
Protein sequence
MNQTEIQNLL DEPACTHNTA GKTGCSRSRP GATQGGCAFD GAQIAILPIA DAAHIVHGPI 
GCAGSSWDLR GSNSSGPQLY RLGMTTELSD VDVIMGRGEK KLFHAIRRAV ERYQPQAVFV
YGTCVPAMQG DDIEAVARDA SQRWGVPVIP VDGAGFYGTK SLGNRIAGET LYRHVIGTRE
PAPLPQGAVG HGITVHDVNL IGEYNIAGEF WRVAPLFDEL GLRILCTLSG DARFREVQTM
HRAEANMVVC SKAMLNVARH LREDYGTPFF EGSFYGIADT SQALRDFAKA IGDPSLSVRT
ELLILREENR ARAALEPWRE RLAGKRALIF SGGVKSWSVV SALQDLGVEV IATGTEKSTE
EDRARIRELM GPNARMIDDN DQSALIATCI ESGADILIAG GRYLYAALKA RLAFLDINHE
RDFGYAGYGG FVELARQLAL AVHSPVWQRV RQEPRWVRAS TRAALLEEA