Gene Avin_48990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_48990 
SymbolanfD 
ID7763758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4958633 
End bp4960189 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content57% 
IMG OID643807739 
Productnitrogenase iron-iron protein, alpha chain 
Protein accessionYP_002801974 
Protein GI226946901 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01284] nitrogenase alpha chain
[TIGR01861] nitrogenase iron-iron protein, alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.546352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCATC ACGAGTTCGA GTGCAGCAAG GTTATTCCCG AGCGGAAGAA GCATGCCGTT 
ATCAAAGGTA AAGGCGAAAC GCTGGCCGAC GCCCTGCCTC AAGGGTATCT GAATACCATC
CCTGGTTCCA TCTCCGAGCG TGGTTGTGCC TACTGTGGTG CCAAGCACGT TATCGGGACT
CCCATGAAGG ATGTGATTCA CATCAGTCAT GGCCCGGTCG GCTGCACTTA CGATACCTGG
CAGACCAAGC GTTATATCAG CGACAACGAC AACTTCCAGC TCAAATACAC CTATGCCACC
GATGTGAAGG AAAAGCATAT CGTGTTCGGC GCCGAGAAGT TGCTGAAGCA GAACATCATC
GAAGCCTTCA AGGCGTTCCC GCAGATCAAG CGGATGACCA TCTACCAGAC CTGCGCCACG
GCGCTGATCG GAGACGACAT CAACGCCATC GCCGAAGAGG TGATGGAAGA GATGCCGGAG
GTGGATATCT TCGTCTGCAA CTCGCCCGGT TTCGCCGGTC CGAGCCAGTC CGGTGGTCAC
CACAAGATCA ACATCGCCTG GATCAACCAG AAGGTGGGTA CCGTCGAGCC GGAGATCACC
GGCGACCATG TGATCAACTA TGTGGGCGAG TACAACATTC AGGGCGACCA GGAAGTGATG
GTGGATTACT TCAAGCGCAT GGGTATCCAG GTGCTATCCA CTTTCACCGG CAACGGTTCC
TACGACGGCC TGCGTGCCAT GCACAGAGCC CATCTGAACG TACTGGAATG TGCCCGCTCC
GCCGAGTACA TCTGCAACGA ACTGCGTGTC CGTTACGGCA TTCCGCGTCT GGATATCGAC
GGTTTCGGTT TCAAGCCACT GGCGGATTCG CTGCGTAAGA TCGGTATGTT CTTCGGCATC
GAAGACCGTG CCAAGGCCAT CATCGACGAG GAAGTCGCCC GCTGGAAGCC GGAGTTGGAC
TGGTACAAGG AGCGGCTGAT GGGCAAGAAG GTCTGCCTGT GGCCGGGCGG TTCCAAACTC
TGGCACTGGG CCCATGTGAT CGAGGAAGAA ATGGGCCTCA AGGTGGTGTC GGTCTATACC
AAGTTCGGCC ATCAGGGCGA CATGGAGAAA GGCATCGCCC GTTGCGGCGA AGGCACTTTG
GCCATCGACG ACCCGAACGA ATTGGAAGGT CTGGAAGCCC TGGAGATGCT CAAGCCCGAC
ATCATCCTGA CCGGCAAGCG TCCGGGTGAA GTGGCCAAGA AAGTCCGGGT TCCCTACCTG
AACGCCCACG CCTACCACAA CGGCCCGTAC AAAGGCTTCG AAGGTTGGGT GCGTTTCGCC
CGCGATATTT ACAACGCCAT CTACTCGCCG ATCCATCAGC TCTCCGGTAT CGACATCACT
AAAGACAATG CACCGGAGTG GGGTAATGGT TTCCGTACTC GCCAAATGCT GTCCGATGGC
AACTTGAGCG ATGCAGTACG TAACTCGGAA ACCTTGCGCC AGTACACCGG CGGCTACGAC
AGCGTGAGCA AGCTGCGCGA ACGGGAATAT CCCGCCTTCG AGCGCAAGGT CGGCTGA
 
Protein sequence
MPHHEFECSK VIPERKKHAV IKGKGETLAD ALPQGYLNTI PGSISERGCA YCGAKHVIGT 
PMKDVIHISH GPVGCTYDTW QTKRYISDND NFQLKYTYAT DVKEKHIVFG AEKLLKQNII
EAFKAFPQIK RMTIYQTCAT ALIGDDINAI AEEVMEEMPE VDIFVCNSPG FAGPSQSGGH
HKINIAWINQ KVGTVEPEIT GDHVINYVGE YNIQGDQEVM VDYFKRMGIQ VLSTFTGNGS
YDGLRAMHRA HLNVLECARS AEYICNELRV RYGIPRLDID GFGFKPLADS LRKIGMFFGI
EDRAKAIIDE EVARWKPELD WYKERLMGKK VCLWPGGSKL WHWAHVIEEE MGLKVVSVYT
KFGHQGDMEK GIARCGEGTL AIDDPNELEG LEALEMLKPD IILTGKRPGE VAKKVRVPYL
NAHAYHNGPY KGFEGWVRFA RDIYNAIYSP IHQLSGIDIT KDNAPEWGNG FRTRQMLSDG
NLSDAVRNSE TLRQYTGGYD SVSKLREREY PAFERKVG