Gene Ava_3932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3932 
Symbol 
ID3682977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4886154 
End bp4887596 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content47% 
IMG OID637719284 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_324432 
Protein GI75910136 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.844513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0730658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACA CCCAAGGCAA AATCAACGAG CTACTGAATG AGTCGGGATG CGAACATAAT 
CAGCATAAGC ATGGTGAGAA GAAGAATAAG TCTTGTTCAC AACAAGCGCA ACCTGGGGCG
GCTCAAGGGG GCTGTGCATT TGATGGGGCG ATGATTTCCC TAGTTCCAAT TGTGGATGCG
GCTCATTTGG TGCATGGCCC GATCGCCTGT GCTGGTAATT CTTGGGGTAG TCGGGGTAGT
CTCTCTTCCG GTCCCCAGCT TTACAAAATG GGCTTTACTA CCGATATGTC AGAAAATGAT
GTAATTTTCG GTGGTGAGAA GAAACTCTAT AAGGCAATTC TAGAAATTCA CAAACGTTAC
AATCCATCGG CGGTATTTGT CTACGCTACT TGCGTGACGG CGTTGATTGG TGATGATATT
GATGCTGTCT GCAAAACTGC GTCCGAGAAA ATTGGCACTC CTGTTATCCC CGTAATTGCT
CCTGGGTTTA TTGGGAGTAA GAACTTGGGG AACCGTTTTG GTGGTGAATC TTTATTAGAT
TATGTTGTCG GCACAGCAGA ACCGGAGTAC ACCACACCCT ATGATATAAA CTTAATCGGC
GAATATAACA TCGCCGGGGA AATGTGGGGA GTCCTGCCGT TACTAGAAAA ATTGGGTATT
CGCGTCCTAT CGAAAATCAC AGGCGATGCT CGGTTTGAAG AAATCCGCTA TGCACACCGC
GCCAAGCTGA ATGTGATGAT TTGTTCACGG GCGCTGCTCA ATATGGCGAG AAAGATGGAG
GAAAAATACG GCATCCCCTA CATTGAAGAG TCTTTCTATG GCATCGATGA TATGAATCGG
TGTTTGCGGA ATATTGCCGC CAAATTGGGC GACCCTGATT TGCAAACGCG GACAGAAAAG
CTGATTGCAG AGGAAACGGC GGCGCTGGAT TTGGCACTGG CTCCCTATCG CGCTCGTCTC
AAGGGTAAGC GAGTCGTACT CTATACCGGT GGTGTCAAGA GTTGGTCGAT TATCTCGGCG
GCGAAGGACT TGGGTATTGA AGTTGTGGCT ACCAGTACCA GAAAAAGTAC AGAAGAAGAT
AAAGCTAAAA TCAAACGGTT GTTGGGTGCT GATGGCATCA TGCTAGAAAA AGGCAACGCC
AAAGAACTCC TGCAACTGGT AAAAGATACG CAAGCTGATA TGTTAATTGC TGGTGGTCGT
AACCAATACA CCGCCCTCAA AGCCCGGATT CCCTTCCTTG ATATCAACCA AGAACGCCAT
CATCCTTATG CTGGTTATGT GGGCATGATT GAAATGGCGC GGGAATTGTA CGAAGCCCTC
TACAGCCCGA TTTGGGAACA AATTCGTAAG CCTGCGCCTT GGGATGAAGA TATGGGAATA
CTGGCTCACG AATATACAAG TAATCACGAT CATATCTTGG CATCTATAGA GGAGTTAATC
TGA
 
Protein sequence
MKNTQGKINE LLNESGCEHN QHKHGEKKNK SCSQQAQPGA AQGGCAFDGA MISLVPIVDA 
AHLVHGPIAC AGNSWGSRGS LSSGPQLYKM GFTTDMSEND VIFGGEKKLY KAILEIHKRY
NPSAVFVYAT CVTALIGDDI DAVCKTASEK IGTPVIPVIA PGFIGSKNLG NRFGGESLLD
YVVGTAEPEY TTPYDINLIG EYNIAGEMWG VLPLLEKLGI RVLSKITGDA RFEEIRYAHR
AKLNVMICSR ALLNMARKME EKYGIPYIEE SFYGIDDMNR CLRNIAAKLG DPDLQTRTEK
LIAEETAALD LALAPYRARL KGKRVVLYTG GVKSWSIISA AKDLGIEVVA TSTRKSTEED
KAKIKRLLGA DGIMLEKGNA KELLQLVKDT QADMLIAGGR NQYTALKARI PFLDINQERH
HPYAGYVGMI EMARELYEAL YSPIWEQIRK PAPWDEDMGI LAHEYTSNHD HILASIEELI