Gene Ava_4251 details

Gene Information

Locus tag: Ava_4251
Symbol:
ID: 3680899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5331416
End bp: 5334133
Gene Length: 2718 bp
Protein Length: 905 aa
Translation table: 11
GC content: 46%
IMG OID: 637719599
Product: bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN
Protein accession: YP_324745
Protein GI: 75910449
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
            [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
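
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(5331416, 5334133)
print(length)                     # 2718
print(protein_length_aa(length))  # 905
```

Under these assumptions the computed values agree with the Gene Length (2718 bp) and Protein Length (905 aa) fields of this record.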


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATCA CCCAAGGCAA AATTAACGAG CTGCTAAGTG AGCCAGGATG CGAACACAAT 
CACCACAAAC ATGGGCAGAA AAAAAACAAA TCCTGTCATC AACAAGCCCA ACCTGGTGCA
GCACAAGGAG GTTGTGCTTT TGATGGCGCA TCAATCGCCC TCGTTCCCAT TACTGATGCA
GCTCATTTAG TCCACGGGCC GATCGCCTGC TCTGGTAATT CTTGGGGTGG TCGTGGTAGT
CTTTCCTCTG GTTCCCATCT CTACAAAATG GGTTTTACAA CCGACCTGAG TGAGAATGAT
ATTATCTTCG GTGGCGAAAA GAAGCTGTAC AAAGCCATCT TGGAGGTACA GCAACGCTAT
CAACCTGCGG CAGTTTTTGT CTACTCCACT TGTGTTACAG CATTGATTGG TGATGATTTG
GATCCAGTCT GTGAAGCCGC AGCCAAGAAA ACAGGTATAC CTGTAATCCC TGTCAATTCC
CCCGGTTTTA TTGGTAGTAA AAACCTTGGT AATCGTGTCG GTGGGGAAGC CTTACTAGAA
TATGTTATCG GTAGTGCTGA ACCAGAATAT ACAACACCAT TAGATATCAA CCTCATTGGT
GAGTACAACA TTGCCGGCGA ATTGTGGGGT GTTCTGCCCT TATTTGAAAA ATTAGGTATT
CGTGTCCTCG CTAAAATTAC GGGTAATGCT AAGTATAAAG AAGTGCAATA TGCTCACCGC
GCCAAGCTGA ATGTGATGAT TTGCTCCAAA GCCCTGATCA ACGTGGCAAG AAAGATGGAG
GAGCGCTATG GCATTCCCTA CATTGAAGAA TCTTTCTATG GCGTGGATGA CATGAATCGC
TGTTTGCGGA ATATTGCTGC TAAATTAGGT GATCAAGGTC TGCAAGAACG AGTAGAACAG
TTAATTGCCC AAGAAACTGC TGCTTTAGAT ATTGCTTTGG CTATTTATCG CGATCGCCTC
AAAGGTAAAC GTGTTGTTCT TTATACCGGA GGTGTGAAAA GTTGGTCAAT TATCTCGGCG
GCTAAGGACT TGGGGATGGA AGTTGTCGCC ACTAGCAGCA AGAAAAGTAC CGAGGAGGAT
AAAGCTAGAA TTAGAGACTT ACTGGGCAAA GATGGCATCG TCATGGAAAA AGGCAATGCT
CAAGAACTGT TACGAGTAAT CGCCCAAACA AAAGCCGATA TGCTGATTGC TGGTGGTCGC
AATCAATACA CTGCCCTAAA AGCCCGCATC CCCTTTTTAG ATATTAACCA AGAACGGCAT
CATCCCTATG CTGGCTATGT GGGTATGGTG GAAATGGCGC GAGAACTGGA TGAAGCCCTT
TATAGTCCAG TATGGGGGCA GGTGCGTAAG TCGGCATTGT GGCAGGAGGG AGTAGGGGAG
CAGAGGAGCA GGGGAGCAGA GGAGCAGAGG GGGAAAACTG TAGTCCAAAA TTCGCATAAA
TCGGTTGCGG TTAATCCTTT GAAGCAAAGT CAACCTTTGG GTGCAGCCTT GGCATTTTTA
GGTTTGAAAG GTGTAATGCC TTTGTTTCAT GGTTCCCAGG GTTGTACTGC CTTCGCCAAA
GTCATGTTAG TGCGGCATTT TCGGGAAGCT ATTCCCTTAT CCACCACTGC TATGACTGAA
GTTACTACGA TTTTGGGTGG TGAGGATAAT ATTGAGCAGG CTATCCTGAC TTTGGTTGAG
AAGTCAAAGC CAGAAATCAT TGGTCTGTTA ACTACCGGAC TTACGGAAAC CAGAGGGGAT
GATATGGAAG GTATCCTCAG AAGTATCCGC AAACGCCACC CAGAATTATA TGATTTGCCG
ATAATATTTG CTTCTACTCC AGATTTTCAG GGTGCATTGC AGGATGGTTT TGCCACCGCA
GTCGAAAGCA TAGTTAAGGA AATTCCCCAA CCGGGAGAAA CCAGACTAGA CCAAATCAAT
ATTTTGGTGA GTTCTGCCTT TACCCCAGGG GATATACAGG AAATTAAAGA GATTGTCTCA
GCTTTTGGAC TAGAAACGAT TGTTGTCCCT GATCATTCCA CCTCCCTTGA TGGACACTTG
GATGATTCCT ACAGTGCGGT GACTGGTGGT GGTACAACTT TGGCAGAACT GCGACAGATG
GGTAGTTCGG TGTTTACCCT TGCACTAGGC GAAAGTATGC GCCGTGCAGC CGAGAGTTTG
CAAACACAGT TTGGCATTCC TTACGAGGTG TTTCCTCAAC TGACTGGGTT AGATGCAGTG
GATAACTTCT TGCAAGGGTT GGTAGATATT AGTGGTAATG CAGTTCCAGA AAAATATCGC
CACCAACGCC GTCAGTTGCA AGATGCGATG CTAGATACTC ACTTTTACTT TGGGCGTAAG
CGGGTATCGT TAGCACTAGA ACCTGATTTG TTGTGGTCGA TCGCCTCGTT TTTGGCATCA
ATGGGTGCGG AAATTCACGC CGCAGTGACT ACTACAAAGT CGCCGCTTTT AGAAAAACTG
CCAGTGGAAA AAGTCACTAT TGGGGATTTG GAAGACTTTG AGCGACTTGC TGTAGGGTCT
GATTTGGTCA TTGCTAATTC TCATGGTAAA GCCATTTCCC GCCGTCTACA AACTTCTTTT
TATCGTCTTG GTTTTCCTAT TTTTGACCGC TTGGGTAATG GACAGCGTTG TACTGTCGGC
TATCGAGGTA CTACACAACT TTTGTTTGAT ATTGGCAATT TATTTCTGGA AGCAGAAGAA
GAAAAAGCCA AGCATTAG
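
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch of one way to do that and to estimate GC content. How the source database derives its reported GC content is not stated in this record, so the plain (G + C) / length fraction used here is an assumption, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring layout whitespace."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed row of the gene sequence, copied as laid out in the record.
first_row = "ATGAAAATCA CCCAAGGCAA AATTAACGAG CTGCTAAGTG AGCCAGGATG CGAACACAAT"
print(len(clean_sequence(first_row)))  # 60
print(gc_fraction(first_row))
```

Passing the full gene sequence instead of `first_row` gives a value that can be compared with the GC content field of the record.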
 
Protein sequence
MKITQGKINE LLSEPGCEHN HHKHGQKKNK SCHQQAQPGA AQGGCAFDGA SIALVPITDA 
AHLVHGPIAC SGNSWGGRGS LSSGSHLYKM GFTTDLSEND IIFGGEKKLY KAILEVQQRY
QPAAVFVYST CVTALIGDDL DPVCEAAAKK TGIPVIPVNS PGFIGSKNLG NRVGGEALLE
YVIGSAEPEY TTPLDINLIG EYNIAGELWG VLPLFEKLGI RVLAKITGNA KYKEVQYAHR
AKLNVMICSK ALINVARKME ERYGIPYIEE SFYGVDDMNR CLRNIAAKLG DQGLQERVEQ
LIAQETAALD IALAIYRDRL KGKRVVLYTG GVKSWSIISA AKDLGMEVVA TSSKKSTEED
KARIRDLLGK DGIVMEKGNA QELLRVIAQT KADMLIAGGR NQYTALKARI PFLDINQERH
HPYAGYVGMV EMARELDEAL YSPVWGQVRK SALWQEGVGE QRSRGAEEQR GKTVVQNSHK
SVAVNPLKQS QPLGAALAFL GLKGVMPLFH GSQGCTAFAK VMLVRHFREA IPLSTTAMTE
VTTILGGEDN IEQAILTLVE KSKPEIIGLL TTGLTETRGD DMEGILRSIR KRHPELYDLP
IIFASTPDFQ GALQDGFATA VESIVKEIPQ PGETRLDQIN ILVSSAFTPG DIQEIKEIVS
AFGLETIVVP DHSTSLDGHL DDSYSAVTGG GTTLAELRQM GSSVFTLALG ESMRRAAESL
QTQFGIPYEV FPQLTGLDAV DNFLQGLVDI SGNAVPEKYR HQRRQLQDAM LDTHFYFGRK
RVSLALEPDL LWSIASFLAS MGAEIHAAVT TTKSPLLEKL PVEKVTIGDL EDFERLAVGS
DLVIANSHGK AISRRLQTSF YRLGFPIFDR LGNGQRCTVG YRGTTQLLFD IGNLFLEAEE
EKAKH