Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4251 |
Symbol | |
ID | 3680899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5331416 |
End bp | 5334133 |
Gene Length | 2718 bp |
Protein Length | 905 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637719599 |
Product | bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN |
Protein accession | YP_324745 |
Protein GI | 75910449 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA CCCAAGGCAA AATTAACGAG CTGCTAAGTG AGCCAGGATG CGAACACAAT CACCACAAAC ATGGGCAGAA AAAAAACAAA TCCTGTCATC AACAAGCCCA ACCTGGTGCA GCACAAGGAG GTTGTGCTTT TGATGGCGCA TCAATCGCCC TCGTTCCCAT TACTGATGCA GCTCATTTAG TCCACGGGCC GATCGCCTGC TCTGGTAATT CTTGGGGTGG TCGTGGTAGT CTTTCCTCTG GTTCCCATCT CTACAAAATG GGTTTTACAA CCGACCTGAG TGAGAATGAT ATTATCTTCG GTGGCGAAAA GAAGCTGTAC AAAGCCATCT TGGAGGTACA GCAACGCTAT CAACCTGCGG CAGTTTTTGT CTACTCCACT TGTGTTACAG CATTGATTGG TGATGATTTG GATCCAGTCT GTGAAGCCGC AGCCAAGAAA ACAGGTATAC CTGTAATCCC TGTCAATTCC CCCGGTTTTA TTGGTAGTAA AAACCTTGGT AATCGTGTCG GTGGGGAAGC CTTACTAGAA TATGTTATCG GTAGTGCTGA ACCAGAATAT ACAACACCAT TAGATATCAA CCTCATTGGT GAGTACAACA TTGCCGGCGA ATTGTGGGGT GTTCTGCCCT TATTTGAAAA ATTAGGTATT CGTGTCCTCG CTAAAATTAC GGGTAATGCT AAGTATAAAG AAGTGCAATA TGCTCACCGC GCCAAGCTGA ATGTGATGAT TTGCTCCAAA GCCCTGATCA ACGTGGCAAG AAAGATGGAG GAGCGCTATG GCATTCCCTA CATTGAAGAA TCTTTCTATG GCGTGGATGA CATGAATCGC TGTTTGCGGA ATATTGCTGC TAAATTAGGT GATCAAGGTC TGCAAGAACG AGTAGAACAG TTAATTGCCC AAGAAACTGC TGCTTTAGAT ATTGCTTTGG CTATTTATCG CGATCGCCTC AAAGGTAAAC GTGTTGTTCT TTATACCGGA GGTGTGAAAA GTTGGTCAAT TATCTCGGCG GCTAAGGACT TGGGGATGGA AGTTGTCGCC ACTAGCAGCA AGAAAAGTAC CGAGGAGGAT AAAGCTAGAA TTAGAGACTT ACTGGGCAAA GATGGCATCG TCATGGAAAA AGGCAATGCT CAAGAACTGT TACGAGTAAT CGCCCAAACA AAAGCCGATA TGCTGATTGC TGGTGGTCGC AATCAATACA CTGCCCTAAA AGCCCGCATC CCCTTTTTAG ATATTAACCA AGAACGGCAT CATCCCTATG CTGGCTATGT GGGTATGGTG GAAATGGCGC GAGAACTGGA TGAAGCCCTT TATAGTCCAG TATGGGGGCA GGTGCGTAAG TCGGCATTGT GGCAGGAGGG AGTAGGGGAG CAGAGGAGCA GGGGAGCAGA GGAGCAGAGG GGGAAAACTG TAGTCCAAAA TTCGCATAAA TCGGTTGCGG TTAATCCTTT GAAGCAAAGT CAACCTTTGG GTGCAGCCTT GGCATTTTTA GGTTTGAAAG GTGTAATGCC TTTGTTTCAT GGTTCCCAGG GTTGTACTGC CTTCGCCAAA GTCATGTTAG TGCGGCATTT TCGGGAAGCT ATTCCCTTAT CCACCACTGC TATGACTGAA GTTACTACGA TTTTGGGTGG TGAGGATAAT ATTGAGCAGG CTATCCTGAC TTTGGTTGAG AAGTCAAAGC CAGAAATCAT TGGTCTGTTA ACTACCGGAC TTACGGAAAC CAGAGGGGAT GATATGGAAG GTATCCTCAG AAGTATCCGC AAACGCCACC CAGAATTATA TGATTTGCCG ATAATATTTG CTTCTACTCC AGATTTTCAG GGTGCATTGC AGGATGGTTT TGCCACCGCA GTCGAAAGCA TAGTTAAGGA AATTCCCCAA CCGGGAGAAA CCAGACTAGA CCAAATCAAT ATTTTGGTGA GTTCTGCCTT TACCCCAGGG GATATACAGG AAATTAAAGA GATTGTCTCA GCTTTTGGAC TAGAAACGAT TGTTGTCCCT GATCATTCCA CCTCCCTTGA TGGACACTTG GATGATTCCT ACAGTGCGGT GACTGGTGGT GGTACAACTT TGGCAGAACT GCGACAGATG GGTAGTTCGG TGTTTACCCT TGCACTAGGC GAAAGTATGC GCCGTGCAGC CGAGAGTTTG CAAACACAGT TTGGCATTCC TTACGAGGTG TTTCCTCAAC TGACTGGGTT AGATGCAGTG GATAACTTCT TGCAAGGGTT GGTAGATATT AGTGGTAATG CAGTTCCAGA AAAATATCGC CACCAACGCC GTCAGTTGCA AGATGCGATG CTAGATACTC ACTTTTACTT TGGGCGTAAG CGGGTATCGT TAGCACTAGA ACCTGATTTG TTGTGGTCGA TCGCCTCGTT TTTGGCATCA ATGGGTGCGG AAATTCACGC CGCAGTGACT ACTACAAAGT CGCCGCTTTT AGAAAAACTG CCAGTGGAAA AAGTCACTAT TGGGGATTTG GAAGACTTTG AGCGACTTGC TGTAGGGTCT GATTTGGTCA TTGCTAATTC TCATGGTAAA GCCATTTCCC GCCGTCTACA AACTTCTTTT TATCGTCTTG GTTTTCCTAT TTTTGACCGC TTGGGTAATG GACAGCGTTG TACTGTCGGC TATCGAGGTA CTACACAACT TTTGTTTGAT ATTGGCAATT TATTTCTGGA AGCAGAAGAA GAAAAAGCCA AGCATTAG
|
Protein sequence | MKITQGKINE LLSEPGCEHN HHKHGQKKNK SCHQQAQPGA AQGGCAFDGA SIALVPITDA AHLVHGPIAC SGNSWGGRGS LSSGSHLYKM GFTTDLSEND IIFGGEKKLY KAILEVQQRY QPAAVFVYST CVTALIGDDL DPVCEAAAKK TGIPVIPVNS PGFIGSKNLG NRVGGEALLE YVIGSAEPEY TTPLDINLIG EYNIAGELWG VLPLFEKLGI RVLAKITGNA KYKEVQYAHR AKLNVMICSK ALINVARKME ERYGIPYIEE SFYGVDDMNR CLRNIAAKLG DQGLQERVEQ LIAQETAALD IALAIYRDRL KGKRVVLYTG GVKSWSIISA AKDLGMEVVA TSSKKSTEED KARIRDLLGK DGIVMEKGNA QELLRVIAQT KADMLIAGGR NQYTALKARI PFLDINQERH HPYAGYVGMV EMARELDEAL YSPVWGQVRK SALWQEGVGE QRSRGAEEQR GKTVVQNSHK SVAVNPLKQS QPLGAALAFL GLKGVMPLFH GSQGCTAFAK VMLVRHFREA IPLSTTAMTE VTTILGGEDN IEQAILTLVE KSKPEIIGLL TTGLTETRGD DMEGILRSIR KRHPELYDLP IIFASTPDFQ GALQDGFATA VESIVKEIPQ PGETRLDQIN ILVSSAFTPG DIQEIKEIVS AFGLETIVVP DHSTSLDGHL DDSYSAVTGG GTTLAELRQM GSSVFTLALG ESMRRAAESL QTQFGIPYEV FPQLTGLDAV DNFLQGLVDI SGNAVPEKYR HQRRQLQDAM LDTHFYFGRK RVSLALEPDL LWSIASFLAS MGAEIHAAVT TTKSPLLEKL PVEKVTIGDL EDFERLAVGS DLVIANSHGK AISRRLQTSF YRLGFPIFDR LGNGQRCTVG YRGTTQLLFD IGNLFLEAEE EKAKH
|
| |