Gene Information |
Locus tag | Ava_4051 |
Symbol | |
ID | 3681672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5036448 |
End bp | 5037818 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637719403 |
Product | nitrogenase MoFe cofactor biosynthesis protein NifE |
Protein accession | YP_324551 |
Protein GI | 75910255 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
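The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the only convention that reproduces the listed gene length) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5036448
END_BP = 5037818


def gene_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1371 456
```

Both results match the Gene Length (1371 bp) and Protein Length (456 aa) rows.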
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.283872 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGAAATA CTCAGGAAGA AATCAACAAA CTACTCAACG AAATAGCCTG CGAATATAAC GTCAAGAAAT TGCAGAGCAA GCCCAATAAA ACTACTTGTA AAAAACCACC AAAACCAGGT GCAGCGCAGG GAAATTGTCC TTTTGAGGGG GCGATGGTGG CTCTTGGTCC AATTACTGAT GCAGTGCATT TAGTACATGG CCCTGTTGGT TGCACGCGCA ATCCTTGGGG TAGTCGGGGT AGTCTTTCTT CTAGCTCTGA ACTCTACCAA ATGGGTTTCT CTACTGATTT GAGTGAGAAT GAAATCATTT TTGGCGGTGA AAAGAAGCTA GAAAAATCTA TTTTACATAT TGCTCAACGT TATAATCCGG CTGCTATTTT TGTCTATTCT ACTTGTGTCA CGGCTTTGAT TGGTGACGAT ATCGAAAGTG TTTGTAAAAA AGCTACAAAT CAAGTCGGGA TTCCGGTTGT TTATATCAAC GCACCTGGAT TTCTTGGGAG TAAAAATTTA GGCAATCGTA TTGCCGGGGA AGCTTTATTA GATTATGTAG TGGGAACAGC CGAACCAGAA TTTACTACGC CATTTGATAT CAATATTATT GGTGATTATA ATGTTGCTGG GGAAATCTGG AATATTTTAC CACTGTTGCA GAAATTAGGT ATTCGCGTTC TTTCCAAAAT TACCGGTGAT GCACGTTATC AAGAAATTTG CTATGCTCAC CGCGCGAAAT TAAATGTGGT AATTTGCTCG AATGTGTCAC TAAAAATGGC ACAAACAATG CAGGAGCGTT ACGGCATTCC TTATATTGAA GAGTCTTTCT TTGGCATAGA GAACATGAAT AGTTGTTTGC GAAATATTGC AGCAGCATTC GGTGATCAAT ACTTACAAGA AAGAACGGAA TGGTTAATTG CGGAGGAAAC CGCAGCTTTA GATATTGCCC TTGCTCCTTA TCGTTCTCAG TTACAAGGAA AACGTATTGT CCTTTATAGT GGAGGTGTGA AAAGTTGGTC AGTAATTTTA GCAGCAAAAG ACTTGGGTAT GAAGGTTGTT GCTACTAGTG ATAGGAAGAA TACTCAAGAT GAAAAAGTTA AAATTAAGGA ATTACTTGGT CAAGATGGCA TGGTTTTGGC TAAGGGTGGC CCCAAGGCTT TATTACAAGT AATAGAAGAT ATGAATGCGG ATATTTTAGT TGCTGGTGCT AGTAATCAAT ACACAGCAAT TAAAGCACGT ATTCCTTTTT TAGATATTAA TCATGAGCGT CATCATGCTT ACGCTGGTTA TGCGGGAATG GTGGAAATGG CGCGGGAATT TTATGAAGCC TTGTATAGTC CTGTGTGGAA ACAAGTTAGG CAACCTGCTC CTTGGGAGTA G
Protein sequence | MRNTQEEINK LLNEIACEYN VKKLQSKPNK TTCKKPPKPG AAQGNCPFEG AMVALGPITD AVHLVHGPVG CTRNPWGSRG SLSSSSELYQ MGFSTDLSEN EIIFGGEKKL EKSILHIAQR YNPAAIFVYS TCVTALIGDD IESVCKKATN QVGIPVVYIN APGFLGSKNL GNRIAGEALL DYVVGTAEPE FTTPFDINII GDYNVAGEIW NILPLLQKLG IRVLSKITGD ARYQEICYAH RAKLNVVICS NVSLKMAQTM QERYGIPYIE ESFFGIENMN SCLRNIAAAF GDQYLQERTE WLIAEETAAL DIALAPYRSQ LQGKRIVLYS GGVKSWSVIL AAKDLGMKVV ATSDRKNTQD EKVKIKELLG QDGMVLAKGG PKALLQVIED MNADILVAGA SNQYTAIKAR IPFLDINHER HHAYAGYAGM VEMAREFYEA LYSPVWKQVR QPAPWE
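The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the set of permitted start codons), so the codon table below is the standard one. The `translate` helper is hypothetical, handles only `A`, `C`, `G`, `T`, and does not treat alternative start codons specially; this gene begins with ATG, so that case does not arise here. Only the first 30 and last 21 bases of the gene sequence are used, copied from the record with their 10-base grouping.

```python
# Standard codon table in TCAG order; amino acid assignments are shared
# by translation table 11 (bacterial, archaeal and plant plastid code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    clean = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(clean) - 2, 3):
        aa = CODON_TABLE[clean[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence above.
head = "ATGAGAAATA CTCAGGAAGA AATCAACAAA"
tail = "CAACCTGCTC CTTGGGAGTA G"

print(translate(head))  # MRNTQEEINK
print(translate(tail))  # QPAPWE
```

The two outputs match the first ten and last six residues of the protein sequence, and the final TAG codon is the stop that is not counted in the 456 aa length.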