Gene Information |
Locus tag | Anae109_3027 |
Symbol | |
ID | 5374745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | - |
Start bp | 3523834 |
End bp | 3526587 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640844552 |
Product | bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN |
Protein accession | YP_001380208 |
Protein GI | 153005883 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE; [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
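The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the terminal stop codon. Neither convention is stated in the record, but both are consistent with the reported values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3523834
END_BP = 3526587
REPORTED_GENE_LENGTH = 2754      # bp
REPORTED_PROTEIN_LENGTH = 917    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length == REPORTED_GENE_LENGTH)                      # True
print(protein_length(length) == REPORTED_PROTEIN_LENGTH)   # True
```

The strand field does not affect this check: the length of a feature is the same whichever strand it lies on.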
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCC CCGACTACTA CGACGCGCCC GAGTGCGAGA CGCAGGAGAA GGGCGCTCCG AAGTTCTGCA GGAGGTCCGA GCCCGGCGAG GGCACCGAGC GGAGCTGCGC CTACGACGGA GCGCGGGTGG TGCTCATGCC GGTGACGGAC GCCATCCACC TCGTCCACGG CCCGATCGCC TGCGCCGGCA ACTCCTGGGA CAACCGCGGC GCGCGCTCCT CCGGCTCGCA GCTGTACCGG CGCGGCTTCA CGACGGAGCT GCTCGAGAAC GACGTGGTCT TCGGCGGGGA GAAGAAGCTC CGCCGCGCCA TCCTCGACCT CGCGGCGCGC TACCCGGAGG CGCGGGCCGT CTTCGTGTAC GCGACCTGCG TGTCGGCCAT GACCGGCGAC GACGTCGAGG CGGTCTGCCG CTCCGTCGCC GGCGAGGTGG CGATCCCGGT CGTGCCGGTG AACACGCCGG GGTTCATCGG CGACAAGAAC ATCGGCAACC GGCTGGCCGG CGAGGTGCTG CTCGAGCACG TCATCGGCAC GGCCGAGCCG GCCACGACGA CCCCGTCCGA CGTCAACCTC ATCGGCGAGT ACAACATCGC CGGGGACCTG TGGGGGATGC TGCCGCTCTT CGAGCGGCTC GGGATCCGGG TCCTCTCCTG CATCTCCGGC GACGCGCGCT TCGACGAGCT GCGCTGGGCG CACCGCGCCC GCCTGAACGT CATCATCTGC TCGAAGAGCC TGACGAACCT CGCGCGCAAG ATGAAGAAGC GCTGGGGCAT CCCGTACCTG GAGGAGTCGT TCTACGGGAT GACGGACACG GCGAAGGCGC TGCGCCACAT CGCGCGCGAG CTCGACCTCG CGCGCGGCGA CGGGGCGAGC GTGATGGCCG AGGCGGTGGA GGCGCTCGTC GCGGAGGAGG AGGAACGCTG CCGGGCGCGC CTCGCGCCCT ACCGGGCGCG GCTCGAGGGG AAGCGCGCGG TGCTCTTCAC GGGCGGCGTG AAGACCTGGT CGATGGTCAA CGCCTTGCGC GAGCTGGGCG TCGAGGTCCT CGCGGCGGGG ACGCAGAACT CCACGCTCGA GGACTTCCAC CGGATGAAGG CGCTCATGCA CCGGGACGCC CGCATCATCG AGGACACCTC GACCGCCGGG CTCCTCGAGA TCATGCGCGA GAAGCTGCCG GACCTGGTGG TGGCGGGCGG GAAGACCAAG TTCCTCGCGC TCAAGACCCG CACGCCGTTC CTCGACATCA ACCACGGGCG GGCGCACCCC TACGCCGGGT ACGAGGGGAT GGTCACCTTC GCGCGGCAGC TCGACCTCAC GGTGAGCAAC CCCATCTGGT CCGCGCTGAC CGCTCCCGCC CCCTGGGAGC GGAGCGCCGC CGGCCTCGAG GCGGAGCGCG CCGCCGCGCG CGGCCACGGC GCGGCGCTCC TCGCGGAGGA GCTCTCCGCC TCGCGCGTGA AGGTCCCCGC CAAGCCCGCG ACGGTGAACC CGCAGAAGAA CTCGCCCGCG CTGGGGGCCA CGCTGGCGTA CCTGGGCGTC GACCGGATGC TCGCGCTGCT CCACGGCGCG CAGGGCTGCT CGACGTTCAT CCGGCTGCAG CTCTCCCGGC ACTTCAAGGA GTCGATCGCG CTCAACTCGA CGGCGATGAG CGAGGACGCC GCGATCTTCG GTGGCTGGAG CAACCTGAAA GCCGGCCTCG CCCGCGTGAT CGAGAAGTTC CGGCCCGGCG TCGTCGGGGT GATGACCTCC GGCCTCACCG AGACGATGGG CGACGACGTG CGGAGCGCGA TCGCGCAGTT CCGCGAGGAG 
CACCCCGAGC ACGCCGGCGT GCCGATCGTC TGGGCCTCTA CGCCGGACTA CTGCGGCTCG CTGCAGGAGG GCTACGCCGC GGCGGTCGAG GCGCTCGTCT CGACGCTCGT CGACGGAGGC GCCCCGATCC CGGGCCAGGT GACGCTGCTC CCCGGCGCCC ACCTCACCCC GGCGGACGTG GAGGAGCTGA AGGGCACGAT CGAGGCCTTC GGGCTCTCCG TCGTCGCGGT GCCCGACGTG GCGAACGCGC TCGACGGCCA CATCGACGCC GAGGTCTCGC CGCTCTCGAC CGGCGGCGCC GGGGTGGACG CGATCCGCTC CGCCGGCCGC AGCGTCGCGA CCCTCTACGT GGGCGACTCG CTCGCGCGCG CGGCGCGGAG CCTCGAGGAG GCGCACGGCG TCCCCGCGTA CGGGTTCACC TCGCTCACCG GGATCGGCGA GGTCGATCGC CTCGTGGCGA CGCTCGCGGC GATCTCCGGC CGGCCGGTGC CCGGCGCGCT CCGCCGCGCG CGCAGCCGGC TCATGGACGC GATGGTCGAC AGCCACTACC AGCTCGGCGG CAAGCGCGTC GCGCTCGCCC TGGAGGCGGA CCCGCTGAAG GTGCTCACCC GCTTCTTGCA CGGCATGGGC TGCGAGGTCA CCGCCGCCCT GGCCGCGACG CGCACGCGCG GGCTGCACGA GCTCCCCGCC GCGACGGTGG CCGCCGGTGA CCTCGAGGAC CTCGAGGGCG CGGCCGAGGG GGTCGACCTC GTCGTGGCCA ACTCCAACGG CCGCCAGGCG GTCGCGAGGC TCGGGGTGAA GGCGCACCTG CGCGCGGGGT TGCCGGTGTT CGACCGGCTC GGCGCCCACC AGAAGGTGTG GGTCGGCTAC CGCGGCACGA TGAACCTCGT CTTCGAGGTG GCGAACCTGT TCCAGGCGAG CGCGACCGAG GCGCAGCGGC TCGCGCACAA CTGA
|
Protein sequence | MPRPDYYDAP ECETQEKGAP KFCRRSEPGE GTERSCAYDG ARVVLMPVTD AIHLVHGPIA CAGNSWDNRG ARSSGSQLYR RGFTTELLEN DVVFGGEKKL RRAILDLAAR YPEARAVFVY ATCVSAMTGD DVEAVCRSVA GEVAIPVVPV NTPGFIGDKN IGNRLAGEVL LEHVIGTAEP ATTTPSDVNL IGEYNIAGDL WGMLPLFERL GIRVLSCISG DARFDELRWA HRARLNVIIC SKSLTNLARK MKKRWGIPYL EESFYGMTDT AKALRHIARE LDLARGDGAS VMAEAVEALV AEEEERCRAR LAPYRARLEG KRAVLFTGGV KTWSMVNALR ELGVEVLAAG TQNSTLEDFH RMKALMHRDA RIIEDTSTAG LLEIMREKLP DLVVAGGKTK FLALKTRTPF LDINHGRAHP YAGYEGMVTF ARQLDLTVSN PIWSALTAPA PWERSAAGLE AERAAARGHG AALLAEELSA SRVKVPAKPA TVNPQKNSPA LGATLAYLGV DRMLALLHGA QGCSTFIRLQ LSRHFKESIA LNSTAMSEDA AIFGGWSNLK AGLARVIEKF RPGVVGVMTS GLTETMGDDV RSAIAQFREE HPEHAGVPIV WASTPDYCGS LQEGYAAAVE ALVSTLVDGG APIPGQVTLL PGAHLTPADV EELKGTIEAF GLSVVAVPDV ANALDGHIDA EVSPLSTGGA GVDAIRSAGR SVATLYVGDS LARAARSLEE AHGVPAYGFT SLTGIGEVDR LVATLAAISG RPVPGALRRA RSRLMDAMVD SHYQLGGKRV ALALEADPLK VLTRFLHGMG CEVTAALAAT RTRGLHELPA ATVAAGDLED LEGAAEGVDL VVANSNGRQA VARLGVKAHL RAGLPVFDRL GAHQKVWVGY RGTMNLVFEV ANLFQASATE AQRLAHN
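The sequence fields above are printed in space-separated blocks of ten, and the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand. A minimal sketch of how one might strip the block spacing, compute GC content, and translate the gene sequence follows. It assumes translation table 11 shares its amino acid assignments with the standard code (the two differ in permitted start codons), and it does not handle alternative start codons. The helper names are illustrative and do not come from the source database.

```python
# Codon table in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
sample = "ATGCCCCGCC CCGACTACTA CGACGCGCCC"
print(translate(sample))  # MPRPDYYDAP
```

The printed residues match the first block of the protein sequence field. Applying `gc_percent` and `translate` to the full gene sequence should allow the GC content and protein length fields to be checked in the same way.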
|
| |