Gene Anae109_3027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3027 
Symbol 
ID5374745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp3523834 
End bp3526587 
Gene Length2754 bp 
Protein Length917 aa 
Translation table11 
GC content73% 
IMG OID640844552 
Productbifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN 
Protein accessionYP_001380208 
Protein GI153005883 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCC CCGACTACTA CGACGCGCCC GAGTGCGAGA CGCAGGAGAA GGGCGCTCCG 
AAGTTCTGCA GGAGGTCCGA GCCCGGCGAG GGCACCGAGC GGAGCTGCGC CTACGACGGA
GCGCGGGTGG TGCTCATGCC GGTGACGGAC GCCATCCACC TCGTCCACGG CCCGATCGCC
TGCGCCGGCA ACTCCTGGGA CAACCGCGGC GCGCGCTCCT CCGGCTCGCA GCTGTACCGG
CGCGGCTTCA CGACGGAGCT GCTCGAGAAC GACGTGGTCT TCGGCGGGGA GAAGAAGCTC
CGCCGCGCCA TCCTCGACCT CGCGGCGCGC TACCCGGAGG CGCGGGCCGT CTTCGTGTAC
GCGACCTGCG TGTCGGCCAT GACCGGCGAC GACGTCGAGG CGGTCTGCCG CTCCGTCGCC
GGCGAGGTGG CGATCCCGGT CGTGCCGGTG AACACGCCGG GGTTCATCGG CGACAAGAAC
ATCGGCAACC GGCTGGCCGG CGAGGTGCTG CTCGAGCACG TCATCGGCAC GGCCGAGCCG
GCCACGACGA CCCCGTCCGA CGTCAACCTC ATCGGCGAGT ACAACATCGC CGGGGACCTG
TGGGGGATGC TGCCGCTCTT CGAGCGGCTC GGGATCCGGG TCCTCTCCTG CATCTCCGGC
GACGCGCGCT TCGACGAGCT GCGCTGGGCG CACCGCGCCC GCCTGAACGT CATCATCTGC
TCGAAGAGCC TGACGAACCT CGCGCGCAAG ATGAAGAAGC GCTGGGGCAT CCCGTACCTG
GAGGAGTCGT TCTACGGGAT GACGGACACG GCGAAGGCGC TGCGCCACAT CGCGCGCGAG
CTCGACCTCG CGCGCGGCGA CGGGGCGAGC GTGATGGCCG AGGCGGTGGA GGCGCTCGTC
GCGGAGGAGG AGGAACGCTG CCGGGCGCGC CTCGCGCCCT ACCGGGCGCG GCTCGAGGGG
AAGCGCGCGG TGCTCTTCAC GGGCGGCGTG AAGACCTGGT CGATGGTCAA CGCCTTGCGC
GAGCTGGGCG TCGAGGTCCT CGCGGCGGGG ACGCAGAACT CCACGCTCGA GGACTTCCAC
CGGATGAAGG CGCTCATGCA CCGGGACGCC CGCATCATCG AGGACACCTC GACCGCCGGG
CTCCTCGAGA TCATGCGCGA GAAGCTGCCG GACCTGGTGG TGGCGGGCGG GAAGACCAAG
TTCCTCGCGC TCAAGACCCG CACGCCGTTC CTCGACATCA ACCACGGGCG GGCGCACCCC
TACGCCGGGT ACGAGGGGAT GGTCACCTTC GCGCGGCAGC TCGACCTCAC GGTGAGCAAC
CCCATCTGGT CCGCGCTGAC CGCTCCCGCC CCCTGGGAGC GGAGCGCCGC CGGCCTCGAG
GCGGAGCGCG CCGCCGCGCG CGGCCACGGC GCGGCGCTCC TCGCGGAGGA GCTCTCCGCC
TCGCGCGTGA AGGTCCCCGC CAAGCCCGCG ACGGTGAACC CGCAGAAGAA CTCGCCCGCG
CTGGGGGCCA CGCTGGCGTA CCTGGGCGTC GACCGGATGC TCGCGCTGCT CCACGGCGCG
CAGGGCTGCT CGACGTTCAT CCGGCTGCAG CTCTCCCGGC ACTTCAAGGA GTCGATCGCG
CTCAACTCGA CGGCGATGAG CGAGGACGCC GCGATCTTCG GTGGCTGGAG CAACCTGAAA
GCCGGCCTCG CCCGCGTGAT CGAGAAGTTC CGGCCCGGCG TCGTCGGGGT GATGACCTCC
GGCCTCACCG AGACGATGGG CGACGACGTG CGGAGCGCGA TCGCGCAGTT CCGCGAGGAG
CACCCCGAGC ACGCCGGCGT GCCGATCGTC TGGGCCTCTA CGCCGGACTA CTGCGGCTCG
CTGCAGGAGG GCTACGCCGC GGCGGTCGAG GCGCTCGTCT CGACGCTCGT CGACGGAGGC
GCCCCGATCC CGGGCCAGGT GACGCTGCTC CCCGGCGCCC ACCTCACCCC GGCGGACGTG
GAGGAGCTGA AGGGCACGAT CGAGGCCTTC GGGCTCTCCG TCGTCGCGGT GCCCGACGTG
GCGAACGCGC TCGACGGCCA CATCGACGCC GAGGTCTCGC CGCTCTCGAC CGGCGGCGCC
GGGGTGGACG CGATCCGCTC CGCCGGCCGC AGCGTCGCGA CCCTCTACGT GGGCGACTCG
CTCGCGCGCG CGGCGCGGAG CCTCGAGGAG GCGCACGGCG TCCCCGCGTA CGGGTTCACC
TCGCTCACCG GGATCGGCGA GGTCGATCGC CTCGTGGCGA CGCTCGCGGC GATCTCCGGC
CGGCCGGTGC CCGGCGCGCT CCGCCGCGCG CGCAGCCGGC TCATGGACGC GATGGTCGAC
AGCCACTACC AGCTCGGCGG CAAGCGCGTC GCGCTCGCCC TGGAGGCGGA CCCGCTGAAG
GTGCTCACCC GCTTCTTGCA CGGCATGGGC TGCGAGGTCA CCGCCGCCCT GGCCGCGACG
CGCACGCGCG GGCTGCACGA GCTCCCCGCC GCGACGGTGG CCGCCGGTGA CCTCGAGGAC
CTCGAGGGCG CGGCCGAGGG GGTCGACCTC GTCGTGGCCA ACTCCAACGG CCGCCAGGCG
GTCGCGAGGC TCGGGGTGAA GGCGCACCTG CGCGCGGGGT TGCCGGTGTT CGACCGGCTC
GGCGCCCACC AGAAGGTGTG GGTCGGCTAC CGCGGCACGA TGAACCTCGT CTTCGAGGTG
GCGAACCTGT TCCAGGCGAG CGCGACCGAG GCGCAGCGGC TCGCGCACAA CTGA
 
Protein sequence
MPRPDYYDAP ECETQEKGAP KFCRRSEPGE GTERSCAYDG ARVVLMPVTD AIHLVHGPIA 
CAGNSWDNRG ARSSGSQLYR RGFTTELLEN DVVFGGEKKL RRAILDLAAR YPEARAVFVY
ATCVSAMTGD DVEAVCRSVA GEVAIPVVPV NTPGFIGDKN IGNRLAGEVL LEHVIGTAEP
ATTTPSDVNL IGEYNIAGDL WGMLPLFERL GIRVLSCISG DARFDELRWA HRARLNVIIC
SKSLTNLARK MKKRWGIPYL EESFYGMTDT AKALRHIARE LDLARGDGAS VMAEAVEALV
AEEEERCRAR LAPYRARLEG KRAVLFTGGV KTWSMVNALR ELGVEVLAAG TQNSTLEDFH
RMKALMHRDA RIIEDTSTAG LLEIMREKLP DLVVAGGKTK FLALKTRTPF LDINHGRAHP
YAGYEGMVTF ARQLDLTVSN PIWSALTAPA PWERSAAGLE AERAAARGHG AALLAEELSA
SRVKVPAKPA TVNPQKNSPA LGATLAYLGV DRMLALLHGA QGCSTFIRLQ LSRHFKESIA
LNSTAMSEDA AIFGGWSNLK AGLARVIEKF RPGVVGVMTS GLTETMGDDV RSAIAQFREE
HPEHAGVPIV WASTPDYCGS LQEGYAAAVE ALVSTLVDGG APIPGQVTLL PGAHLTPADV
EELKGTIEAF GLSVVAVPDV ANALDGHIDA EVSPLSTGGA GVDAIRSAGR SVATLYVGDS
LARAARSLEE AHGVPAYGFT SLTGIGEVDR LVATLAAISG RPVPGALRRA RSRLMDAMVD
SHYQLGGKRV ALALEADPLK VLTRFLHGMG CEVTAALAAT RTRGLHELPA ATVAAGDLED
LEGAAEGVDL VVANSNGRQA VARLGVKAHL RAGLPVFDRL GAHQKVWVGY RGTMNLVFEV
ANLFQASATE AQRLAHN