Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2142 |
Symbol | |
ID | 8137478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2497421 |
End bp | 2500177 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869757 |
Product | bifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN |
Protein accession | YP_003021952 |
Protein GI | 253700763 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.0853728 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGC CAGACTGGTA CGACACAACC GATTGCGACA CTCACGATGC CGGGGCTCCG AAATTCTGTA AAAAGTCGGA ACCGGGCGAG GGAACGGAAC GAAGCTGCGC CTACGACGGC GCCCGCGTGG TCCTCATGCC CATTACCGAC GTCATCCACC TGGTGCACGG CCCCATCGCC TGCGCCGGAA ACTCCTGGGA CAACCGCGGG GCTCGCTCCT CGGATTCCCA GCTCTACCGC CGCGGCTTCA CTACGGAGAT GCTGCAAAAC GACGTGATCT TCGGCGGCGA AAAGAAGCTC TACCGCGCCA TCCAGGAACT GGCGGCACGC TACCCGGAGG CGAAGGCGAT CTTCGTCTAC GCCACCTGCG TTACCTCCAT GACCGGCGAC GACATCGAGG CGGTCTGCAA AGCGGCACAG GACAAGGTGA GCGTCCCGGT GATCCCGGTG AACACGCCCG GCTTCATCGG GGACAAGAAC ATCGGCAACC GCCTGGCCGG AGAGACTCTC TTCAAATACG TGATCGGGAC GGCGGAGCCG GAATACACGA CGGACTACGA CATCAACCTG ATCGGCGAGT ACAACATCGC CGGCGACCTC TGGGGGATGC TGCCGCTCTT CGACCGGCTC GGGATCCGCG TCCTCTCCTG CATCAGCGGC GACGCCAAGT TCGAGGACCT GCGCTACGCG CACCGCGCCA AACTGAACAT CATCGTCTGC TCCAAGAGCC TCACCAACCT CGCCAAGAAG ATGCAGAAGG TGTACGGCAT CCCGTACCTG GAGGAATCCT TCTACGGCAT GACCGACGTG GCCAAGGCGC TGCGTGACAT AGCGCGGGAA TTGGACGACC GGGTGAACGG CCTTGAGAAG CGGGTGATGC AGGACCGGGT GGAAAAGCTG ATCGCCGAGT GCGAGGAGAG CTGCCGCGCG GAGCTTGCTC CCTATCGGGA GCGGCTCGCC GGGAAGAAGG CGGTGCTCTT TACCGGGGGG GTGAAGACCT GGTCCATGGT GAACGCGCTG GCGGAGCTGG GGGTGGAGAT TCTCGCCGCC GGGACTCAGA ATTCGACGCT GGAAGACTTC TACCGCATGA AGGCGCTGAT GCACGAGGAC GCCTCCATCA TCGCCGACAC CAGTACTGCG GGCCTTCTGT CGGTCATGTA CGAAAAACTC CCCGACCTGA TCGTCGCGGG GGGGAAGACC AAATTCCTGG CGCTGAAGAC GAAGACGCCG TTTCTGGACA TAAACCACGG GCGGACGCAC CCTTACGCCG GTTACGCCGG AATGGTGACT TTTGCCAAGC AGCTCGACCT CACGGTGAAT AACCCGATCT GGCCGGTCCT GAACGAAAGA GCCCCCTGGG ACAAGGACGC CGAGGCGCAG AAAGCGGACC TGGCCTACGC CGCCGGGCAC GCCGATCGCT TCGCCGCCGA GGAGATCAAG GCCTCGCGGG TCAAGGTGCC GACCAAAAAC GCGACGGTAA ATCCGCAGAA GAACTCGCCC GCGCTGGGGG CGACGCTCGC CTATCTCGGC ATCGACGGGA TGCTGGGGCT TTTGCATGGC GCCCAGGGAT GCTCCACCTT CATCAGGCTG CAGCTCTCCC GGCATTTCAA GGAATCCATA GCGCTCAACT CCACCAGCAT GAGCGAGGAG ACCGCCATCT TCGGCGGCTG GGACAACCTG AGGATCGGCC TGAACCGGGT CATGGAGAAA TTCAAGCCTG AGGTGGTCGG GGTGATGACC ACCGGGCTCA CCGAGACCAT GGGGGACGAC GTCAGAAGCG CCATCGTCAA GTTCCGGGAG GCGCACCCGG AGCATGACGG GATTCCCGTG ATCCACGCCT CGACCCCGGA TTACTGCGGC TCGATGCAGG AGGGTTACGC CGCGGCAGTC GAGGCCATCG TGGCGACGGT GCCGGAAGGC GGCATCGGGA TACCTGGACA GGTAAGTATC CTTCCGGGTT GCCAGCTAAC TCCTGCCGAG GTTGAGGAAA TTGCGGAAAT CTGCGAGGCG TTCGGGCTGG ATCCGGTGGT GGTGCCCGAC ATCTCCAACG CGCTGGACGG GCACATCGAC GCGACCGTAT CGGCGCTCTC CATGGGGGGA GCCACCGTCG AGCGGATCAA GGCGGCGGGG CGGAGCGAGG CGACGCTTTA TTTCGGCGAT TCGCTTGCCG ACGCGGCCGG GGTGCTGCAG GAGAAATTCG GCATCCCGAG TTACGGCTTC ACCTCCGTCA CCGGACTCAA AGAGACGGAC CTTTTGATGA CCACGCTCTC GGCCTTGTCC GGGCGCCCGA TACCGGAGAA ATTCCGGCGC TGGAGAAGCC GCCTCACGGA CGCAATGATC GACAGCCACT ACCAGTTCGG GCAGAAGAAG GTGGCGCTGG CGCTCGAGGC GGACCACCTG AAGGGAATGA CGCACTTCCT TGCCGGCCTC GGTTGCGAGA TACAGGTCGC CATCGCCGCC ACGAGGACGC GGGGCTTGGA CCGGTTGCCG ACGGAAAACG TCTTCGTCGG GGACCTGGAG GACCTGGAAC AGGCGGCGGT GGGAGCCGAC CTGCTGGTTG CCAACTCCAA CGGGCGCCAA GCCGCAAAGA AGCTCGGTAT CAACGCGCAC CTGAGAACCG GAATGCCGGT ATTCGACCGG CTCGGGGCGC ACCAAAAGAT GTGGGTCGGC TATCGGGGGA CGCTGAATTT GGTCTTCGAG GTGGCGAACA TATTCCAGGC CAACGCCAAG GAAGCGCAGA AACTGGCGCA CAACTGA
|
Protein sequence | MAKPDWYDTT DCDTHDAGAP KFCKKSEPGE GTERSCAYDG ARVVLMPITD VIHLVHGPIA CAGNSWDNRG ARSSDSQLYR RGFTTEMLQN DVIFGGEKKL YRAIQELAAR YPEAKAIFVY ATCVTSMTGD DIEAVCKAAQ DKVSVPVIPV NTPGFIGDKN IGNRLAGETL FKYVIGTAEP EYTTDYDINL IGEYNIAGDL WGMLPLFDRL GIRVLSCISG DAKFEDLRYA HRAKLNIIVC SKSLTNLAKK MQKVYGIPYL EESFYGMTDV AKALRDIARE LDDRVNGLEK RVMQDRVEKL IAECEESCRA ELAPYRERLA GKKAVLFTGG VKTWSMVNAL AELGVEILAA GTQNSTLEDF YRMKALMHED ASIIADTSTA GLLSVMYEKL PDLIVAGGKT KFLALKTKTP FLDINHGRTH PYAGYAGMVT FAKQLDLTVN NPIWPVLNER APWDKDAEAQ KADLAYAAGH ADRFAAEEIK ASRVKVPTKN ATVNPQKNSP ALGATLAYLG IDGMLGLLHG AQGCSTFIRL QLSRHFKESI ALNSTSMSEE TAIFGGWDNL RIGLNRVMEK FKPEVVGVMT TGLTETMGDD VRSAIVKFRE AHPEHDGIPV IHASTPDYCG SMQEGYAAAV EAIVATVPEG GIGIPGQVSI LPGCQLTPAE VEEIAEICEA FGLDPVVVPD ISNALDGHID ATVSALSMGG ATVERIKAAG RSEATLYFGD SLADAAGVLQ EKFGIPSYGF TSVTGLKETD LLMTTLSALS GRPIPEKFRR WRSRLTDAMI DSHYQFGQKK VALALEADHL KGMTHFLAGL GCEIQVAIAA TRTRGLDRLP TENVFVGDLE DLEQAAVGAD LLVANSNGRQ AAKKLGINAH LRTGMPVFDR LGAHQKMWVG YRGTLNLVFE VANIFQANAK EAQKLAHN
|
| |