Gene GM21_2142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2142 
Symbol 
ID8137478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2497421 
End bp2500177 
Gene Length2757 bp 
Protein Length918 aa 
Translation table11 
GC content63% 
IMG OID644869757 
Productbifunctional nitrogenase molybdenum-cofactor biosynthesis protein NifE/NifN 
Protein accessionYP_003021952 
Protein GI253700763 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.0853728 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGC CAGACTGGTA CGACACAACC GATTGCGACA CTCACGATGC CGGGGCTCCG 
AAATTCTGTA AAAAGTCGGA ACCGGGCGAG GGAACGGAAC GAAGCTGCGC CTACGACGGC
GCCCGCGTGG TCCTCATGCC CATTACCGAC GTCATCCACC TGGTGCACGG CCCCATCGCC
TGCGCCGGAA ACTCCTGGGA CAACCGCGGG GCTCGCTCCT CGGATTCCCA GCTCTACCGC
CGCGGCTTCA CTACGGAGAT GCTGCAAAAC GACGTGATCT TCGGCGGCGA AAAGAAGCTC
TACCGCGCCA TCCAGGAACT GGCGGCACGC TACCCGGAGG CGAAGGCGAT CTTCGTCTAC
GCCACCTGCG TTACCTCCAT GACCGGCGAC GACATCGAGG CGGTCTGCAA AGCGGCACAG
GACAAGGTGA GCGTCCCGGT GATCCCGGTG AACACGCCCG GCTTCATCGG GGACAAGAAC
ATCGGCAACC GCCTGGCCGG AGAGACTCTC TTCAAATACG TGATCGGGAC GGCGGAGCCG
GAATACACGA CGGACTACGA CATCAACCTG ATCGGCGAGT ACAACATCGC CGGCGACCTC
TGGGGGATGC TGCCGCTCTT CGACCGGCTC GGGATCCGCG TCCTCTCCTG CATCAGCGGC
GACGCCAAGT TCGAGGACCT GCGCTACGCG CACCGCGCCA AACTGAACAT CATCGTCTGC
TCCAAGAGCC TCACCAACCT CGCCAAGAAG ATGCAGAAGG TGTACGGCAT CCCGTACCTG
GAGGAATCCT TCTACGGCAT GACCGACGTG GCCAAGGCGC TGCGTGACAT AGCGCGGGAA
TTGGACGACC GGGTGAACGG CCTTGAGAAG CGGGTGATGC AGGACCGGGT GGAAAAGCTG
ATCGCCGAGT GCGAGGAGAG CTGCCGCGCG GAGCTTGCTC CCTATCGGGA GCGGCTCGCC
GGGAAGAAGG CGGTGCTCTT TACCGGGGGG GTGAAGACCT GGTCCATGGT GAACGCGCTG
GCGGAGCTGG GGGTGGAGAT TCTCGCCGCC GGGACTCAGA ATTCGACGCT GGAAGACTTC
TACCGCATGA AGGCGCTGAT GCACGAGGAC GCCTCCATCA TCGCCGACAC CAGTACTGCG
GGCCTTCTGT CGGTCATGTA CGAAAAACTC CCCGACCTGA TCGTCGCGGG GGGGAAGACC
AAATTCCTGG CGCTGAAGAC GAAGACGCCG TTTCTGGACA TAAACCACGG GCGGACGCAC
CCTTACGCCG GTTACGCCGG AATGGTGACT TTTGCCAAGC AGCTCGACCT CACGGTGAAT
AACCCGATCT GGCCGGTCCT GAACGAAAGA GCCCCCTGGG ACAAGGACGC CGAGGCGCAG
AAAGCGGACC TGGCCTACGC CGCCGGGCAC GCCGATCGCT TCGCCGCCGA GGAGATCAAG
GCCTCGCGGG TCAAGGTGCC GACCAAAAAC GCGACGGTAA ATCCGCAGAA GAACTCGCCC
GCGCTGGGGG CGACGCTCGC CTATCTCGGC ATCGACGGGA TGCTGGGGCT TTTGCATGGC
GCCCAGGGAT GCTCCACCTT CATCAGGCTG CAGCTCTCCC GGCATTTCAA GGAATCCATA
GCGCTCAACT CCACCAGCAT GAGCGAGGAG ACCGCCATCT TCGGCGGCTG GGACAACCTG
AGGATCGGCC TGAACCGGGT CATGGAGAAA TTCAAGCCTG AGGTGGTCGG GGTGATGACC
ACCGGGCTCA CCGAGACCAT GGGGGACGAC GTCAGAAGCG CCATCGTCAA GTTCCGGGAG
GCGCACCCGG AGCATGACGG GATTCCCGTG ATCCACGCCT CGACCCCGGA TTACTGCGGC
TCGATGCAGG AGGGTTACGC CGCGGCAGTC GAGGCCATCG TGGCGACGGT GCCGGAAGGC
GGCATCGGGA TACCTGGACA GGTAAGTATC CTTCCGGGTT GCCAGCTAAC TCCTGCCGAG
GTTGAGGAAA TTGCGGAAAT CTGCGAGGCG TTCGGGCTGG ATCCGGTGGT GGTGCCCGAC
ATCTCCAACG CGCTGGACGG GCACATCGAC GCGACCGTAT CGGCGCTCTC CATGGGGGGA
GCCACCGTCG AGCGGATCAA GGCGGCGGGG CGGAGCGAGG CGACGCTTTA TTTCGGCGAT
TCGCTTGCCG ACGCGGCCGG GGTGCTGCAG GAGAAATTCG GCATCCCGAG TTACGGCTTC
ACCTCCGTCA CCGGACTCAA AGAGACGGAC CTTTTGATGA CCACGCTCTC GGCCTTGTCC
GGGCGCCCGA TACCGGAGAA ATTCCGGCGC TGGAGAAGCC GCCTCACGGA CGCAATGATC
GACAGCCACT ACCAGTTCGG GCAGAAGAAG GTGGCGCTGG CGCTCGAGGC GGACCACCTG
AAGGGAATGA CGCACTTCCT TGCCGGCCTC GGTTGCGAGA TACAGGTCGC CATCGCCGCC
ACGAGGACGC GGGGCTTGGA CCGGTTGCCG ACGGAAAACG TCTTCGTCGG GGACCTGGAG
GACCTGGAAC AGGCGGCGGT GGGAGCCGAC CTGCTGGTTG CCAACTCCAA CGGGCGCCAA
GCCGCAAAGA AGCTCGGTAT CAACGCGCAC CTGAGAACCG GAATGCCGGT ATTCGACCGG
CTCGGGGCGC ACCAAAAGAT GTGGGTCGGC TATCGGGGGA CGCTGAATTT GGTCTTCGAG
GTGGCGAACA TATTCCAGGC CAACGCCAAG GAAGCGCAGA AACTGGCGCA CAACTGA
 
Protein sequence
MAKPDWYDTT DCDTHDAGAP KFCKKSEPGE GTERSCAYDG ARVVLMPITD VIHLVHGPIA 
CAGNSWDNRG ARSSDSQLYR RGFTTEMLQN DVIFGGEKKL YRAIQELAAR YPEAKAIFVY
ATCVTSMTGD DIEAVCKAAQ DKVSVPVIPV NTPGFIGDKN IGNRLAGETL FKYVIGTAEP
EYTTDYDINL IGEYNIAGDL WGMLPLFDRL GIRVLSCISG DAKFEDLRYA HRAKLNIIVC
SKSLTNLAKK MQKVYGIPYL EESFYGMTDV AKALRDIARE LDDRVNGLEK RVMQDRVEKL
IAECEESCRA ELAPYRERLA GKKAVLFTGG VKTWSMVNAL AELGVEILAA GTQNSTLEDF
YRMKALMHED ASIIADTSTA GLLSVMYEKL PDLIVAGGKT KFLALKTKTP FLDINHGRTH
PYAGYAGMVT FAKQLDLTVN NPIWPVLNER APWDKDAEAQ KADLAYAAGH ADRFAAEEIK
ASRVKVPTKN ATVNPQKNSP ALGATLAYLG IDGMLGLLHG AQGCSTFIRL QLSRHFKESI
ALNSTSMSEE TAIFGGWDNL RIGLNRVMEK FKPEVVGVMT TGLTETMGDD VRSAIVKFRE
AHPEHDGIPV IHASTPDYCG SMQEGYAAAV EAIVATVPEG GIGIPGQVSI LPGCQLTPAE
VEEIAEICEA FGLDPVVVPD ISNALDGHID ATVSALSMGG ATVERIKAAG RSEATLYFGD
SLADAAGVLQ EKFGIPSYGF TSVTGLKETD LLMTTLSALS GRPIPEKFRR WRSRLTDAMI
DSHYQFGQKK VALALEADHL KGMTHFLAGL GCEIQVAIAA TRTRGLDRLP TENVFVGDLE
DLEQAAVGAD LLVANSNGRQ AAKKLGINAH LRTGMPVFDR LGAHQKMWVG YRGTLNLVFE
VANIFQANAK EAQKLAHN