Gene GM21_2143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2143 
Symbol 
ID8137479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2500418 
End bp2501881 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content61% 
IMG OID644869758 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_003021953 
Protein GI253700764 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value0.305963 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCAG AAGCTGCCGT GAAATACGTT ACCGAGATTC CGGAAGAGGA AGTGAAACGG 
GTCTCCGCGT GGATCAATAC GGAGGAGTAC AAGGAAAAGA ACTTCGCCCG CCAGGCGCTG
GTGATCAACC CGCCGCACGC TTGCCAGCCG CTGGGCGCGG AACTCGCCGC GCACGGCTTC
GAGGGGACGC TTCCCTTCGT GCACGGATCT CAAGGGTGCG CCTCGTACTA CCGCTCCACC
TTCAACCGCC ACTTCCGCGA GCCGGCTCCG GCCGTCTCTG ACTCTATGAC CGAGGACGGC
GCGGTGTTCG GAGGACAGAA CAACCTGCAC GAGGCGCTGG AGAACGCCTA CACCATCTAC
AAACCGAAGA TGATGGCGGT CTTCACCTCC TGCATGCCTG AGGTCATCGG CGACGACCTG
ACCGCGTTCA TCAAGAACGC GAGGAACAAG GAGATCGTGC CGCAGGATTA CCCGCTCCCC
TACGCCAACA CCCCGAGCTT CAACGGCTCG CACGTTCACG GCTACGACGC CATGCTCCTC
TCCATCTTGC AGTCGCTGAC CGAGGGGAAA AAAGTCGAAG GGCGCTGCAC CGGGAAGCTG
AACCTGATCC CGGGCTTTGA CTGCAACACC GGAAACTACC GGGAGTACAA GAGGATCCTG
AAGGAATTCG GCATCCCCTA CACGCTTTTG GCCGACATCT CCGACACATT CGATTCGCCC
CTGGACGGCA CCTACCGTCC CTATCCCGGC GGCACCAAAC TCGAAGACGC GGCTGACTCC
ATCAACGGCA AGGTCACTCT GACCGTGGCG CCTTTTTCCA GCGCCAAGAC CTTTACCTGG
ATCAAGGACA ACTACTCAGG AACCCACGTC TCGCTGCCGA CCCCGTTCGG GGTTGCCAAG
ACCGACGCCC TGCTTTTGAA GCTTTCTGAG CTCTTCGGCA AGCCGGTGCC CGAGTCACTC
AAGGCCGAGC GCGGCTACGC CGTCGACGCC ATGACCGACG CGCACCAGTA CATCCACGGC
AAGAAGTTCG CACTTTACGG CGATCCGGAC TACCTGATCG GGTACGTTTC CTTCCTGCTG
GAGATGGGTG CCAAGCCGTA CCACATCCTT TGCAGCAAGG GGAGCAAGAA GGTCGAGAAA
GAGCTTCAGG CGCTTCTGGA CGCCTCCCCC GACGGCAAGG GGTGCAAGAT CTACATGGGC
AAGGATCTCT GGCACATGAG GAGCCTGTTG GTGACCGACC CGGTCGACGC CATGATCGGC
GACACGCATG GTAAGTTCGC CGCGCGCGAC GCAGGGATAC CGCTCTTCCG GTTCGGCTTC
CCCGTCTTCG ACCGCGTCAA CCTGCACCGC TCGCCTCTGA TCGGCTACCA GGGCGCCATC
AACATGCTGA CCGCCATCTG CAACAAGTTC ATCGAACTGC GTGACGAGAC CTGCGAGGAT
CGGCACTTCG AGATGATGAG ATAG
 
Protein sequence
MTAEAAVKYV TEIPEEEVKR VSAWINTEEY KEKNFARQAL VINPPHACQP LGAELAAHGF 
EGTLPFVHGS QGCASYYRST FNRHFREPAP AVSDSMTEDG AVFGGQNNLH EALENAYTIY
KPKMMAVFTS CMPEVIGDDL TAFIKNARNK EIVPQDYPLP YANTPSFNGS HVHGYDAMLL
SILQSLTEGK KVEGRCTGKL NLIPGFDCNT GNYREYKRIL KEFGIPYTLL ADISDTFDSP
LDGTYRPYPG GTKLEDAADS INGKVTLTVA PFSSAKTFTW IKDNYSGTHV SLPTPFGVAK
TDALLLKLSE LFGKPVPESL KAERGYAVDA MTDAHQYIHG KKFALYGDPD YLIGYVSFLL
EMGAKPYHIL CSKGSKKVEK ELQALLDASP DGKGCKIYMG KDLWHMRSLL VTDPVDAMIG
DTHGKFAARD AGIPLFRFGF PVFDRVNLHR SPLIGYQGAI NMLTAICNKF IELRDETCED
RHFEMMR