Gene GM21_2144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2144 
Symbol 
ID8137480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2502134 
End bp2503576 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content62% 
IMG OID644869759 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_003021954 
Protein GI253700765 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.683896 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGG CACCAATAAA GAAGGTAGAA GGAATCACGA AGGAATCTAC CCAGGCGATG 
ATCGACCACG CCCTGGAGAT CTATCCAGAG AAGGCCAAGA AGAAGAGGGC GCCGCACTTG
GCCCCCAACG AGGGGGAAGG GGCCTGCGTG AAGAGCAACC GCAAGACCGT CCCGGGCATC
ATGAGCGCCC GCGGCTGCGC CTACGCAGGG GCCAAGGGGG TGGTCTGGGG CCCGATCCGC
GACATGGTTC ACGTCTCCCA TGGCCCGGTC GGCTGCGGCT GGTACTCCTG GGGGACCCGT
CGCAACCTGG TCACCGGCGT GAACGGCGTC AGCCAGTTCG CCATGCAGTT CACCTCCGAC
TTCCAGGAAA AGGACATCGT CTACGGCGGC GACAAGAAGC TGAAGACCCT GCTGGCCGAG
GCGCACGACC TGTTCCCGCT GGCCAAAGGG ATCTCGGTCC TTTCCGAGTG CCCGGTCGGC
CTCATCGGCG ACGACATCAA CTCCGTGGCG AAGATCGCCT CAAAGGAGCT GGACATCCCG
GTCATCCCCT GCAACTGCGA GGGGTTCCGC GGCGTGTCCC AGTCGCTTGG CCACCACATC
TCCAACGACA CCATCCGCGA CCACATCATC GGCACCCGGG AATTCGCCGA GCCGGAGACA
CCGTACGACA TCGCCCTCAT CGGCGACTAC AACATCGGCG GCGACGTCTG GAGCGTTAAA
CCTCTGCTTG AAGAAATAGG GTTGAACGTC AAGGCGGTAT GGACCGGCGA CGGCGAATTG
GAAAAGATCG CAGCCACCCA CAAGGTGAAG CTGAACCTGA TCCACTGCTA CCGCTCCATG
AACTACATGT GCCGCGTCAT GGAAGAGAAG TACGGCATCC CGTGGCTGGA GTTCAACTTC
TTCGGCCCGA CCAAGATCCG CGAGAGCCTG AGGAAAATCG CCGAGTACTT CGACGACAGC
ATCAAGGAGA AGGTGGAAAA GGCGATCGCC AAGTACGACC CGATCATGCA GGCGGTGATC
GACGAGTACC GTCCGCGCCT GGAAGGAAAG AAAGTGATGC TCTACGTCGG CGGCTTGCGC
CCCCGTCACA CGGTGAACGC CTACGCCGAC CTCGGCATGA CCGTGGTCGG TTCCGGCTAC
GAGTTCGCCC ACGGCGACGA CTACGAGAGG ACCTCGGTGG AGATGCCTGA GGCGACCGTC
ATCTACGACG ACGCCTCCGA GCATGAGCTG GAGCAGTTCG TGGACGAGCT TCGTCCCGAC
CTGGTCGGTT CGGGGATCAA GGAGAAGTAC CTGTTCCAGA AGATGGGGAT CCCGTTCCGC
CAGATGCACA GCTGGGACTA CTCCGGACCG TACCACGCCT ACAACGGCTT CCCGATCTTC
GCCCGCGACG TCGACATGGC CGTCAACAGC CCGACCTGGA AGCTGGTGAA GGCGCCCTTC
TAG
 
Protein sequence
MSEAPIKKVE GITKESTQAM IDHALEIYPE KAKKKRAPHL APNEGEGACV KSNRKTVPGI 
MSARGCAYAG AKGVVWGPIR DMVHVSHGPV GCGWYSWGTR RNLVTGVNGV SQFAMQFTSD
FQEKDIVYGG DKKLKTLLAE AHDLFPLAKG ISVLSECPVG LIGDDINSVA KIASKELDIP
VIPCNCEGFR GVSQSLGHHI SNDTIRDHII GTREFAEPET PYDIALIGDY NIGGDVWSVK
PLLEEIGLNV KAVWTGDGEL EKIAATHKVK LNLIHCYRSM NYMCRVMEEK YGIPWLEFNF
FGPTKIRESL RKIAEYFDDS IKEKVEKAIA KYDPIMQAVI DEYRPRLEGK KVMLYVGGLR
PRHTVNAYAD LGMTVVGSGY EFAHGDDYER TSVEMPEATV IYDDASEHEL EQFVDELRPD
LVGSGIKEKY LFQKMGIPFR QMHSWDYSGP YHAYNGFPIF ARDVDMAVNS PTWKLVKAPF