Gene GM21_2321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2321 
Symbol 
ID8137661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2698708 
End bp2700957 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content62% 
IMG OID644869935 
Productmalic enzyme 
Protein accessionYP_003022127 
Protein GI253700938 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID[TIGR00651] phosphate acetyltransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones119 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAA AATTGGGTGC CCTTGATTAC CACTCAAGCG GCAGAAAAGG TAAGATCGAA 
GTCATCGCCA CCAAGCCGTG CCAGACCGCA GCCGACCTGT CCCTCGCTTA CTCTCCGGGA
GTCGCGGAAC CCTGCCTGGC TATCCAGCAG AACCCGGACG ACGCCTACAA GTACACCGCG
AAGGGGAACC TCGTGGCGGT CGTCTCCAAC GGCACCGCGG TGCTGGGCCT CGGCAACCTG
GGCGCCCTCG CCGGCAAGCC AGTCATGGAG GGGAAAGGGG TCCTCTTCAA GCGTTTCGCC
GACATCGACG TCTTCGACAT CGAGCTGAAC ACCGAGAACC CCGACGAGAT CATAAGAGCA
TGCCAGCTCC TGGAGCCGAC CTTCGGCGGC ATCAACCTGG AGGACATCAA GGCGCCCGAG
TGCTTCTATA TCGAGGAAGA GCTCAAGAAG ACCATGAACA TCCCGGTCTT CCACGACGAC
CAGCACGGCA CAGCCATCAT CTGCTCGGCG GCCCTTCTGA ACGCCCTGAT GCTGGTGCAA
AAGAAGATCG AGGACATAAG GATCGTCGTC AACGGCGCCG GCGCCTCGGC CAACTCCTGC
GCCAAGCTCG CCATCGCGCT CGGGGTGAAA CCCAACAACA TGATCATGTG CGACACCAAG
GGGGTCATCT ACAAGGGGCG CGTCGAGGGG ATGAACCCCT ACAAGGAACT CTTCGCAGCC
GAGACCCACT TCCGCACCCT GGAGGAGGCG GCGGTAGGGG CCGACGTCCT TTTCGGCCTC
TCGGCCAAGG GGGCCTTCAC CCCGGAGATG GTCCGCTCCA TGGCGCCGAA CCCGATCATC
TTCGCCATGG CGAACCCCGA CCCGGAAATC ACCCCGGAAG AGGCGCACGC GGTGCGCGGC
GACGTGATCA TCGCGACCGG CAGGAGCGAC TACGCCAACC AGGTCAATAA CGTCCTCGGC
TTCCCCTTCA TCTTCCGCGG TGCGCTCGAC GTGCGCGCCA CGGCCATCAA CGAGGAGATG
AAGCTCGCCG CAGTGCACGC CCTGGCGAAA CTCGCGCGCG AGGAGGTGCC CGACTCCGTC
AGCAAGGCCT ACGGCAACGA GAAGTTCAGC TTCGGCCCGA GCTACATCAT CCCCAAACCC
TTCGACCCGC GCGTGCTTTT GCACGTGGCC CCGGCGATCG CCCAGGCCGC CATGGACACG
GGGGTCGCGC GCATGCCGAT CGCCGACATG GCCAAGTACA TCGAGCAGCT CGAGTCCTCG
CAGGGCAAGT CCAAAGAGAT CATGCGCATG ATCATCAATA AGGCGAAGAG CGATCCGAAG
AAGGTGGTCT TTTCCGAGGG AGAGGACGAC AAGATCCTGC GCGCGGCACA GGTGCTGGTC
GAGGAGGGTA TCGCCCAGCC GATCCTGATC GGAGACCAGA AGAAGATCAA GCAGAAGATG
GACGATCTCA ACCTGGACCT CGATGTGCCG ATCATGGACC CGTCCGACTC CGAGCTCACC
GAGGAATACG CCGCGGAGCT CTACCGCTTA AGGCAAAGAA AAGGGCTCAC CATTTCCGAG
TGCCGCCGCA TCATGCGGCG CAAGTCGCGG GCACATTTCG GCAACATGAT GGTGCACATG
GGGCACGCCG ACGCGCTCCT GGGCGGGATC GACACCCACT ACCCGGAAAC CATCCGCCCC
GCGCTGCAGG TCTTAGGCAA GCAGGAGGGG CTCTCCAGCG TGCACGGCCT CTACATGATG
GTGTCCAAAA AGAACGTCTA CTTCCTGGCC GACACCACGG TGACCATCGA TCCGACCGCG
GAGGAGCTGG CCGAGACGGC CATTCTCGCC GCCGAGATGG TCCACAAGCT CGACATCGAG
CCCCGCGTCG CCATGCTCTC CTTCTCCAAC TTCGGCTCGG TGGACCACCC GCAGACGCGC
AAGGTCAAGC GCGCCGTCGA GATCGTCAAG GAGCGCGCCC CCAACCTGAT AGTCGAGGGG
GAGATGCAGG CCGATACCGC CGTGGTCCCC GATCTTCTGG ACGGCTTCAC CTTCTCGAAA
CTGAAGACCC CGGCCAACAT CCTGATCTTC CCCGACCTCA ACTCCGGGAA CATCTGCTAC
AAGCTTTTGC ACCACCTGGG CGGCGCCGAG GCGATAGGCC CGATCCTCAT GGGGATGAAC
AAGCCGGTTC ACGTACTGCA GCGCGGCGAC GACGTCAACG ACATCGTCAA CATGGCCGCC
ATCGCCGTGG TGGACGTACA GAACCTGTAA
 
Protein sequence
MSKKLGALDY HSSGRKGKIE VIATKPCQTA ADLSLAYSPG VAEPCLAIQQ NPDDAYKYTA 
KGNLVAVVSN GTAVLGLGNL GALAGKPVME GKGVLFKRFA DIDVFDIELN TENPDEIIRA
CQLLEPTFGG INLEDIKAPE CFYIEEELKK TMNIPVFHDD QHGTAIICSA ALLNALMLVQ
KKIEDIRIVV NGAGASANSC AKLAIALGVK PNNMIMCDTK GVIYKGRVEG MNPYKELFAA
ETHFRTLEEA AVGADVLFGL SAKGAFTPEM VRSMAPNPII FAMANPDPEI TPEEAHAVRG
DVIIATGRSD YANQVNNVLG FPFIFRGALD VRATAINEEM KLAAVHALAK LAREEVPDSV
SKAYGNEKFS FGPSYIIPKP FDPRVLLHVA PAIAQAAMDT GVARMPIADM AKYIEQLESS
QGKSKEIMRM IINKAKSDPK KVVFSEGEDD KILRAAQVLV EEGIAQPILI GDQKKIKQKM
DDLNLDLDVP IMDPSDSELT EEYAAELYRL RQRKGLTISE CRRIMRRKSR AHFGNMMVHM
GHADALLGGI DTHYPETIRP ALQVLGKQEG LSSVHGLYMM VSKKNVYFLA DTTVTIDPTA
EELAETAILA AEMVHKLDIE PRVAMLSFSN FGSVDHPQTR KVKRAVEIVK ERAPNLIVEG
EMQADTAVVP DLLDGFTFSK LKTPANILIF PDLNSGNICY KLLHHLGGAE AIGPILMGMN
KPVHVLQRGD DVNDIVNMAA IAVVDVQNL