Gene GM21_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2638 
Symbol 
ID8137980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3075312 
End bp3078491 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content63% 
IMG OID644870242 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_003022432 
Protein GI253701243 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value0.409853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTGT CACGGAGGAA CTTTTTAAAG ATCTCAGGTG TCGGCGTGGC GGCGACCACC 
CTGGGCCTGA ACCTCGATCC GGTCGAGGCG AAAGCTCAGG ATCTCGCCAT CCAGCATGCC
AAGGAAACCA CGACCATCTG CCCCTACTGT TCCGTCGGGT GCGGCATGAT CGTGCACACC
CAGGGGGACA AGGTGATCAA CGTCGAAGGG GACCCGGACC ATCCCATCAA CGAAGGGGCC
CTCTGCCCCA AGGGTTCCTC CGTGTACCAA CTGCGCGACA ACAAGGCGCG CATCACCAAG
CCGATGTATC GGGCCGCCGG CGCCGCAACC TGGCAGGAGG TGACCTGGGA ATGGGCGCTC
GACCAGATAG CCAGAAAGAC CAAGGCGACC CGCGACGCCT CCTTCGTCCC CACCAGCAAG
ATAAAGGTCA AGGAGAAGGT CGCCGGGGTT GAGGTCGAGA AGGAGATCGA GGCCGTGGTG
AACCGCACCA TGGGGATCGC CTCGGTCGGC AGCGCCGCGC TGGACAACGA GGAATGCTAC
CTGTACCAAA AATTTTTAAG GGGGTTGGGC CTGGTGTACA TCGAACATCA GGCACGCATT
TGACACAGCG CCACTGTAGC GGCTCTGGCA GAGTCGTTTG GACGCGGTGC GATGACGAAC
CACTGGATAG ATTTCAAGAA CGCCGACGTA ATCCTCATCA TGGGCGCGAA CCCCGCCGAG
AACCACCCGG TCGCCTTCCG CTGGATCTTG AAGGCGAAGG AGTCGGGGGC GAAGGTGATC
TGCGTCGATC CCCGCTTCAC CCGCAGCGCC GCCAAGTCGG ACCTGTACGC GCAGCTTCGC
TCGGGGACCG ATATCGCCTT TCTGGGCGGA ATGATCAACC ACATCATGCA GAACAAGCTC
TACTTCGAGC AGTACGTCGC CGAGTACACC AACGCCTCCT ATCTGGTGAA CGGCGACTTC
AAGCTCCCGG GAGAACTGGA CGGCCTCTTC TCCGGCTACG ACCCGCAAAA GCGGAGCTAC
GACGTTAAAA GCTGGTCCTT CCAGAAGGAG GCCGACGACA GCATCAGGAA GGACCCTACC
CTGCAGGACC CGAACTGCGT GTTCCAGCTC CTGAAGAAGC ACTACAGCCG CTACACCCCG
GAGCTGGTCT CGAAGACCAC GGGGACGCCC AAGGAAAAGC TCCTGGAGGT GTACCAGCTC
TACGCCTCCA CCGGGAGGCC CGACCGCGCC GGCACCTCGC TTTACGCCAT GGGGTGGACC
CAGCACACGG TCGGCACCCA GAACATACGC ACCATGTCCA TCATCCAGCT CCTGTTGGGG
AACATCGGCG TCGCCGGAGG CGGGGTCAAC GCCCTTCGCG GCGAATCGAA CGTGCAGGGA
TCTACCGACC AGGGGCTCTT GTTCCACATC CTCCCCGGTT ACATGCCGGT CCCCTCGGCC
GAGCTCCCCA CCACGGCCGC CTACATCGAG AAGCACACCC CGAAGAGCAA GGACCCGCAG
AGCGCCAACT GGTGGGGGAA CCGCAACAAG TACCTGGTCA GCTACCTGAA GGCGATCTAC
GGCAACAACG CCACCAAGGA GAACGACTTC GGCTACAACT GGCTCCCGAA ACTGGACCCG
GGGATGAACG GCTCCTGGCT GATGATCTTC GACAACATGC TCAAGGGTAA GTTGAAGGGG
TTCTACGCCT GGGGGCAGAA TCCGGCCTGT TCGGGGGCTA ACTCCAACAA GGTGAGAAAC
GCCCTGGCCA AGCTCGACTG GATGGTGGCG GTCAACCTCT TCGACAACGA GACCGCCTCC
TTCTGGAAAG GACCGGGGAT GGACCCGGCC AAGGTGAAGA CCGAGGTCTT CTTCCTTCCG
GCGGCGGCAT CCTTCGAGAA GGAAGGCTCC ATCACCAACT CCGGGCGCTG GGCCCAGTGG
CGCTATCAGG CGGTGAAGCC CCTCGGGCAG AGCAAGCCCG ACGCCGAGAT CATGAACGAC
CTGTACCAGG CGATCAAGGG GCTCTACGCG AAAGAAGGAG GCGCGCTCCC CGAACAGCTC
TTGAAGCTGA CCTGGAACTA CGGCTTCAAA AGGGCCGACG GCAGCATCCG CTCCATCGAC
ATCCACCAGG TGGCCAAGGA GATCAACGGC TACTTCCTCC AGGACGTATC CGAGCCCGTG
AAGCCGCTAA AGCCCGGGCA AGCGCCGCCG AAACCGCGGG AGCCCTGGGA GGCGAAGAAG
CTTTTAGGCA AGAAGGGAGA ACTGGTGGGA GGTTTCGCCC AGCTTCAGGC CGACGGGAGC
ACCTCCTGCG GCTCGTGGAT CTTCAGCCAG AGCTACAACG AAAAGGGGAA CATGATGGCA
CGCCGCGGCA AGAAGGACCC GACCGGCCTC GGCATGTTCC CCGAGTGGAG CTGGTGCTGG
CCGGTGAACC GCCGCATCAT CTACAACCGC GCCTCGTGCG ACCCAAGCGG CAAACCGTTC
AACGTGAAAA AGGCGGTGGT CTACTGGAAC CCCCTGGCGG TGCAGCCCGG CGGCAAAATG
GGGGCCTGGG TGGGCGATGT CCCCGACGGC CCCTGGCCGC CTCTCGCCGC CGGAGCGGAG
GGGCGCAAAC CCTTCATCAT GAGGGCGGAC GGGGTGGCCG CGATCTTCGG CCCGGGGCTT
AAGGACGGCC CCTTCCCCGA GCACTACGAG CCGATGGAGT CGCCGCTGGC CCAGAACCTG
ATGTCCAAGC AGCTCAACAA CCCCGCCGTG AAGATCTTCA AGGACCCCGC GGTGAAGGAG
GACGTCTTGG CCAGCGCCGA CCCGCGCTTT CCGCTGGTCG CCACCACCTA CCGGGTCACC
GAGCACTGGC AGACCGGCGT CATGACCCGC AACACCCCTT GGCTATTGGA ACTGCAACCG
CGCCAGTTCT GCGAGATCAG CCAGGAGCTT GCCAAGGAGA AGGGAATCGC CAACGGCGAC
CTGGTGGAGG TAGCCTCGGC CCGCGGCAAG GTCGAGGCGG TGGCCATGGT GACCGTGCGC
ATGAAGCCGA TGAAGATAGG GGACAAGGTC GTGCACCAGA TCGGCCTCCC CTGGTGCTTT
GGCTGGCACA CCCCCGGCGT GGGAGACGCC GCCAACCTGC TGACCCCCAC CGCGGGGGAC
GCCAATACGA TGATTCCCGA GACCAAGGCG TTCATGGCCG GGATCGCGAG AAAGGGGTGA
 
Protein sequence
MEVSRRNFLK ISGVGVAATT LGLNLDPVEA KAQDLAIQHA KETTTICPYC SVGCGMIVHT 
QGDKVINVEG DPDHPINEGA LCPKGSSVYQ LRDNKARITK PMYRAAGAAT WQEVTWEWAL
DQIARKTKAT RDASFVPTSK IKVKEKVAGV EVEKEIEAVV NRTMGIASVG SAALDNEECY
LYQKFLRGLG LVYIEHQARI UHSATVAALA ESFGRGAMTN HWIDFKNADV ILIMGANPAE
NHPVAFRWIL KAKESGAKVI CVDPRFTRSA AKSDLYAQLR SGTDIAFLGG MINHIMQNKL
YFEQYVAEYT NASYLVNGDF KLPGELDGLF SGYDPQKRSY DVKSWSFQKE ADDSIRKDPT
LQDPNCVFQL LKKHYSRYTP ELVSKTTGTP KEKLLEVYQL YASTGRPDRA GTSLYAMGWT
QHTVGTQNIR TMSIIQLLLG NIGVAGGGVN ALRGESNVQG STDQGLLFHI LPGYMPVPSA
ELPTTAAYIE KHTPKSKDPQ SANWWGNRNK YLVSYLKAIY GNNATKENDF GYNWLPKLDP
GMNGSWLMIF DNMLKGKLKG FYAWGQNPAC SGANSNKVRN ALAKLDWMVA VNLFDNETAS
FWKGPGMDPA KVKTEVFFLP AAASFEKEGS ITNSGRWAQW RYQAVKPLGQ SKPDAEIMND
LYQAIKGLYA KEGGALPEQL LKLTWNYGFK RADGSIRSID IHQVAKEING YFLQDVSEPV
KPLKPGQAPP KPREPWEAKK LLGKKGELVG GFAQLQADGS TSCGSWIFSQ SYNEKGNMMA
RRGKKDPTGL GMFPEWSWCW PVNRRIIYNR ASCDPSGKPF NVKKAVVYWN PLAVQPGGKM
GAWVGDVPDG PWPPLAAGAE GRKPFIMRAD GVAAIFGPGL KDGPFPEHYE PMESPLAQNL
MSKQLNNPAV KIFKDPAVKE DVLASADPRF PLVATTYRVT EHWQTGVMTR NTPWLLELQP
RQFCEISQEL AKEKGIANGD LVEVASARGK VEAVAMVTVR MKPMKIGDKV VHQIGLPWCF
GWHTPGVGDA ANLLTPTAGD ANTMIPETKA FMAGIARKG