Gene GM21_1955 details


Gene Information

Locus tag: GM21_1955
Symbol:
ID: 8137289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2265925
End bp: 2268954
Gene Length: 3030 bp
Protein Length: 1009 aa
Translation table: 11
GC content: 62%
IMG OID: 644869569
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_003021766
Protein GI: 253700577
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
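
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2265925, 2268954)
print(length, protein_length_aa(length))  # 3030 1009
```

Both results agree with the Gene Length and Protein Length fields of the record.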


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 127
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTTT CACGGAGGGA GTTTCTGCAG GGAGGGGCGC TGGCTACGGC ACTGGTCCTT 
TCCGGCAAAA AAGCTGAAGC GGGCGGAGCC GACGCCCCGC AGATGCGCAC CAAGGGGCTG
AAAAGCTCGA CCACCATCTG CCCTTTCTGC GCGGTAGGTT GCGGGCTTGT GGTGCATACC
AAGAACGGCA AGATCGTCAA CATCGAGGGA GACATCCAGC ATCCGATCAA CCAGGGTGCG
CTCTGCTCGA AGGGTAGCGC GCTGTTCCAG GTCGCCAACA ACGAGCGCCG CCTGCAGAAG
GTCATGTACC GTGCTCCCGG CAGCGACAAG TTCGAAGAGA AGAGCTGGGA CTGGGCGCTG
GAGCGGATCA GCCAGAAGAT GAAGGAGACC CGCGACAAGA GCTTCAAGGC CAAAGAGATC
AACAAGAAGG ACAACAAGGA ATACGTGGTG AACCGCACCG AAGGGATGGC TTTCCTGGGG
GGCGCCGGTC TCGACAACGA GGAGTGCTAC CTCTGGAGCA AGTTCGCCCG CTCCATGGGG
GTCGCGAACC TCGAGCACCA AGCCCGAATA TGACACTCCG CTACAGTCGC CGGTCTGGCG
GCTTCGTTTG GACGTGGGGC AATGACCAAC CACTGGATTG ACCTGAAAAA CGCCGATGCC
ATCCTGGCCA TCGGTTGCAA CCCCGCTGAG AACCACCCGA TCTCGATGAA GTGGATCGAG
GCGGCCATGG ATAACGGCGG CAAGCTTTTA GCCGTCGACC CGCGCTTCAC CAGGACCGGT
TCCAAGGCCG ACCACTACGC GCAGATCCGT CCCGGCACCG ACATCGCCTT CCTGGGCGGG
ATGATCAACT TCGCGCTGCA GAACAACATG ATCCACGAGG AGTACGTCCG CGAGTACACC
AACGCCTCCT TCATCGTCAA CGACAAGTAC GACTTCAACG AGGGTATCTT CTGCGCCTTC
GACGACCAGG AGAAGACCTA CGACTCGAAG GCCTGGGCCT ACGCGCTCGA CGGCTCCGGC
AACGCCAAGC GCGACAAGTC GCTTAAGGAC CCGCGCTGCG TCTATCAGCT GCTGAAGAAG
CACTACTCCC GCTACACCGT CGACATGGTC TGCTCTATCA CCGGCACCAA GAAGGAAGAG
TACGTCGCCG TCGCCAAGGC TTTCTGCTCC ACCGGGCGCG CCGACAAGGC AGGCACCATC
CTCTACGCGA TGGGTATCAC CCAGTCCACC CACGGCACCC AGAACGTGCG CGCGGTAGCC
ATGCTGCAGA TGCTTCTGGG CAACATCGGC ATCGCCGGCG GCGGCGTCAA CGCGCTGCGC
GGCGAGAGCA ACGTGCAAGG CTCGACCGAC TACGGCCTGC TCTTCCACAT CCTGCCGGGC
TACCTGAAGT CCCCTGAGAT CGACAACGTG GACCTGAAGG CATACCTGGA GAAGTGGACC
CCGAAGACCA AGGACGCAAA GAGCGCCAAC TGGTGGGGGA ACACCCCGAA GTACACCGTA
AGCCTCCTGA AGGCCTGGTA CGGCGACAAC GCCAGCGCGG AAAACGGCTT CTGCTACGAC
TACCTCCCCA AGAGAAGCGG CAACTACTCC TTCATGAAAC TGATGGAGAA GATGGGCAAT
GGGGAACTGC AGGGTCTCGT CTGCATGGGG CAGAACCCGG CCGTAGGCGG CCCGGACTCC
CTGAAGACCC GCGAGGCGCT GGGGAAACTC GACTGGCTCG TCACCGTGGA CCTCTGGGAG
ACCGAGACCT CCATCTTCTG GAAGCGCCCC GGCGTTAACC CCAAGGAGAT CAAGACCGAG
GTCTTCATGC TGCCCGCCGC CTCCTCCGTC GAGAAGGAAG GCTCCATCTC CAACTCCGGC
CGCTGGGCCC AGTGGCGCTA CAAGGCCGCC GAACCGGTAG GGGAGGCGAA GAGCGACCTC
TGGATCATCG ACAAGTTCTT CAAGGGGATG AAGAAGGCCT ACGAGAAAGG GGGCGCGTTC
CCCGAGCCGA TCACCAAGCT CTCCTGGAAC TACGGCAACC ACGAGGAGCC CGATGTCCAC
CTGGTCGCCA AGGAGATCAA CGGCTACTTC ACCAAGGACA TGACCATCGT CGACAAGGAC
AAGACCCTGG AGTTCAAGAA GGGGGATCAG GTACCGATGT TCAAGTACCT GCAGGACGAC
GGCTCCACCG TCTCCGGCAA CTGGATCTAC TGCGGCTCCT ACACAAAAGA CGGGAACCTG
ATGGCCCGCC GCGACGAAAC CGACCCGACC GGGCTCGGCT TGTTCCCGAA GTGGTCCTGG
TGCTGGCCGG TCAACCGCCG CATCATCTAC AACAGGGGCT CCGTCAACCC CGACGGCGCG
CCGTTCAACC CCAAGCGCGC GGTCATCGCC TGGGACGCGC TGGAGAAGAA GTGGAAAGGG
GACGTACCCG ACGGCCCCTG GCCGCCGATG AACGACGCGA AGGAAGGGAA GTACCCCTTC
ATCATGCTGC CGGAAGGGCA CGGCCGCCTC TATGCGCTGG ACCTGAAGGA CGGTCCTTTC
CCGGAGCACT ACGAGCCTAT CGAGAGCCCG GCCAGGAACC TCATGTCCAA GGTGCAGAGC
AACCCCGCGG TCAAGGTCCC GGCCAACATG TCGAGCGACC TTAACAAGTA CCCGTTCGTC
GGCACCACCT ACAGGATGAC CGAGCACTGG CAGGCAGGGG CCATGACCCG CAGCCTGCCG
TGGCTGGTGG AGCTGGTCCC GACCATGTTC GTCGAGATCT CGCAGACGCT CGCCTCCTCC
AAGGGGATCA ACAACGGCGA CCAGGTGAGG ATCAGCACCG AGCGCGGTTC CATCGAGGCG
AAGGCCCTGG TCACCTCCAG GCTGAAGCCG TTCAACGTGC AGGGAAAAAT GGTGGAGCAG
GTGGGCCTTC CCTGGCACTT CGGTTATGCA GGTCTTGCTA CCGGCGACTC GGGCAACGTC
CTGACCCCGT CCGTCGGCTG CGCGAATACG AGCATCCCCG AGTTCAAGGC ATTCCTCTGC
AACATCGAGA AAGGGGGTAA ACGCGCATGA
 
Protein sequence
MAVSRREFLQ GGALATALVL SGKKAEAGGA DAPQMRTKGL KSSTTICPFC AVGCGLVVHT 
KNGKIVNIEG DIQHPINQGA LCSKGSALFQ VANNERRLQK VMYRAPGSDK FEEKSWDWAL
ERISQKMKET RDKSFKAKEI NKKDNKEYVV NRTEGMAFLG GAGLDNEECY LWSKFARSMG
VANLEHQARI UHSATVAGLA ASFGRGAMTN HWIDLKNADA ILAIGCNPAE NHPISMKWIE
AAMDNGGKLL AVDPRFTRTG SKADHYAQIR PGTDIAFLGG MINFALQNNM IHEEYVREYT
NASFIVNDKY DFNEGIFCAF DDQEKTYDSK AWAYALDGSG NAKRDKSLKD PRCVYQLLKK
HYSRYTVDMV CSITGTKKEE YVAVAKAFCS TGRADKAGTI LYAMGITQST HGTQNVRAVA
MLQMLLGNIG IAGGGVNALR GESNVQGSTD YGLLFHILPG YLKSPEIDNV DLKAYLEKWT
PKTKDAKSAN WWGNTPKYTV SLLKAWYGDN ASAENGFCYD YLPKRSGNYS FMKLMEKMGN
GELQGLVCMG QNPAVGGPDS LKTREALGKL DWLVTVDLWE TETSIFWKRP GVNPKEIKTE
VFMLPAASSV EKEGSISNSG RWAQWRYKAA EPVGEAKSDL WIIDKFFKGM KKAYEKGGAF
PEPITKLSWN YGNHEEPDVH LVAKEINGYF TKDMTIVDKD KTLEFKKGDQ VPMFKYLQDD
GSTVSGNWIY CGSYTKDGNL MARRDETDPT GLGLFPKWSW CWPVNRRIIY NRGSVNPDGA
PFNPKRAVIA WDALEKKWKG DVPDGPWPPM NDAKEGKYPF IMLPEGHGRL YALDLKDGPF
PEHYEPIESP ARNLMSKVQS NPAVKVPANM SSDLNKYPFV GTTYRMTEHW QAGAMTRSLP
WLVELVPTMF VEISQTLASS KGINNGDQVR ISTERGSIEA KALVTSRLKP FNVQGKMVEQ
VGLPWHFGYA GLATGDSGNV LTPSVGCANT SIPEFKAFLC NIEKGGKRA