Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1955 |
Symbol | |
ID | 8137289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2265925 |
End bp | 2268954 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869569 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003021766 |
Protein GI | 253700577 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 127 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTTT CACGGAGGGA GTTTCTGCAG GGAGGGGCGC TGGCTACGGC ACTGGTCCTT TCCGGCAAAA AAGCTGAAGC GGGCGGAGCC GACGCCCCGC AGATGCGCAC CAAGGGGCTG AAAAGCTCGA CCACCATCTG CCCTTTCTGC GCGGTAGGTT GCGGGCTTGT GGTGCATACC AAGAACGGCA AGATCGTCAA CATCGAGGGA GACATCCAGC ATCCGATCAA CCAGGGTGCG CTCTGCTCGA AGGGTAGCGC GCTGTTCCAG GTCGCCAACA ACGAGCGCCG CCTGCAGAAG GTCATGTACC GTGCTCCCGG CAGCGACAAG TTCGAAGAGA AGAGCTGGGA CTGGGCGCTG GAGCGGATCA GCCAGAAGAT GAAGGAGACC CGCGACAAGA GCTTCAAGGC CAAAGAGATC AACAAGAAGG ACAACAAGGA ATACGTGGTG AACCGCACCG AAGGGATGGC TTTCCTGGGG GGCGCCGGTC TCGACAACGA GGAGTGCTAC CTCTGGAGCA AGTTCGCCCG CTCCATGGGG GTCGCGAACC TCGAGCACCA AGCCCGAATA TGACACTCCG CTACAGTCGC CGGTCTGGCG GCTTCGTTTG GACGTGGGGC AATGACCAAC CACTGGATTG ACCTGAAAAA CGCCGATGCC ATCCTGGCCA TCGGTTGCAA CCCCGCTGAG AACCACCCGA TCTCGATGAA GTGGATCGAG GCGGCCATGG ATAACGGCGG CAAGCTTTTA GCCGTCGACC CGCGCTTCAC CAGGACCGGT TCCAAGGCCG ACCACTACGC GCAGATCCGT CCCGGCACCG ACATCGCCTT CCTGGGCGGG ATGATCAACT TCGCGCTGCA GAACAACATG ATCCACGAGG AGTACGTCCG CGAGTACACC AACGCCTCCT TCATCGTCAA CGACAAGTAC GACTTCAACG AGGGTATCTT CTGCGCCTTC GACGACCAGG AGAAGACCTA CGACTCGAAG GCCTGGGCCT ACGCGCTCGA CGGCTCCGGC AACGCCAAGC GCGACAAGTC GCTTAAGGAC CCGCGCTGCG TCTATCAGCT GCTGAAGAAG CACTACTCCC GCTACACCGT CGACATGGTC TGCTCTATCA CCGGCACCAA GAAGGAAGAG TACGTCGCCG TCGCCAAGGC TTTCTGCTCC ACCGGGCGCG CCGACAAGGC AGGCACCATC CTCTACGCGA TGGGTATCAC CCAGTCCACC CACGGCACCC AGAACGTGCG CGCGGTAGCC ATGCTGCAGA TGCTTCTGGG CAACATCGGC ATCGCCGGCG GCGGCGTCAA CGCGCTGCGC GGCGAGAGCA ACGTGCAAGG CTCGACCGAC TACGGCCTGC TCTTCCACAT CCTGCCGGGC TACCTGAAGT CCCCTGAGAT CGACAACGTG GACCTGAAGG CATACCTGGA GAAGTGGACC CCGAAGACCA AGGACGCAAA GAGCGCCAAC TGGTGGGGGA ACACCCCGAA GTACACCGTA AGCCTCCTGA AGGCCTGGTA CGGCGACAAC GCCAGCGCGG AAAACGGCTT CTGCTACGAC TACCTCCCCA AGAGAAGCGG CAACTACTCC TTCATGAAAC TGATGGAGAA GATGGGCAAT GGGGAACTGC AGGGTCTCGT CTGCATGGGG CAGAACCCGG CCGTAGGCGG CCCGGACTCC CTGAAGACCC GCGAGGCGCT GGGGAAACTC GACTGGCTCG TCACCGTGGA CCTCTGGGAG ACCGAGACCT CCATCTTCTG GAAGCGCCCC GGCGTTAACC CCAAGGAGAT CAAGACCGAG GTCTTCATGC TGCCCGCCGC CTCCTCCGTC GAGAAGGAAG GCTCCATCTC CAACTCCGGC CGCTGGGCCC AGTGGCGCTA CAAGGCCGCC GAACCGGTAG GGGAGGCGAA GAGCGACCTC TGGATCATCG ACAAGTTCTT CAAGGGGATG AAGAAGGCCT ACGAGAAAGG GGGCGCGTTC CCCGAGCCGA TCACCAAGCT CTCCTGGAAC TACGGCAACC ACGAGGAGCC CGATGTCCAC CTGGTCGCCA AGGAGATCAA CGGCTACTTC ACCAAGGACA TGACCATCGT CGACAAGGAC AAGACCCTGG AGTTCAAGAA GGGGGATCAG GTACCGATGT TCAAGTACCT GCAGGACGAC GGCTCCACCG TCTCCGGCAA CTGGATCTAC TGCGGCTCCT ACACAAAAGA CGGGAACCTG ATGGCCCGCC GCGACGAAAC CGACCCGACC GGGCTCGGCT TGTTCCCGAA GTGGTCCTGG TGCTGGCCGG TCAACCGCCG CATCATCTAC AACAGGGGCT CCGTCAACCC CGACGGCGCG CCGTTCAACC CCAAGCGCGC GGTCATCGCC TGGGACGCGC TGGAGAAGAA GTGGAAAGGG GACGTACCCG ACGGCCCCTG GCCGCCGATG AACGACGCGA AGGAAGGGAA GTACCCCTTC ATCATGCTGC CGGAAGGGCA CGGCCGCCTC TATGCGCTGG ACCTGAAGGA CGGTCCTTTC CCGGAGCACT ACGAGCCTAT CGAGAGCCCG GCCAGGAACC TCATGTCCAA GGTGCAGAGC AACCCCGCGG TCAAGGTCCC GGCCAACATG TCGAGCGACC TTAACAAGTA CCCGTTCGTC GGCACCACCT ACAGGATGAC CGAGCACTGG CAGGCAGGGG CCATGACCCG CAGCCTGCCG TGGCTGGTGG AGCTGGTCCC GACCATGTTC GTCGAGATCT CGCAGACGCT CGCCTCCTCC AAGGGGATCA ACAACGGCGA CCAGGTGAGG ATCAGCACCG AGCGCGGTTC CATCGAGGCG AAGGCCCTGG TCACCTCCAG GCTGAAGCCG TTCAACGTGC AGGGAAAAAT GGTGGAGCAG GTGGGCCTTC CCTGGCACTT CGGTTATGCA GGTCTTGCTA CCGGCGACTC GGGCAACGTC CTGACCCCGT CCGTCGGCTG CGCGAATACG AGCATCCCCG AGTTCAAGGC ATTCCTCTGC AACATCGAGA AAGGGGGTAA ACGCGCATGA
|
Protein sequence | MAVSRREFLQ GGALATALVL SGKKAEAGGA DAPQMRTKGL KSSTTICPFC AVGCGLVVHT KNGKIVNIEG DIQHPINQGA LCSKGSALFQ VANNERRLQK VMYRAPGSDK FEEKSWDWAL ERISQKMKET RDKSFKAKEI NKKDNKEYVV NRTEGMAFLG GAGLDNEECY LWSKFARSMG VANLEHQARI UHSATVAGLA ASFGRGAMTN HWIDLKNADA ILAIGCNPAE NHPISMKWIE AAMDNGGKLL AVDPRFTRTG SKADHYAQIR PGTDIAFLGG MINFALQNNM IHEEYVREYT NASFIVNDKY DFNEGIFCAF DDQEKTYDSK AWAYALDGSG NAKRDKSLKD PRCVYQLLKK HYSRYTVDMV CSITGTKKEE YVAVAKAFCS TGRADKAGTI LYAMGITQST HGTQNVRAVA MLQMLLGNIG IAGGGVNALR GESNVQGSTD YGLLFHILPG YLKSPEIDNV DLKAYLEKWT PKTKDAKSAN WWGNTPKYTV SLLKAWYGDN ASAENGFCYD YLPKRSGNYS FMKLMEKMGN GELQGLVCMG QNPAVGGPDS LKTREALGKL DWLVTVDLWE TETSIFWKRP GVNPKEIKTE VFMLPAASSV EKEGSISNSG RWAQWRYKAA EPVGEAKSDL WIIDKFFKGM KKAYEKGGAF PEPITKLSWN YGNHEEPDVH LVAKEINGYF TKDMTIVDKD KTLEFKKGDQ VPMFKYLQDD GSTVSGNWIY CGSYTKDGNL MARRDETDPT GLGLFPKWSW CWPVNRRIIY NRGSVNPDGA PFNPKRAVIA WDALEKKWKG DVPDGPWPPM NDAKEGKYPF IMLPEGHGRL YALDLKDGPF PEHYEPIESP ARNLMSKVQS NPAVKVPANM SSDLNKYPFV GTTYRMTEHW QAGAMTRSLP WLVELVPTMF VEISQTLASS KGINNGDQVR ISTERGSIEA KALVTSRLKP FNVQGKMVEQ VGLPWHFGYA GLATGDSGNV LTPSVGCANT SIPEFKAFLC NIEKGGKRA
|
| |