Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3374 |
Symbol | |
ID | 5165386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 3959750 |
End bp | 3962935 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640550860 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001232104 |
Protein GI | 148265398 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATTT CAAGGAGGGA TTTTTTCAGG ATCTCCGGGG CGGGGGTGGC GGCGACCACC CTCGGTCTCA ACCTCGCGCC GGTCGAGGCG AAAGCGGGGG AGCTCGCCAT CCGCTACGCC AAAGAGACGA CGACCATCTG CCCCTACTGT TCGGTGGGGT GCGGGATGAT CGTCCACACG CTGAACGGCA GCGTCATCAA CATCGAGGGG GACCCCGATC ACCCGATCAG CGAGGGGAGC CTCTGCCCGA AGGGTTCGTC GGTCTACCAG CTCCGGGACA ACCCGGCCCG GGTGACGCGG CCGATGTACC GGGCCCCCGG CTCACACGAG TGGCAGACGG TCACCTGGGA GTGGGCCATT GACGAGATCG CGAAAAGGGT GAAGAAAACC AGGGACGCCT CCTTCGTCCC CTCTTCGAAG ATCAAGGTGA AGGAGAAGGT GGCGGGCGCG GAGGTGGAGA AGGAGATCGA GGCGGTGGTG AACCGGACCA TGGGGATCGC CTCGGTGGGG AGCGCCGCCC TCGACAACGA GGAGTGCTAT CTGTATCAGA AATTCTTGAG GGGGTTGGGC CTGGTGTATA TCGAACATCA GGCGCGCATT TGACACAGCG CAACTGTAGC GGCTCTGGCA GAGTCGTTTG GACGCGGTGC AATGACGAAC CACTGGATCG ATTTCAAGAA TGCGGACGTA ATCCTCATCA TGGGTGCGAA CCCGGCGGAG AACCACCCGG TCTCCTTCCG CTGGATCATG AAGGCGAAGG ACGCGGGCGC CAAGGTGATC TGCGTCGACC CCCGCTTCAC CCGCAGCGCT TCGAAGGCGG ACATCTACGC GCCGCTTCGC TCGGGGACCG ACATCGCCTT CCTCGGCGGG ATGATCAGCT ACATCCTGGA GAACAGGCTC TATTTCGACG AGTACGTGAA GAATTACACC AACGCCTCCT TCCTGGTGAA CCCGGAATTC AAGCTGCCGG GGGAGCTTGC CGGCCTCTTC TCCGGCTACG ACCCGAAGAA GCGGAGCTAC GACCCGAAGG CGTGGGGATT CCAGAAGGAC GGGGACGACA ACGTGCTGCG GGACCCGGCC CTCAAGGACC CGAACTGCGT CTTCCAGCTC CTGCGGCGGC ACTACGCACG TTACACCCCC GACAAGGTGT CGCAGATCAC CGGCACCCCC AAGGATAAGC TCCTGGAGGT ATACAAGGCC TACGCCTCCA CCGGCGCGCC GGACCGGGCG GGGACGTCCC TCTACGCCAT GGGGTGGACC CAGCACACGG TCGGCACCCA GAACATCCGC GCCATGTCCA TCATCCAGCT CCTTCTGGGC AACATGGGGG TGGCGGGTGG CGGCATCAAC GCCCTGCGCG GGGAATCCAA CGTCCAGGGC TCCACGGACC ACGGGCTCCT CTTCCATATT CTCCCCGGCT ATCTCCCGGT CCCTTCGGCC GACCTCAAGG ACCTCTCCGC CTACAACGAG AAGCACACCC CGAAGACGAA GGACCCCCAG AGCGCCAACT GGTGGCAGAA CCGCCCCAAG TACATCGCGA GCTACCTGAA GGCGATCTAC GGGGCAAAGG CGACGAAGGA GAACGACTTC GGCTACCAGT GGCTCCCCAA GCTCGACCCG GGGATGAACG GCTCCTGGCT CATGATCTTC GACAACATGA TCCGCGGCAA GTTCAAGGGG TTCTTCGCCT GGGGGCAGAA CCCGGCCTGT TCCGGGAGCA ATGCCAACAA GGTGAGGAAG GCCCTCGCCA AGCTCGACTG GATGGTGACG GTCAACCTCT TCGACAACGA GACCGCCTCC TTCTGGAAAG GACCGGGCAT GGAGCCCTCC AAGGTGCCGA CCGAGGTCTT CTTCCTCCCT GCGGCCGCCT CCTTCGAGAA GGAGGGGAGC ATCTCCAACT CCGGCCGCTG GGCCCAGTGG CGCTACGCGG CGGTGAAGCC CCTGGGGCAG TCGAAGCCGG ACGCGGAGAT CATCAACGAG CTCTTCTTCA GGCTGAAGGG ACTCTATGCC AAAGATGGGG GGGCGCTCCC CGAGCAACTC ACCAACCTCA CCTGGAACTA CGGTTTCAAA CGGGCCGACG GCACCATCAG GAGCGTCGAC ATCCATGCGG TGGCGAAGGA GATCAACGGC TCCTTCCTGG AGGAGGTGGA GGAGAAGCCG AAGCCTCTGA AACCTGGTGA GAATCCACCG CCACCTAAGG TGATACCCAA GCCCTGGGAG AAGAAGCCCC TGGGGAAGAA GGGGGAGCTC ATCGACGGCT TCGCCAAGCT CCAGGCCGAC GGCACCACCT CGTGCGGCAA CTGGATCTAC TGCCAGAGCT ACAACGAGAA GGGGAACCTC ATGGCGCGGC GGGCAAAGAA GGACCCGACC GGCCTCGGCC TCTTCCCCGA ATGGGCGTGG GCCTGGCCGG TGAACCGGCG CATCATCTAC AACCGGGCCT CGGTCAACCC GGACGGCAAG CCCTACAACA TGAAGAAGGC GGTCGTCTAC TGGAACCCGA CGGCCGTCCT CCCCGACGGA AAGGTGGGCA AGTGGGAGGG GGATGTCCCC GACGGCCCCT GGCCTCCCAT GGCAGATGCC AAGGAGGGGA GAAAGCCTTT CATCATGCGG CCTGACGGGG TGGGCGCCCT GTTCGGCCCC GGCATGAAGG ACGGGCCGTT CCCAGAGCAT TACGAGCCCC TCGAATGCCC GGTGCCGGAA AACCTCATGT CGAAGCAGAC GGTCAACCCG GCCATCAAGC TCTTCGCCAA CGCGGGGCTT GCCGAGGATG CCTATGCCAC CTGCGACGTC CGCTTCCCTT ACGTGGGGAC CACCTACCGG GTGACCGAGC ACTGGCAGAC CGGGGTCATG ACCCGCAATA CCCCGTGGCT GCTGGAGTTG CAGCCGCGCC AGTTCGTCGA GATGAGCGTC GAGCTGGCGA AAGAAAAGGG GATCAGAAAC GGCGATATCG TAGAGGTGGC CTCGGTCAGG GGCGCGATTG AGGCTGTCGC CGTCGTCACC CCGCGCATGC GGCCGTTCCA GATCGGCGGC CGGACCGTGC ACGAGGTCGG ACTTCCCTGG TGCTTCGGCT GGTTCACGCC GGGGGTGGGG GATGCGGCAA ACCTGCTGAC GCCGACTGCC GGCGATGCAA ATACCATGAT TCCCGAAACC AAGGCGTTTA TGGTCGGGAT CAAGAGAAAG GGGTGA
|
Protein sequence | MGISRRDFFR ISGAGVAATT LGLNLAPVEA KAGELAIRYA KETTTICPYC SVGCGMIVHT LNGSVINIEG DPDHPISEGS LCPKGSSVYQ LRDNPARVTR PMYRAPGSHE WQTVTWEWAI DEIAKRVKKT RDASFVPSSK IKVKEKVAGA EVEKEIEAVV NRTMGIASVG SAALDNEECY LYQKFLRGLG LVYIEHQARI UHSATVAALA ESFGRGAMTN HWIDFKNADV ILIMGANPAE NHPVSFRWIM KAKDAGAKVI CVDPRFTRSA SKADIYAPLR SGTDIAFLGG MISYILENRL YFDEYVKNYT NASFLVNPEF KLPGELAGLF SGYDPKKRSY DPKAWGFQKD GDDNVLRDPA LKDPNCVFQL LRRHYARYTP DKVSQITGTP KDKLLEVYKA YASTGAPDRA GTSLYAMGWT QHTVGTQNIR AMSIIQLLLG NMGVAGGGIN ALRGESNVQG STDHGLLFHI LPGYLPVPSA DLKDLSAYNE KHTPKTKDPQ SANWWQNRPK YIASYLKAIY GAKATKENDF GYQWLPKLDP GMNGSWLMIF DNMIRGKFKG FFAWGQNPAC SGSNANKVRK ALAKLDWMVT VNLFDNETAS FWKGPGMEPS KVPTEVFFLP AAASFEKEGS ISNSGRWAQW RYAAVKPLGQ SKPDAEIINE LFFRLKGLYA KDGGALPEQL TNLTWNYGFK RADGTIRSVD IHAVAKEING SFLEEVEEKP KPLKPGENPP PPKVIPKPWE KKPLGKKGEL IDGFAKLQAD GTTSCGNWIY CQSYNEKGNL MARRAKKDPT GLGLFPEWAW AWPVNRRIIY NRASVNPDGK PYNMKKAVVY WNPTAVLPDG KVGKWEGDVP DGPWPPMADA KEGRKPFIMR PDGVGALFGP GMKDGPFPEH YEPLECPVPE NLMSKQTVNP AIKLFANAGL AEDAYATCDV RFPYVGTTYR VTEHWQTGVM TRNTPWLLEL QPRQFVEMSV ELAKEKGIRN GDIVEVASVR GAIEAVAVVT PRMRPFQIGG RTVHEVGLPW CFGWFTPGVG DAANLLTPTA GDANTMIPET KAFMVGIKRK G
|
| |