Gene Gura_3374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3374 
Symbol 
ID5165386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp3959750 
End bp3962935 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content64% 
IMG OID640550860 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001232104 
Protein GI148265398 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTATTT CAAGGAGGGA TTTTTTCAGG ATCTCCGGGG CGGGGGTGGC GGCGACCACC 
CTCGGTCTCA ACCTCGCGCC GGTCGAGGCG AAAGCGGGGG AGCTCGCCAT CCGCTACGCC
AAAGAGACGA CGACCATCTG CCCCTACTGT TCGGTGGGGT GCGGGATGAT CGTCCACACG
CTGAACGGCA GCGTCATCAA CATCGAGGGG GACCCCGATC ACCCGATCAG CGAGGGGAGC
CTCTGCCCGA AGGGTTCGTC GGTCTACCAG CTCCGGGACA ACCCGGCCCG GGTGACGCGG
CCGATGTACC GGGCCCCCGG CTCACACGAG TGGCAGACGG TCACCTGGGA GTGGGCCATT
GACGAGATCG CGAAAAGGGT GAAGAAAACC AGGGACGCCT CCTTCGTCCC CTCTTCGAAG
ATCAAGGTGA AGGAGAAGGT GGCGGGCGCG GAGGTGGAGA AGGAGATCGA GGCGGTGGTG
AACCGGACCA TGGGGATCGC CTCGGTGGGG AGCGCCGCCC TCGACAACGA GGAGTGCTAT
CTGTATCAGA AATTCTTGAG GGGGTTGGGC CTGGTGTATA TCGAACATCA GGCGCGCATT
TGACACAGCG CAACTGTAGC GGCTCTGGCA GAGTCGTTTG GACGCGGTGC AATGACGAAC
CACTGGATCG ATTTCAAGAA TGCGGACGTA ATCCTCATCA TGGGTGCGAA CCCGGCGGAG
AACCACCCGG TCTCCTTCCG CTGGATCATG AAGGCGAAGG ACGCGGGCGC CAAGGTGATC
TGCGTCGACC CCCGCTTCAC CCGCAGCGCT TCGAAGGCGG ACATCTACGC GCCGCTTCGC
TCGGGGACCG ACATCGCCTT CCTCGGCGGG ATGATCAGCT ACATCCTGGA GAACAGGCTC
TATTTCGACG AGTACGTGAA GAATTACACC AACGCCTCCT TCCTGGTGAA CCCGGAATTC
AAGCTGCCGG GGGAGCTTGC CGGCCTCTTC TCCGGCTACG ACCCGAAGAA GCGGAGCTAC
GACCCGAAGG CGTGGGGATT CCAGAAGGAC GGGGACGACA ACGTGCTGCG GGACCCGGCC
CTCAAGGACC CGAACTGCGT CTTCCAGCTC CTGCGGCGGC ACTACGCACG TTACACCCCC
GACAAGGTGT CGCAGATCAC CGGCACCCCC AAGGATAAGC TCCTGGAGGT ATACAAGGCC
TACGCCTCCA CCGGCGCGCC GGACCGGGCG GGGACGTCCC TCTACGCCAT GGGGTGGACC
CAGCACACGG TCGGCACCCA GAACATCCGC GCCATGTCCA TCATCCAGCT CCTTCTGGGC
AACATGGGGG TGGCGGGTGG CGGCATCAAC GCCCTGCGCG GGGAATCCAA CGTCCAGGGC
TCCACGGACC ACGGGCTCCT CTTCCATATT CTCCCCGGCT ATCTCCCGGT CCCTTCGGCC
GACCTCAAGG ACCTCTCCGC CTACAACGAG AAGCACACCC CGAAGACGAA GGACCCCCAG
AGCGCCAACT GGTGGCAGAA CCGCCCCAAG TACATCGCGA GCTACCTGAA GGCGATCTAC
GGGGCAAAGG CGACGAAGGA GAACGACTTC GGCTACCAGT GGCTCCCCAA GCTCGACCCG
GGGATGAACG GCTCCTGGCT CATGATCTTC GACAACATGA TCCGCGGCAA GTTCAAGGGG
TTCTTCGCCT GGGGGCAGAA CCCGGCCTGT TCCGGGAGCA ATGCCAACAA GGTGAGGAAG
GCCCTCGCCA AGCTCGACTG GATGGTGACG GTCAACCTCT TCGACAACGA GACCGCCTCC
TTCTGGAAAG GACCGGGCAT GGAGCCCTCC AAGGTGCCGA CCGAGGTCTT CTTCCTCCCT
GCGGCCGCCT CCTTCGAGAA GGAGGGGAGC ATCTCCAACT CCGGCCGCTG GGCCCAGTGG
CGCTACGCGG CGGTGAAGCC CCTGGGGCAG TCGAAGCCGG ACGCGGAGAT CATCAACGAG
CTCTTCTTCA GGCTGAAGGG ACTCTATGCC AAAGATGGGG GGGCGCTCCC CGAGCAACTC
ACCAACCTCA CCTGGAACTA CGGTTTCAAA CGGGCCGACG GCACCATCAG GAGCGTCGAC
ATCCATGCGG TGGCGAAGGA GATCAACGGC TCCTTCCTGG AGGAGGTGGA GGAGAAGCCG
AAGCCTCTGA AACCTGGTGA GAATCCACCG CCACCTAAGG TGATACCCAA GCCCTGGGAG
AAGAAGCCCC TGGGGAAGAA GGGGGAGCTC ATCGACGGCT TCGCCAAGCT CCAGGCCGAC
GGCACCACCT CGTGCGGCAA CTGGATCTAC TGCCAGAGCT ACAACGAGAA GGGGAACCTC
ATGGCGCGGC GGGCAAAGAA GGACCCGACC GGCCTCGGCC TCTTCCCCGA ATGGGCGTGG
GCCTGGCCGG TGAACCGGCG CATCATCTAC AACCGGGCCT CGGTCAACCC GGACGGCAAG
CCCTACAACA TGAAGAAGGC GGTCGTCTAC TGGAACCCGA CGGCCGTCCT CCCCGACGGA
AAGGTGGGCA AGTGGGAGGG GGATGTCCCC GACGGCCCCT GGCCTCCCAT GGCAGATGCC
AAGGAGGGGA GAAAGCCTTT CATCATGCGG CCTGACGGGG TGGGCGCCCT GTTCGGCCCC
GGCATGAAGG ACGGGCCGTT CCCAGAGCAT TACGAGCCCC TCGAATGCCC GGTGCCGGAA
AACCTCATGT CGAAGCAGAC GGTCAACCCG GCCATCAAGC TCTTCGCCAA CGCGGGGCTT
GCCGAGGATG CCTATGCCAC CTGCGACGTC CGCTTCCCTT ACGTGGGGAC CACCTACCGG
GTGACCGAGC ACTGGCAGAC CGGGGTCATG ACCCGCAATA CCCCGTGGCT GCTGGAGTTG
CAGCCGCGCC AGTTCGTCGA GATGAGCGTC GAGCTGGCGA AAGAAAAGGG GATCAGAAAC
GGCGATATCG TAGAGGTGGC CTCGGTCAGG GGCGCGATTG AGGCTGTCGC CGTCGTCACC
CCGCGCATGC GGCCGTTCCA GATCGGCGGC CGGACCGTGC ACGAGGTCGG ACTTCCCTGG
TGCTTCGGCT GGTTCACGCC GGGGGTGGGG GATGCGGCAA ACCTGCTGAC GCCGACTGCC
GGCGATGCAA ATACCATGAT TCCCGAAACC AAGGCGTTTA TGGTCGGGAT CAAGAGAAAG
GGGTGA
 
Protein sequence
MGISRRDFFR ISGAGVAATT LGLNLAPVEA KAGELAIRYA KETTTICPYC SVGCGMIVHT 
LNGSVINIEG DPDHPISEGS LCPKGSSVYQ LRDNPARVTR PMYRAPGSHE WQTVTWEWAI
DEIAKRVKKT RDASFVPSSK IKVKEKVAGA EVEKEIEAVV NRTMGIASVG SAALDNEECY
LYQKFLRGLG LVYIEHQARI UHSATVAALA ESFGRGAMTN HWIDFKNADV ILIMGANPAE
NHPVSFRWIM KAKDAGAKVI CVDPRFTRSA SKADIYAPLR SGTDIAFLGG MISYILENRL
YFDEYVKNYT NASFLVNPEF KLPGELAGLF SGYDPKKRSY DPKAWGFQKD GDDNVLRDPA
LKDPNCVFQL LRRHYARYTP DKVSQITGTP KDKLLEVYKA YASTGAPDRA GTSLYAMGWT
QHTVGTQNIR AMSIIQLLLG NMGVAGGGIN ALRGESNVQG STDHGLLFHI LPGYLPVPSA
DLKDLSAYNE KHTPKTKDPQ SANWWQNRPK YIASYLKAIY GAKATKENDF GYQWLPKLDP
GMNGSWLMIF DNMIRGKFKG FFAWGQNPAC SGSNANKVRK ALAKLDWMVT VNLFDNETAS
FWKGPGMEPS KVPTEVFFLP AAASFEKEGS ISNSGRWAQW RYAAVKPLGQ SKPDAEIINE
LFFRLKGLYA KDGGALPEQL TNLTWNYGFK RADGTIRSVD IHAVAKEING SFLEEVEEKP
KPLKPGENPP PPKVIPKPWE KKPLGKKGEL IDGFAKLQAD GTTSCGNWIY CQSYNEKGNL
MARRAKKDPT GLGLFPEWAW AWPVNRRIIY NRASVNPDGK PYNMKKAVVY WNPTAVLPDG
KVGKWEGDVP DGPWPPMADA KEGRKPFIMR PDGVGALFGP GMKDGPFPEH YEPLECPVPE
NLMSKQTVNP AIKLFANAGL AEDAYATCDV RFPYVGTTYR VTEHWQTGVM TRNTPWLLEL
QPRQFVEMSV ELAKEKGIRN GDIVEVASVR GAIEAVAVVT PRMRPFQIGG RTVHEVGLPW
CFGWFTPGVG DAANLLTPTA GDANTMIPET KAFMVGIKRK G