Gene Gmet_0789 details

Gene Information

Locus tag: Gmet_0789
Symbol:
ID: 3738145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 869826
End bp: 871451
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 66%
IMG OID: 637778067
Product: aldehyde dehydrogenase
Protein accession: YP_383756
Protein GI: 78222009
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
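
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (both are assumptions, though they agree with the listed values). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(869826, 871451)
print(length)                     # 1626
print(protein_length_aa(length))  # 541
```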


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCATCA GTGAAAAAAT CGCAACCCTC TTTCCCGCAT CGGAGGATAT CCCGCCCCAA 
CACCGGCTCC CGGCCCCGGT GGAACTGCGG ACCTACCTCG TGGCAGGGGA ACTTCGCCAC
TGGGAGGGTG CATCCCAGGA AGTCTTCTCC CCGGTCTGCG TCAGGACCCC GGACGGACCC
GAGCGGGTCA GGATCGGGAG CTTTCCCCTC ATGGGGGGGG ACGACGCCAT GGCGGCCCTG
GACACGGCAG TGGCCACCTA TGACAACGGC AGGGGTGAAT GGCCCACCAT GACCGTTGCC
GACCGCATCG GCCACGTGCA ACGGTTCGCG GCCAGGATGA AGGAACGGCG GAGCGAGGTG
GTCCGGCTCC TCATGTGGGA GATCGGCAAG ACCGAAAAGG ACGCCGCCAA GGAGTTCGAC
CGGACCATCA CCTACATCGA CGACACCATC GATGCCCTCA AGGACTTGGA CCGGGTCTCC
TCCCGCTTCG TGATCCACGA GGGGATCATC GGGCAGATCC GCCGGGCACC CCTCGGCGTG
GCCCTCTGCA TGGGGCCCTA CAACTACCCC CTCAACGAAA CCTTCACCAC CCTCATTCCG
GCCCTCATCA TGGGGAATAC GGTGCTCCTG AAGCCACCGC GCCACGGGGT CCTCCTCTTC
GCACCGCTGC TCGAAGCCTT CCGGGACTCC TTCCCCCCCG GCGTGGTGAA TACCCTCTTC
GGGGCCGGCC GCACCGTCAC CCCGCCGCTC ATGGCGTCGG GAAAGGTGGA CGTCCTCGCC
TTCATCGGCA CGAGCCCCGC GGCCAACGCG CTCCACAAGG AGCACCCCAA GCCCCACCGG
CTCCGGACGG TCCTGGGGCT CGAGGCCAAG AACCCGGCCA TCGTCCTCCC CGACGCCGAC
CTGAACCTGG CGGTGGAGGA GTGCATCGCC GGAAGCCTCT CCTTCAACGG CCAGCGCTGC
ACCGCCCTCA AGATCATCTT CGTCCACGAA TCGGTGGCCG ACCACTTCCT GAATCTCTTC
TCCCGAGCCC TCGCCGCCCT CGGCTGCGGG ATGCCGTGGG AGCCGGGCGT GATGATCACC
CCCCTTCCCG AACCGGGAAA GGCGGTCTAT CTCTCAGAGC TGCTGGAAGA CGCCCGGAGC
CACGGCGCCC GGATCGTCAA TGACGGGGGA GGCTCCGTGA ACTGCTCCTT CTTCTCCCCG
GCCGTGGTCT ATCCGGTGAC CCCGGCCATG CGCCTCTACG CCGAGGAGCA GTTCGGCCCC
ATCGTGCCGG TGGTTCCCTT CACCGACGTC AGCACCCCCA TCCGCGCCAT CGAGGAGTCG
GACTACGGCC AGCAGGTGAG CATCTTCGGC CGCGATCCCC AGACCCTGGC GAACCTCGTG
GACCACCTGG TGAACCAGGT CTCCCGCGTG AACATCAACA GCCAATGCCA GCGGGGCCCC
GACATCTTCC CCTTCACCGG GCGGAAGGAC TCGGCTGTGG GGACCCTGTC GGTCTCGGAC
GCTCTGCGCT CCTTCTCCAT CCGGACCCTG GTGGCGGCCA AGGAAATAGA TCTGAACAAG
GAGATCATTG CCGATATCGT CCGGGAGCAC CGCTCCAACT TCCTCTCCAC GGACTTCATC
CTCTGA
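
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be normalized before it can be measured. The following is a minimal sketch of how the blocks might be joined and the GC fraction computed. The helper names are hypothetical, and the page's own method for deriving its GC content field is not stated, so this is only one plausible way to do it.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous A/C/G/T sequence)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# The final block of the gene sequence as a small worked example.
tail = clean_sequence("CTCTGA\n")
print(len(tail))          # 6
print(gc_fraction(tail))  # 0.5
```

Applied to the full sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_fraction(...)` with the GC content field.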
 
Protein sequence
MTISEKIATL FPASEDIPPQ HRLPAPVELR TYLVAGELRH WEGASQEVFS PVCVRTPDGP 
ERVRIGSFPL MGGDDAMAAL DTAVATYDNG RGEWPTMTVA DRIGHVQRFA ARMKERRSEV
VRLLMWEIGK TEKDAAKEFD RTITYIDDTI DALKDLDRVS SRFVIHEGII GQIRRAPLGV
ALCMGPYNYP LNETFTTLIP ALIMGNTVLL KPPRHGVLLF APLLEAFRDS FPPGVVNTLF
GAGRTVTPPL MASGKVDVLA FIGTSPAANA LHKEHPKPHR LRTVLGLEAK NPAIVLPDAD
LNLAVEECIA GSLSFNGQRC TALKIIFVHE SVADHFLNLF SRALAALGCG MPWEPGVMIT
PLPEPGKAVY LSELLEDARS HGARIVNDGG GSVNCSFFSP AVVYPVTPAM RLYAEEQFGP
IVPVVPFTDV STPIRAIEES DYGQQVSIFG RDPQTLANLV DHLVNQVSRV NINSQCQRGP
DIFPFTGRKD SAVGTLSVSD ALRSFSIRTL VAAKEIDLNK EIIADIVREH RSNFLSTDFI
L
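
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, not the database's own pipeline. It builds the codon table from the conventional TCAG ordering, whose amino acid assignments are shared by the standard code and translation table 11; the two differ only in permitted start codons, which this sketch does not handle. That is acceptable here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored; a codon containing anything other than
    A/C/G/T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGACCATCA GTGAAAAAAT CGCAACCCTC"))  # MTISEKIATL
```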