Gene Gura_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_1033 
Symbol 
ID5166784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp1234570 
End bp1236195 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content58% 
IMG OID640548529 
Productaldehyde dehydrogenase 
Protein accessionYP_001229812 
Protein GI148263106 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTGA AAGAGAGGAT CGATAAATTG TTCCCGACGG AAGCGGAGAT AAGCAAGTCT 
TTCCGCCTGC CGGAACCGAT CGAGTTGAAC AGTTTTCTCA TCAACGGAGA ACTGCGCAGT
TGGAACGGTC CCATGCAGGA GGTTTTTTCC CCGGTCTGCG TGAAGACCGA AGCGGGGCTT
TTCCGGCAGA TGATCGGGAG GTTCCCGTTG ATGGCAGAGT CCGACGCGTT GTCTGTCCTT
GATGCGGCAG TCGGGGCCTA CGACTGCGGC CGGGGGCGCT GGCCGACCAT GTCCGTTGAG
GAGCGGATTG CCTGCGTTCA GGAATTCGCC TACCGGATGA AAGAAAAGCG GTCTGAGGTG
GTGAGCCTTC TCATGTGGGA AATCGGCAAG TCGCTCAAGG ATTCCGAAAA GGAATTCGAC
AGGACGGTCG ATTACATAGC CGATACCATC GATGCGCTGA AAGAACTGGA TCGGGTTTCG
TCCCGGTTTG TCGTCGCCCA GGGGATCATC GGCCAGATCC GTCGCGCGCC GATAGGGGTC
GCCCTTTGCA TGGGCCCCTA CAACTATCCC TTGAATGAAA CCTTCACCAC TCTGATTCCG
GCATTGATCA TGGGGAATAC GGTTATCCTC AAGCCGCCGC GCCACGGGGT ACTCCTATTT
TACCCCCTTC TGGAGGCGTT CCGCGATTCT TTCCCTCCCG GGGTGGTGAA CACGCTTTTC
GGCGCCGGAA GGACGGTCAC TCCGCCGCTG ATGGCTTCCG GCAAGGTGGA CGTGCTCGCC
TTCATCGGTA CGAGCAAGGC TGCCGATAGC TTGCAAAAAG GGCATCCCAG GATGCATCGG
CTCCGCTTGG TGCTGGGGCT GGAGGCAAAA AATCCCGCCA TTGTCCTCCC TGACGCCGAC
CTGGAGTCCG CTGTCGAGGA GTGTGTGGCC GGGAGCCTGT CGTTCAACGG CCAACGCTGC
ACTGCAATCA AGATCGTTTT CGTTCACGAG AGCATTGCGG ATGAATTCCT CAGCCGCTTT
GCAGCGGCAA TCGCCGTCAT GAAATGCGGT ATGCCATGGG AGTCGGGGGT CGGCATAACG
CCGTTGCCGG AGCCGGGTAA GCCGGAATAT CTGTCCTGCC TGGTTGCAGA CGCCGTACGC
CTTGGGGCAC GGGTAGCCAA TGAGGCGGGG GGGACGGTCA ACGGTACCTT TTTCTACCCG
GCCCTGGTGT ATCCGGTAAC GGCGGAGATG AAGCTTTATA ATGAAGAGCA GTTCGGTCCT
GTCATACCGG TCCTGCCGTT TACGGATATC GAGACGCCGA TCGAGTATCT CACGGCATCG
GACTACGGCC AGCAGGTGAG TATTTTCGGC CGGGATGCAG CGGTTCTGGC AAAGCTCATC
GATCCCCTGG TCAACCAGGT TTCCCGCGTC AATATCAACA GCCAGTGCCA GCGTGGCCCG
GATATCTTCC CCTTTACGGG CAGGAAAGAT TCGGCGGTCG GCACCCTCTC CGTTTCCGAT
GCCCTGCGGG CCTTTTCCAT CCGCACCCTC GTGGCCGCCA GAGATACCGA ACTCAATAAG
GAGATCATTC GCACTATCGT CCGCGAGCAA AAATCCAACT TTCTTTCCAC GGATTTCATT
CTGTAA
 
Protein sequence
MTLKERIDKL FPTEAEISKS FRLPEPIELN SFLINGELRS WNGPMQEVFS PVCVKTEAGL 
FRQMIGRFPL MAESDALSVL DAAVGAYDCG RGRWPTMSVE ERIACVQEFA YRMKEKRSEV
VSLLMWEIGK SLKDSEKEFD RTVDYIADTI DALKELDRVS SRFVVAQGII GQIRRAPIGV
ALCMGPYNYP LNETFTTLIP ALIMGNTVIL KPPRHGVLLF YPLLEAFRDS FPPGVVNTLF
GAGRTVTPPL MASGKVDVLA FIGTSKAADS LQKGHPRMHR LRLVLGLEAK NPAIVLPDAD
LESAVEECVA GSLSFNGQRC TAIKIVFVHE SIADEFLSRF AAAIAVMKCG MPWESGVGIT
PLPEPGKPEY LSCLVADAVR LGARVANEAG GTVNGTFFYP ALVYPVTAEM KLYNEEQFGP
VIPVLPFTDI ETPIEYLTAS DYGQQVSIFG RDAAVLAKLI DPLVNQVSRV NINSQCQRGP
DIFPFTGRKD SAVGTLSVSD ALRAFSIRTL VAARDTELNK EIIRTIVREQ KSNFLSTDFI
L