Gene Sfum_0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0833 
Symbol 
ID4460780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp1033681 
End bp1036671 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content62% 
IMG OID639701595 
Productaldehyde dehydrogenase 
Protein accessionYP_844965 
Protein GI116748278 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0506] Proline dehydrogenase
[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.286586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.902377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCTG ACCTTGAAAG ACGCGTGCAG CAAACGGGAT TGTGGCTCTA CGAATTGATC 
GAAGGGGAGT CTCCATCGGT CTTCCGGAAG GAATTCTGGA CCGGCAAGAT GCTGGAATGG
TGCATGCAGA ATGAGGCCTT CAAGGTCGAG ATGTTCCGGT TCGTGGATGT TTTTCCCTAT
CTTACCCGTC CCGAATCCGT GGCACGGCAC GTGCAAGAGT ACTTTTCCAG GCCCGGCGTG
AACTTCCCCG CGGTCCTCCA GTGGGGGCTG AGAGCCGTTT CCCCCGGGTC GCTCACTGCC
AAGGTGATCG CCAGAAGCAT CACGCACAAC CTCCACAACA TGGCAAGACA GTTCATCGTC
GGCTCGAATC CTTCCGAAGC CCTTCCGAAT CTCGAAAGGC TTCGCGGCCA GGGCATGGCC
TTCACGATCG ACCTGCTCGG TGAAGCCGTC GTTTCCGAGA AGGAAGCCGA GGAATACGTG
AGCCGCTACC TGGAGCTGTT CGACATCCTC GACGAGGCGC AACGGAAATG GCCGGCAATC
GGCGGGGGGG CCCAGCAGGC GGACTGGGGA CACGCGCCCA AGGTGAACGT CTCCATCAAA
GCCTCGGCGA TGTATTCTCA GATGAGCGCC CGCTCGTTCG AAGATTCCGT CGCCCGATCG
AAGGAAAAGC TGCGGCCGAT CCTGCGGAAG GCCCTGGCCA CCGGGTCCTT CGTCAATCTG
GACATGGAAC GGCATGCGCT CAAAGACCTT ACCCTGGCGC TTTTCCGCAG CTTGATGGAA
GAGGATGAAT TCCGGGACTA TCCACACGTG GGCATCGTCA TCCAGGCCTA TCTCAAGGAC
AGCGAGCGGG ACCTGGAGGA AATCCTCGGA TGGGCGAAGG CGACCGGCCG ACATTTCACG
ATACGCCTGG TGAAAGGGGC TTACTGGGAT TCGGAGGTCA TCTGGGCGCG CCAGAGCGAG
TGGCCCGTTC CCGTGTTCAC TTCCAAGCCG GAGACCGATG CCAATTTCGA AAAACTGGCC
GACCTCATCA TGGAGAATCA CCAGTGGGTG TCGCTGGCCT GCGCCTCTCA CAACATGCGT
TCCATCTCCT ACGTGATGGA ACGCGCCAGG GACTTGTCCG TTCCCGCCGG TCGCCTGGAA
TACCAGGTGC TCTACGGCAT GGGGGAGCCG GTGCGCAACG CCCTGCGCAA GGCCGGACTG
CCGGTCCGGC TCTACACTCC GGTCGGGGAC ATGATTCAGG GCATGTCTTA TCTCGTTCGC
AGGCTGCTGG AGAACACAGC GAACGAGTCG TTTCTGAGGA AGAGCTTCTT CCAGGGCGTA
TCCCGTGAAC TGCTGCTCCG CAATCCGATG GACGTTCTTG CGGAGGAGCG GACGGCGGGC
CCCGTTCCCG CCCGCGATGC ACCGGAATAC GGCGACAAAG GCCCCTTCTG CAATGAACCC
TGCTTCGATT GGACGATTCC CGAGCACCGG GCCGGCTTTC GGGACGCACT GGATCGGGTC
CGCGCAACCT TTCCCATCAA GGTGCCGTTG ACCATCGGAG GCTCAAGATT CGACACCCCC
GTGCGCCTTC GATCGGTAAA CCCGAACCGC GCGGAAGAGA TGGTCGGTGA TGTGGCCGGT
GCCGGACCGC TGGAAGCGGA TGCGGCGGTG GAGGCGGCCA AAGCCGCTTT TGCGGCGTGG
CGCGACACGC CCCCCGGGGA GCGCGCCGAG TATTTGTTCA AAGCGGCGGC GGCTGCCCGG
AGGATACGAT ACGACCTGGC CGCGCTGCAG GTGTACGAGG TGGGAAAGGC GTGGAGCGAG
GCCGATGCGG ACGTCTGCGA GGCCATAGAT TTCCTGGAGT ATTACGGCAG GGAAATGATC
CGATTGAGCC GTCCGAAGAG GATGGGACAC GCCCCCGGGG AGATCAGCCA CCTGTTCTAC
GAGCCGAGAG GGGTTGCCGC AGTGATCGCC CCCTGGAATT TCCCGATGGC CATATCGACC
GGAATGACCT CCGCCGCGCT GGTCACCGGG AACACCGTGG TCTACAAGCC CGCCTCCCAG
TCTCCCGTGG TCGGTTCGAT GGTGATGAAC GTCTTCGAAG AGGCGGGGCT GCCCAAAGGT
GTCCTGAGCT TTCTTCCGGG ACCCGGCGCC CAGATCGGCG ACTACCTCGT TCATCATCCG
GACGTAGCGG TGATCGCGTT CACCGGCAGC AAGAAGGTTG GACTCGACAT CATCGCCCAG
GCCAACCGGG ATGCGGAGCG TGCCGGCCAC GTCAAGACCG TTGTCGCCGA AATGGGGGGA
AAGAACGCGA TCGTCGTGGA TGCCGATGCC GATCTGGATG AAGCGCTTGC GCAGATCGTT
CACTCGGCGT TCGGCTACCA GGGGCAGAAA TGCTCCGCTT GCTCGCGCCT GATCGTCCTC
GAGGAGATTT ACGACAAGCT GGTCGAACGG CTGAAGGCCG CCGCCGAAAG CATTCACCTG
GGCCCTCCCG AAGACCCGAA GAATCTGATG GGGGCCGTGA TCGAAGCCGG CGCGCGGAAA
CGAATCATGG AATACATCGA GCTCGGGCGC AAGGACGGCA CGGTACTCGT GGAGCGGACC
GTTCCGGGAA ACGAGGGCTT TTTCGTGCCG CTCACGATCC TCGCGGACCT CCCGCCGGAT
CATCGCCTGG CCCGGGAAGA GATTTTCGGT CCGGTCCTGG TGGTCTTCAA GGTCAAGGAT
TTTGCCCGGG CGATTGAAAT CGCCAACGAC ACCGAATACG CCCTGACCGG CGGGGTGTTC
TCCAGGAGCC CGGCCAACAT CGACCTGGCC AGGCGCGAGT TCCGCACCGG CAACCTCTAC
ATCAACCGCG GCTGCACGGG AGCCGTCGTG GAAAGGCATC CGTTCGGAGG GTTCAAGCTT
TCGGGCATCG GCTCGAAAGC AGGGGGGCCG GATTATCTCC TTCAGTTTAT GGTGCCTCGC
AACGTCGTCG AAAACACCCT GCGGCGCGGC TTCGCACCCG CGGATGAATA G
 
Protein sequence
MDSDLERRVQ QTGLWLYELI EGESPSVFRK EFWTGKMLEW CMQNEAFKVE MFRFVDVFPY 
LTRPESVARH VQEYFSRPGV NFPAVLQWGL RAVSPGSLTA KVIARSITHN LHNMARQFIV
GSNPSEALPN LERLRGQGMA FTIDLLGEAV VSEKEAEEYV SRYLELFDIL DEAQRKWPAI
GGGAQQADWG HAPKVNVSIK ASAMYSQMSA RSFEDSVARS KEKLRPILRK ALATGSFVNL
DMERHALKDL TLALFRSLME EDEFRDYPHV GIVIQAYLKD SERDLEEILG WAKATGRHFT
IRLVKGAYWD SEVIWARQSE WPVPVFTSKP ETDANFEKLA DLIMENHQWV SLACASHNMR
SISYVMERAR DLSVPAGRLE YQVLYGMGEP VRNALRKAGL PVRLYTPVGD MIQGMSYLVR
RLLENTANES FLRKSFFQGV SRELLLRNPM DVLAEERTAG PVPARDAPEY GDKGPFCNEP
CFDWTIPEHR AGFRDALDRV RATFPIKVPL TIGGSRFDTP VRLRSVNPNR AEEMVGDVAG
AGPLEADAAV EAAKAAFAAW RDTPPGERAE YLFKAAAAAR RIRYDLAALQ VYEVGKAWSE
ADADVCEAID FLEYYGREMI RLSRPKRMGH APGEISHLFY EPRGVAAVIA PWNFPMAIST
GMTSAALVTG NTVVYKPASQ SPVVGSMVMN VFEEAGLPKG VLSFLPGPGA QIGDYLVHHP
DVAVIAFTGS KKVGLDIIAQ ANRDAERAGH VKTVVAEMGG KNAIVVDADA DLDEALAQIV
HSAFGYQGQK CSACSRLIVL EEIYDKLVER LKAAAESIHL GPPEDPKNLM GAVIEAGARK
RIMEYIELGR KDGTVLVERT VPGNEGFFVP LTILADLPPD HRLAREEIFG PVLVVFKVKD
FARAIEIAND TEYALTGGVF SRSPANIDLA RREFRTGNLY INRGCTGAVV ERHPFGGFKL
SGIGSKAGGP DYLLQFMVPR NVVENTLRRG FAPADE