Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_0833 |
Symbol | |
ID | 4460780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 1033681 |
End bp | 1036671 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639701595 |
Product | aldehyde dehydrogenase |
Protein accession | YP_844965 |
Protein GI | 116748278 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.286586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.902377 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCTG ACCTTGAAAG ACGCGTGCAG CAAACGGGAT TGTGGCTCTA CGAATTGATC GAAGGGGAGT CTCCATCGGT CTTCCGGAAG GAATTCTGGA CCGGCAAGAT GCTGGAATGG TGCATGCAGA ATGAGGCCTT CAAGGTCGAG ATGTTCCGGT TCGTGGATGT TTTTCCCTAT CTTACCCGTC CCGAATCCGT GGCACGGCAC GTGCAAGAGT ACTTTTCCAG GCCCGGCGTG AACTTCCCCG CGGTCCTCCA GTGGGGGCTG AGAGCCGTTT CCCCCGGGTC GCTCACTGCC AAGGTGATCG CCAGAAGCAT CACGCACAAC CTCCACAACA TGGCAAGACA GTTCATCGTC GGCTCGAATC CTTCCGAAGC CCTTCCGAAT CTCGAAAGGC TTCGCGGCCA GGGCATGGCC TTCACGATCG ACCTGCTCGG TGAAGCCGTC GTTTCCGAGA AGGAAGCCGA GGAATACGTG AGCCGCTACC TGGAGCTGTT CGACATCCTC GACGAGGCGC AACGGAAATG GCCGGCAATC GGCGGGGGGG CCCAGCAGGC GGACTGGGGA CACGCGCCCA AGGTGAACGT CTCCATCAAA GCCTCGGCGA TGTATTCTCA GATGAGCGCC CGCTCGTTCG AAGATTCCGT CGCCCGATCG AAGGAAAAGC TGCGGCCGAT CCTGCGGAAG GCCCTGGCCA CCGGGTCCTT CGTCAATCTG GACATGGAAC GGCATGCGCT CAAAGACCTT ACCCTGGCGC TTTTCCGCAG CTTGATGGAA GAGGATGAAT TCCGGGACTA TCCACACGTG GGCATCGTCA TCCAGGCCTA TCTCAAGGAC AGCGAGCGGG ACCTGGAGGA AATCCTCGGA TGGGCGAAGG CGACCGGCCG ACATTTCACG ATACGCCTGG TGAAAGGGGC TTACTGGGAT TCGGAGGTCA TCTGGGCGCG CCAGAGCGAG TGGCCCGTTC CCGTGTTCAC TTCCAAGCCG GAGACCGATG CCAATTTCGA AAAACTGGCC GACCTCATCA TGGAGAATCA CCAGTGGGTG TCGCTGGCCT GCGCCTCTCA CAACATGCGT TCCATCTCCT ACGTGATGGA ACGCGCCAGG GACTTGTCCG TTCCCGCCGG TCGCCTGGAA TACCAGGTGC TCTACGGCAT GGGGGAGCCG GTGCGCAACG CCCTGCGCAA GGCCGGACTG CCGGTCCGGC TCTACACTCC GGTCGGGGAC ATGATTCAGG GCATGTCTTA TCTCGTTCGC AGGCTGCTGG AGAACACAGC GAACGAGTCG TTTCTGAGGA AGAGCTTCTT CCAGGGCGTA TCCCGTGAAC TGCTGCTCCG CAATCCGATG GACGTTCTTG CGGAGGAGCG GACGGCGGGC CCCGTTCCCG CCCGCGATGC ACCGGAATAC GGCGACAAAG GCCCCTTCTG CAATGAACCC TGCTTCGATT GGACGATTCC CGAGCACCGG GCCGGCTTTC GGGACGCACT GGATCGGGTC CGCGCAACCT TTCCCATCAA GGTGCCGTTG ACCATCGGAG GCTCAAGATT CGACACCCCC GTGCGCCTTC GATCGGTAAA CCCGAACCGC GCGGAAGAGA TGGTCGGTGA TGTGGCCGGT GCCGGACCGC TGGAAGCGGA TGCGGCGGTG GAGGCGGCCA AAGCCGCTTT TGCGGCGTGG CGCGACACGC CCCCCGGGGA GCGCGCCGAG TATTTGTTCA AAGCGGCGGC GGCTGCCCGG AGGATACGAT ACGACCTGGC CGCGCTGCAG GTGTACGAGG TGGGAAAGGC GTGGAGCGAG GCCGATGCGG ACGTCTGCGA GGCCATAGAT TTCCTGGAGT ATTACGGCAG GGAAATGATC CGATTGAGCC GTCCGAAGAG GATGGGACAC GCCCCCGGGG AGATCAGCCA CCTGTTCTAC GAGCCGAGAG GGGTTGCCGC AGTGATCGCC CCCTGGAATT TCCCGATGGC CATATCGACC GGAATGACCT CCGCCGCGCT GGTCACCGGG AACACCGTGG TCTACAAGCC CGCCTCCCAG TCTCCCGTGG TCGGTTCGAT GGTGATGAAC GTCTTCGAAG AGGCGGGGCT GCCCAAAGGT GTCCTGAGCT TTCTTCCGGG ACCCGGCGCC CAGATCGGCG ACTACCTCGT TCATCATCCG GACGTAGCGG TGATCGCGTT CACCGGCAGC AAGAAGGTTG GACTCGACAT CATCGCCCAG GCCAACCGGG ATGCGGAGCG TGCCGGCCAC GTCAAGACCG TTGTCGCCGA AATGGGGGGA AAGAACGCGA TCGTCGTGGA TGCCGATGCC GATCTGGATG AAGCGCTTGC GCAGATCGTT CACTCGGCGT TCGGCTACCA GGGGCAGAAA TGCTCCGCTT GCTCGCGCCT GATCGTCCTC GAGGAGATTT ACGACAAGCT GGTCGAACGG CTGAAGGCCG CCGCCGAAAG CATTCACCTG GGCCCTCCCG AAGACCCGAA GAATCTGATG GGGGCCGTGA TCGAAGCCGG CGCGCGGAAA CGAATCATGG AATACATCGA GCTCGGGCGC AAGGACGGCA CGGTACTCGT GGAGCGGACC GTTCCGGGAA ACGAGGGCTT TTTCGTGCCG CTCACGATCC TCGCGGACCT CCCGCCGGAT CATCGCCTGG CCCGGGAAGA GATTTTCGGT CCGGTCCTGG TGGTCTTCAA GGTCAAGGAT TTTGCCCGGG CGATTGAAAT CGCCAACGAC ACCGAATACG CCCTGACCGG CGGGGTGTTC TCCAGGAGCC CGGCCAACAT CGACCTGGCC AGGCGCGAGT TCCGCACCGG CAACCTCTAC ATCAACCGCG GCTGCACGGG AGCCGTCGTG GAAAGGCATC CGTTCGGAGG GTTCAAGCTT TCGGGCATCG GCTCGAAAGC AGGGGGGCCG GATTATCTCC TTCAGTTTAT GGTGCCTCGC AACGTCGTCG AAAACACCCT GCGGCGCGGC TTCGCACCCG CGGATGAATA G
|
Protein sequence | MDSDLERRVQ QTGLWLYELI EGESPSVFRK EFWTGKMLEW CMQNEAFKVE MFRFVDVFPY LTRPESVARH VQEYFSRPGV NFPAVLQWGL RAVSPGSLTA KVIARSITHN LHNMARQFIV GSNPSEALPN LERLRGQGMA FTIDLLGEAV VSEKEAEEYV SRYLELFDIL DEAQRKWPAI GGGAQQADWG HAPKVNVSIK ASAMYSQMSA RSFEDSVARS KEKLRPILRK ALATGSFVNL DMERHALKDL TLALFRSLME EDEFRDYPHV GIVIQAYLKD SERDLEEILG WAKATGRHFT IRLVKGAYWD SEVIWARQSE WPVPVFTSKP ETDANFEKLA DLIMENHQWV SLACASHNMR SISYVMERAR DLSVPAGRLE YQVLYGMGEP VRNALRKAGL PVRLYTPVGD MIQGMSYLVR RLLENTANES FLRKSFFQGV SRELLLRNPM DVLAEERTAG PVPARDAPEY GDKGPFCNEP CFDWTIPEHR AGFRDALDRV RATFPIKVPL TIGGSRFDTP VRLRSVNPNR AEEMVGDVAG AGPLEADAAV EAAKAAFAAW RDTPPGERAE YLFKAAAAAR RIRYDLAALQ VYEVGKAWSE ADADVCEAID FLEYYGREMI RLSRPKRMGH APGEISHLFY EPRGVAAVIA PWNFPMAIST GMTSAALVTG NTVVYKPASQ SPVVGSMVMN VFEEAGLPKG VLSFLPGPGA QIGDYLVHHP DVAVIAFTGS KKVGLDIIAQ ANRDAERAGH VKTVVAEMGG KNAIVVDADA DLDEALAQIV HSAFGYQGQK CSACSRLIVL EEIYDKLVER LKAAAESIHL GPPEDPKNLM GAVIEAGARK RIMEYIELGR KDGTVLVERT VPGNEGFFVP LTILADLPPD HRLAREEIFG PVLVVFKVKD FARAIEIAND TEYALTGGVF SRSPANIDLA RREFRTGNLY INRGCTGAVV ERHPFGGFKL SGIGSKAGGP DYLLQFMVPR NVVENTLRRG FAPADE
|
| |