Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_3724 |
Symbol | |
ID | 4457976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 4540264 |
End bp | 4542162 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639704497 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_847829 |
Protein GI | 116751142 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase [TIGR01362] 3-deoxy-8-phosphooctulonate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTCTTA TCCTGAAAAA GAAGGCAACC GCTGAACAGA TGGAAAGGCT CAAGGATGTC CTGCGCTCCG AGGGATACCT GGTGAAGGAG ATCGCCGGCG TGGACGAGAA GATCCTGGGC GTCGTGGGCA CGATGTACAA GGAGACGGCC TTCTATGAAT CCCTGCCCGG GGTGGAGCGG GCGGTCCCCA TCTCCAAGCC GTACAAGCTG GTGAGCCGCG AGCTCCATCC CGCCCCGTCG GTCATCAAGG TTGGCGACGT CACCATCGGC GGAGACCGGC TGGTGGTGAT CGCCGGTCCC TGCGGTGTCG AGGACCGGAA GAGGACCCTG GATATCGCGC GCACGGTTCG CAAACACGGA GCGGTCCTGT TCCGGGGCGG TGCGTTCAAG CCCCGCACCT CGCCCTACTC GTTCCAGGGC CTGGGTGAGG AGGGCTTGAA GATTCTGCGG GAAGTGAGAG AGGAAACCGG CCTCGGAGTG GTCACGGAGA TCACCTCTCC CAGCCAGGCG GACCTCATGG TGAAGTACGT GGACGTCGTC CAGGTCGGCG CCCGCAACAT GCAGAACTTC GAGCTCCTGA AGTCGGTCGG CCGAATCGGC AAGCCGGTGC TCCTCAAGCG CGGGCTGTCG GCGACCATCG AGGAATGGCT CATGTCGGCC GAGTACGTGC TTTCCGAAGG AAACGACCAG GTCATTCTGT GCGAGCGGGG CATCCGGACG TTCGAGCGCT ACACGCGAAA CACCCTGGAC CTCACGGCCG TTCCGGTCAT CAAGAAACTC ACCCACCTCC CGATCATCGT CGATCCGAGC CACGCCACGG GGATCCGGGA AAAGGTCAGC CCCATGGCCC GCGCGTCCAT CGCGGCGGGA GCCGACGGGC TGATCATCGA GGTTCACACG GAACCCGACA AGGCACTCTC CGATGGTCCC CAGAGCCTCT ATCCCGAACA GTTCGAGCAG CTCATGCGCG ACCTCTACGT CATCGCCCCG GTGGTGGGAA AGCAGGTCGA CTACGCCTAC CTCGACAAGG CGGCCATCAT GAAGCCTCGC AAGGGCAAAG GCAAGGCGGC CCCGATGGTC GTCTACAGCG GCGTCCCGGG TTCCTTTTCG CACAAGGCGT GTCTGCAGTT CTTCGGAACG GAGGTCCCGA TCCGGGAATG CACATGCTTC AGGGAGGTTT TCGACTCCGT GGCCGGCGAA CAGGCCGCCT TCGGCGTCAT CCCCGTGGAG AACAGCCTTA CCGGGAGCAT TCACGAAAAC TACGACCTGC TCCTCGAATA CGCTATCATG ATCGTCGGGG AACTGACCCT GCGCATCAAG CACAACCTTC TGGGACACCT GGACTCCTCC ATCGAGGGAA TCGAACGCGT ATACTCGCAT CCCCAGGTAT TCCAGCAGTG CCGCGAGTAC CTGGACAAGC ATCCCGCGTG GGATCAGATT GCGTGCAAGG ACACGGCCAG CGCCGTGCGC AAGGTGGAAG AAGCCGGCGA TGCGAAAGAA GCCGCCATCG CCGGAGTCGG TGCGGTTCAA ACCCGGCGGA TGACGGTGCT CAAGGAAAGC ATCGAAACCA ATCCCCGGAA TTTCACCCGG TTTGTGGTCA TTTCGAAAAA CGAGTCGCTG CCCGGGCCCA AGAACAAGTC CTCCCTGATC TATTCGGTAA GCGACAAGCC GGGCGCTCTC TTCGAGACGC TGCGCATCTT CGCGGAGAAC AATATCAACC TGGTCAAGCT GGAATCCCGG CCCATCCACA GCAGGCCCTG GGAGTACCTG TTCTATGCGG ATCTCGAGGT CGACGTCACG GAAGACGGCC GCAGGCACAT CCTCGAAGGG CTCATGAGCA AAACCGAATT CTTCAAGTTT CTCGGCAGCT ACCAGAAGGG GACCGAAGTG AGTCATTAG
|
Protein sequence | MILILKKKAT AEQMERLKDV LRSEGYLVKE IAGVDEKILG VVGTMYKETA FYESLPGVER AVPISKPYKL VSRELHPAPS VIKVGDVTIG GDRLVVIAGP CGVEDRKRTL DIARTVRKHG AVLFRGGAFK PRTSPYSFQG LGEEGLKILR EVREETGLGV VTEITSPSQA DLMVKYVDVV QVGARNMQNF ELLKSVGRIG KPVLLKRGLS ATIEEWLMSA EYVLSEGNDQ VILCERGIRT FERYTRNTLD LTAVPVIKKL THLPIIVDPS HATGIREKVS PMARASIAAG ADGLIIEVHT EPDKALSDGP QSLYPEQFEQ LMRDLYVIAP VVGKQVDYAY LDKAAIMKPR KGKGKAAPMV VYSGVPGSFS HKACLQFFGT EVPIRECTCF REVFDSVAGE QAAFGVIPVE NSLTGSIHEN YDLLLEYAIM IVGELTLRIK HNLLGHLDSS IEGIERVYSH PQVFQQCREY LDKHPAWDQI ACKDTASAVR KVEEAGDAKE AAIAGVGAVQ TRRMTVLKES IETNPRNFTR FVVISKNESL PGPKNKSSLI YSVSDKPGAL FETLRIFAEN NINLVKLESR PIHSRPWEYL FYADLEVDVT EDGRRHILEG LMSKTEFFKF LGSYQKGTEV SH
|
| |