Gene Sfum_3724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3724 
Symbol 
ID4457976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4540264 
End bp4542162 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content61% 
IMG OID639704497 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_847829 
Protein GI116751142 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01362] 3-deoxy-8-phosphooctulonate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTCTTA TCCTGAAAAA GAAGGCAACC GCTGAACAGA TGGAAAGGCT CAAGGATGTC 
CTGCGCTCCG AGGGATACCT GGTGAAGGAG ATCGCCGGCG TGGACGAGAA GATCCTGGGC
GTCGTGGGCA CGATGTACAA GGAGACGGCC TTCTATGAAT CCCTGCCCGG GGTGGAGCGG
GCGGTCCCCA TCTCCAAGCC GTACAAGCTG GTGAGCCGCG AGCTCCATCC CGCCCCGTCG
GTCATCAAGG TTGGCGACGT CACCATCGGC GGAGACCGGC TGGTGGTGAT CGCCGGTCCC
TGCGGTGTCG AGGACCGGAA GAGGACCCTG GATATCGCGC GCACGGTTCG CAAACACGGA
GCGGTCCTGT TCCGGGGCGG TGCGTTCAAG CCCCGCACCT CGCCCTACTC GTTCCAGGGC
CTGGGTGAGG AGGGCTTGAA GATTCTGCGG GAAGTGAGAG AGGAAACCGG CCTCGGAGTG
GTCACGGAGA TCACCTCTCC CAGCCAGGCG GACCTCATGG TGAAGTACGT GGACGTCGTC
CAGGTCGGCG CCCGCAACAT GCAGAACTTC GAGCTCCTGA AGTCGGTCGG CCGAATCGGC
AAGCCGGTGC TCCTCAAGCG CGGGCTGTCG GCGACCATCG AGGAATGGCT CATGTCGGCC
GAGTACGTGC TTTCCGAAGG AAACGACCAG GTCATTCTGT GCGAGCGGGG CATCCGGACG
TTCGAGCGCT ACACGCGAAA CACCCTGGAC CTCACGGCCG TTCCGGTCAT CAAGAAACTC
ACCCACCTCC CGATCATCGT CGATCCGAGC CACGCCACGG GGATCCGGGA AAAGGTCAGC
CCCATGGCCC GCGCGTCCAT CGCGGCGGGA GCCGACGGGC TGATCATCGA GGTTCACACG
GAACCCGACA AGGCACTCTC CGATGGTCCC CAGAGCCTCT ATCCCGAACA GTTCGAGCAG
CTCATGCGCG ACCTCTACGT CATCGCCCCG GTGGTGGGAA AGCAGGTCGA CTACGCCTAC
CTCGACAAGG CGGCCATCAT GAAGCCTCGC AAGGGCAAAG GCAAGGCGGC CCCGATGGTC
GTCTACAGCG GCGTCCCGGG TTCCTTTTCG CACAAGGCGT GTCTGCAGTT CTTCGGAACG
GAGGTCCCGA TCCGGGAATG CACATGCTTC AGGGAGGTTT TCGACTCCGT GGCCGGCGAA
CAGGCCGCCT TCGGCGTCAT CCCCGTGGAG AACAGCCTTA CCGGGAGCAT TCACGAAAAC
TACGACCTGC TCCTCGAATA CGCTATCATG ATCGTCGGGG AACTGACCCT GCGCATCAAG
CACAACCTTC TGGGACACCT GGACTCCTCC ATCGAGGGAA TCGAACGCGT ATACTCGCAT
CCCCAGGTAT TCCAGCAGTG CCGCGAGTAC CTGGACAAGC ATCCCGCGTG GGATCAGATT
GCGTGCAAGG ACACGGCCAG CGCCGTGCGC AAGGTGGAAG AAGCCGGCGA TGCGAAAGAA
GCCGCCATCG CCGGAGTCGG TGCGGTTCAA ACCCGGCGGA TGACGGTGCT CAAGGAAAGC
ATCGAAACCA ATCCCCGGAA TTTCACCCGG TTTGTGGTCA TTTCGAAAAA CGAGTCGCTG
CCCGGGCCCA AGAACAAGTC CTCCCTGATC TATTCGGTAA GCGACAAGCC GGGCGCTCTC
TTCGAGACGC TGCGCATCTT CGCGGAGAAC AATATCAACC TGGTCAAGCT GGAATCCCGG
CCCATCCACA GCAGGCCCTG GGAGTACCTG TTCTATGCGG ATCTCGAGGT CGACGTCACG
GAAGACGGCC GCAGGCACAT CCTCGAAGGG CTCATGAGCA AAACCGAATT CTTCAAGTTT
CTCGGCAGCT ACCAGAAGGG GACCGAAGTG AGTCATTAG
 
Protein sequence
MILILKKKAT AEQMERLKDV LRSEGYLVKE IAGVDEKILG VVGTMYKETA FYESLPGVER 
AVPISKPYKL VSRELHPAPS VIKVGDVTIG GDRLVVIAGP CGVEDRKRTL DIARTVRKHG
AVLFRGGAFK PRTSPYSFQG LGEEGLKILR EVREETGLGV VTEITSPSQA DLMVKYVDVV
QVGARNMQNF ELLKSVGRIG KPVLLKRGLS ATIEEWLMSA EYVLSEGNDQ VILCERGIRT
FERYTRNTLD LTAVPVIKKL THLPIIVDPS HATGIREKVS PMARASIAAG ADGLIIEVHT
EPDKALSDGP QSLYPEQFEQ LMRDLYVIAP VVGKQVDYAY LDKAAIMKPR KGKGKAAPMV
VYSGVPGSFS HKACLQFFGT EVPIRECTCF REVFDSVAGE QAAFGVIPVE NSLTGSIHEN
YDLLLEYAIM IVGELTLRIK HNLLGHLDSS IEGIERVYSH PQVFQQCREY LDKHPAWDQI
ACKDTASAVR KVEEAGDAKE AAIAGVGAVQ TRRMTVLKES IETNPRNFTR FVVISKNESL
PGPKNKSSLI YSVSDKPGAL FETLRIFAEN NINLVKLESR PIHSRPWEYL FYADLEVDVT
EDGRRHILEG LMSKTEFFKF LGSYQKGTEV SH