Gene EcSMS35_4398 details

Gene Information

Locus tag: EcSMS35_4398
Symbol: pflD
ID: 6145430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4490179
End bp: 4492476
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 53%
IMG OID: 641619219
Product: putative formate acetyltransferase 2
Protein accession: YP_001746343
Protein GI: 170680527
COG category: [C] Energy production and conversion
COG ID: [COG1882] Pyruvate-formate lyase
TIGRFAM ID: [TIGR01774] pyruvate formate-lyase
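
The length fields above can be cross-checked against the coordinates with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 4490179, End bp 4492476.
length = gene_length(4490179, 4492476)
print(length)                  # 2298
print(protein_length(length))  # 765
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.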


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.514532
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAATC GTATCTCTCG CCTCAAAACT GCACTGTTTG CCAATACCCG TGAAATCTCG 
CTGGAGCGGG CGCTGCTTTA TACCGCCAGC CATCGGCAAA CCGAAGGCGA ACCGGTGATA
TTGCGCCGGG CGAAAGCAAC AGCGTATATC CTTGAACACG TTGAAATTTC GATTCGTGAT
GAAGAACTGA TTGCCGGTAA CCGCACCGTA AAACCACGCG CCGGGATTAT GTCGCCGGAA
ATGGACCCTT ACTGGCTGCT GAAAGAGCTG GATCAATTCC CGACGCGTCC GCAGGACCGC
TTTGCTATCA GCGAAGAAGA TAAACGTATC TACCGCGAAG AGTTGTTCCC GTACTGGGAA
AAACGTTCGA TGAAAGATTT CATTAACGGG CAGATGACGG ATGAAGTAAA AGCCGCGACC
AGCACGCAGA TTTTCAGCAT CAACCAGACG GATAAAGGCC AGGGGCACAT TATTATTGAT
TACCCGCGCC TGCTGAATCA CGGGCTGGGG GAGCTGGTGG CACAGATGCA GCAACATTGT
CAGCAACAGC CGGAGAATCA CTTTTATCAG GCAGCGCTGT TACTGCTGGA AGCCTCGCAG
AAACATATTT TGCGTTACGC CGAACTGGCG GAAACGATGG CGGCAAGCTG CACTGATGGC
CAGCGTCGCG AAGAACTGCT GACTATTGCG GAGATCTCCC GCCATAACGC CGAACATAAG
CCGCAGACGT TCTGGCAGGC GTGCCAGTTA TTCTGGTACA TGAACATCAT TCTGCAATAC
GAATCCAACG CCAGTTCGCT GTCTTTAGGG CGCTTCGACC AGTATATGTT GCCGTTCTAT
CAGGCATCAT TAACCCAGGG CGAAGATCCG GCGTTCCTGA AAGAACTGCT CGAATCTTTA
TGGGTGAAAT GCAACGACAT CGTGCTGTTG CGCTCCACCA GTAGCGCGCG TTATTTCGCC
GGTTTCCCGA CCGGCTATAC CGCACTGCTC GGCGGGTTAA CCGAGAACGG ACGTAGCGCG
GTGAACGTGC TTTCGTTCCT TTGCCTTGAC GCCTATCAAA GCGTGCAATT ACCGCAACCG
AACCTGGGGG TGCGCACTAA CGCCTTGATC GACACGCCGT TCCTGATGAA AACCGCCGAA
ACCATTCGCC TCGGCACCGG TATTCCGCAA ATCTTTAACG ATGAAGTAGT GGTGCCAGCG
TTCCTCAACC GTGGCGTTTC GCTGGAAGAT GCACGCGACT ATTCCGTAGT GGGCTGTGTG
GAATTATCTA TTCCCGGCAG AACCTACGGC TTGCATGACA TCGCGATGTT TAATCTGCTG
AAAGTGATGG AAATCTGCCT GCATGAAAAT GAAGGCAATG CTGCGCTGAC TTATGAAGGT
TTACTGGAGC AGATCCGCGC CAAGATCAGC CACTACATCA CCCTGATGGT TGAGGGCAGC
AATATTTGCG ATATCGGTCA CCGTGACTGG GCACCTGTAC CGCTGCTCTC ATCGTTTATC
AGCGATTGTC TGGAAAAAGG CCGCGATATT ACCGATGGCG GTGCGCGTTA TAACTTCTCC
GGCGTACAGG GGATCGGTAT CGCCAACCTG AGCGATTCTC TCCATGCGTT GAAAGGTATG
GTTTTTGAGC AACAGCGTTT AAGTTTTGAC GAATTGCTGT CGGTATTAAA AGCCAACTTC
GCAACGCCAG AAGGCGAAAA AGTCCGTGCT CGCTTAATTA ACCGCTTCGA GAAATACGGT
AACGATATCG ACGAGGTGGA TAACATCAGC GCCGAACTGT TACGCCACTA CTGCAAAGAA
GTGGAAAAAT ACCAGAACCC GCGCGGCGGC TACTTCACGC CGGGATCGTA TACCGTTTCT
GCTCACGTCC CGTTGGGATC GGTGGTTGGC GCGACGCCAG ACGGTCGTTT TGCCGGAGAA
CAGCTGGCAG ACGGCGGCTT GTCACCCATG CTGGGCCAGG ACGCACAAGG GCCAACGGCG
GTACTGAAGT CAGTCAGTAA GCTCGATAAT ACGCTGCTGT CTAACGGTAC GTTGCTGAAC
GTGAAATTCA CTCCGGCGAC CCTGGAAGGT GAAGCGGGAT TACGCAAACT GGCCGACTTC
TTACGGGCGT TTACCCAGCT TAAGTTGCAG CATATTCAGT TTAACGTGGT GAACGCCGAC
ACGTTGCGGG AAGCGCAACA GCGCCCACAA GATTATGCCG GGCTGGTGGT GCGCGTTGCC
GGATACAGCG CCTTCTTTGT CGAATTGTCG AAGGAGATCC AGGATGACAT CATCCGCCGG
ACAGCGCATC AGCTGTAA
 
Protein sequence
MTNRISRLKT ALFANTREIS LERALLYTAS HRQTEGEPVI LRRAKATAYI LEHVEISIRD 
EELIAGNRTV KPRAGIMSPE MDPYWLLKEL DQFPTRPQDR FAISEEDKRI YREELFPYWE
KRSMKDFING QMTDEVKAAT STQIFSINQT DKGQGHIIID YPRLLNHGLG ELVAQMQQHC
QQQPENHFYQ AALLLLEASQ KHILRYAELA ETMAASCTDG QRREELLTIA EISRHNAEHK
PQTFWQACQL FWYMNIILQY ESNASSLSLG RFDQYMLPFY QASLTQGEDP AFLKELLESL
WVKCNDIVLL RSTSSARYFA GFPTGYTALL GGLTENGRSA VNVLSFLCLD AYQSVQLPQP
NLGVRTNALI DTPFLMKTAE TIRLGTGIPQ IFNDEVVVPA FLNRGVSLED ARDYSVVGCV
ELSIPGRTYG LHDIAMFNLL KVMEICLHEN EGNAALTYEG LLEQIRAKIS HYITLMVEGS
NICDIGHRDW APVPLLSSFI SDCLEKGRDI TDGGARYNFS GVQGIGIANL SDSLHALKGM
VFEQQRLSFD ELLSVLKANF ATPEGEKVRA RLINRFEKYG NDIDEVDNIS AELLRHYCKE
VEKYQNPRGG YFTPGSYTVS AHVPLGSVVG ATPDGRFAGE QLADGGLSPM LGQDAQGPTA
VLKSVSKLDN TLLSNGTLLN VKFTPATLEG EAGLRKLADF LRAFTQLKLQ HIQFNVVNAD
TLREAQQRPQ DYAGLVVRVA GYSAFFVELS KEIQDDIIRR TAHQL