Gene ECH74115_5411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5411 
SymbolpflD 
ID6971655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5053990 
End bp5056287 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content53% 
IMG OID643389064 
Productputative formate acetyltransferase 2 
Protein accessionYP_002273473 
Protein GI209398325 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01774] pyruvate formate-lyase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC GTATCTCTCG CCTCAAAACT GCACTGTTTG CCAATACCCG TGAAATCTCG 
CTGGAGCGTG CGCTGCTTTA TACCGCCAGC CATCGGCAAA CCGAAGGCGA ACCGGTGATA
TTGCGCCGGG CGAAAGCAAC AGCGTATATC CTTGAACATG TTGAAATTTC GATTCGTGAT
GAAGAACTGA TTGCCGGTAA CCGCACCGTA AAACCGCGCG CCGGGATTAT GTCGCCGGAA
ATGGACCCTT ACTGGCTGCT GAAAGAGCTG GGTCAATTCC CGACGCGTCC GCAGGACCGT
TTTGCTATCA GCGAAGAAGA TAAACGTATC TACCGCGAAG AGTTGTTCCC GTACTGGGAA
AAACGTTCGA TGAAAGATTT CATCAACGGG CAGATGACGG ATGAAGTAAA AGCCGCGACT
AGCACGCAGA TTTTCAGCAT CAACCAGACG GATAAAGGCC AGGGGCACAT TATTATTGAT
TACCCGCGCC TGCTGAATCA CGGGCTGGGG GAGCTGGTGG CACAGATGCA GCAACATTGT
CAGCAACAGC CGGAGAATCA CTTTTATCAG GCAGCGCTGT TACTGCTCGA AGCCTCGCAG
AAACACATTT TGCGTTACGC CGAACTGGCG GAAACGATGG CGGCAAGCTG CACTGATGCC
CAGCGTCGCG AAGAACTGTT GACTATTGCG GAGATCTCCC GCCATAACGC CGAACATAAA
CCGCAGACGT TCTGGCAGGC GTGCCAGTTA TTCTGGTACA TGAACATCAT TCTGCAATAC
GAATCCAACG CCAGTTCGCT GTCTTTAGGG CGCTTCGACC AGTATATGTT GCCGTTCTAT
CAGGCATCTT TAACCCAGGG CGAAGATCCG GCGTTCCTGA AAGAACTGCT CGAATCTTTA
TGGGTGAAAT GCAACGACAT TGTGCTGTTG CGCTCCACCA GCAGCGCGCG TTATTTCGCA
GGTTTCCCGA CTGGCTATAC CGCACTGCTC GGCGGGTTAA CCGAGAACGG ACGTAGCGCG
GTGAACGTGC TTTCGTTCCT CTGCCTCGAT GCCTATCAAA GCGTACAATT ACCGCAACCG
AACCTAGGGG TGCGCACTAA CGCCTTGATC GACACGCCGT TCCTGATGAA AACCGCAGAA
ACCATTCGCC TCGGCACCGG TATTCCGCAA ATCTTTAACG ATGAAGTGGT GGTGCCAGCG
TTCCTCAACC GTGGCGTTTC GCTGGAAGAT GCGCGCGACT ATTCCGTAGT GGGCTGTGTG
GAATTATCTA TTCCCGGCAG AACCTACGGC TTGCATGACA TCGCGATGTT TAATCTGCTG
AAAGTGATGG AAATCTGCCT GCATGAAAAT GAAGGCAATG CTGCGCTGAC TTATGAAGGT
TTACTGGAGC AGATCCGCGC CAAGATCAGC CACTACATCA CCCTGATGGT TGAGGGCAGC
AATATTTGTG ATATCGGCCA TCGCGACTGG GCACCTGTAC CGCTGCTCTC GTCTTTTATC
AGCGATTGTC TGGAAAAAGG TCGCGATATT ACCGATGGCG GCGCGCGTTA TAACTTCTCC
GGCGTACAGG GAATTGGTAT CGCCAACCTG AGCGATTCTC TCCATGCGTT GAAAGGGATG
GTTTTTGATC AACAGCGTTT AAGTTTTGAC GAATTGCTGT CGGTATTAAA AGCCAACTTT
GCAACGCCAG AAGGCGAAAA AGTCCGCGCT CGCTTAATTA ACCGCTTTGA GAAATACGGT
AACGATATCG ACGAGGTGGG TAACATTAGC GCCGAACTGT TGCGCCACTA CTGCAAAGAA
GTGGAAAAAT ACCAGAACCC GCGCGGCGGT TACTTCACGC CGGGATCGTA TACCGTTTCT
GCTCACGTCC CGTTGGGATC GGTGGTTGGC GCGACGCCAG ACGGTCGTTT TGCCGGAGAA
CAGTTGGCAG ACGGCGGCTT GTCACCCATG CTGGGCCAGG ACGCACAAGG GCCAACAGCG
GTACTGAAGT CAGTCAGTAA GCTCGATAAC ACGCTGCTGT CTAACGGTAC GTTGCTGAAC
GTGAAATTCA CTCCGGCGAC CCTGGAAGGT GAAGCGGGAT TACGCAAACT GGCCGACTTC
TTACGGGCGT TTACCCAGCT TAAGTTGCAG CATATTCAGT TTAACGTGGT GAACGCCGAC
ACGTTGCGGG AAGCGCAACA GCGCCCACAA GATTATGCCG GGCTGGTGGT GCGCGTTGCC
GGATACAGCG CCTTCTTTGT CGAACTGTCG AAGGAGATCC AGGATGACAT CATCCGCCGG
ACAGCGCATC AGCTGTAA
 
Protein sequence
MTNRISRLKT ALFANTREIS LERALLYTAS HRQTEGEPVI LRRAKATAYI LEHVEISIRD 
EELIAGNRTV KPRAGIMSPE MDPYWLLKEL GQFPTRPQDR FAISEEDKRI YREELFPYWE
KRSMKDFING QMTDEVKAAT STQIFSINQT DKGQGHIIID YPRLLNHGLG ELVAQMQQHC
QQQPENHFYQ AALLLLEASQ KHILRYAELA ETMAASCTDA QRREELLTIA EISRHNAEHK
PQTFWQACQL FWYMNIILQY ESNASSLSLG RFDQYMLPFY QASLTQGEDP AFLKELLESL
WVKCNDIVLL RSTSSARYFA GFPTGYTALL GGLTENGRSA VNVLSFLCLD AYQSVQLPQP
NLGVRTNALI DTPFLMKTAE TIRLGTGIPQ IFNDEVVVPA FLNRGVSLED ARDYSVVGCV
ELSIPGRTYG LHDIAMFNLL KVMEICLHEN EGNAALTYEG LLEQIRAKIS HYITLMVEGS
NICDIGHRDW APVPLLSSFI SDCLEKGRDI TDGGARYNFS GVQGIGIANL SDSLHALKGM
VFDQQRLSFD ELLSVLKANF ATPEGEKVRA RLINRFEKYG NDIDEVGNIS AELLRHYCKE
VEKYQNPRGG YFTPGSYTVS AHVPLGSVVG ATPDGRFAGE QLADGGLSPM LGQDAQGPTA
VLKSVSKLDN TLLSNGTLLN VKFTPATLEG EAGLRKLADF LRAFTQLKLQ HIQFNVVNAD
TLREAQQRPQ DYAGLVVRVA GYSAFFVELS KEIQDDIIRR TAHQL