Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4398 |
Symbol | pflD |
ID | 6145430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4490179 |
End bp | 4492476 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619219 |
Product | putative formate acetyltransferase 2 |
Protein accession | YP_001746343 |
Protein GI | 170680527 |
COG category | [C] Energy production and conversion |
COG ID | [COG1882] Pyruvate-formate lyase |
TIGRFAM ID | [TIGR01774] pyruvate formate-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.514532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAATC GTATCTCTCG CCTCAAAACT GCACTGTTTG CCAATACCCG TGAAATCTCG CTGGAGCGGG CGCTGCTTTA TACCGCCAGC CATCGGCAAA CCGAAGGCGA ACCGGTGATA TTGCGCCGGG CGAAAGCAAC AGCGTATATC CTTGAACACG TTGAAATTTC GATTCGTGAT GAAGAACTGA TTGCCGGTAA CCGCACCGTA AAACCACGCG CCGGGATTAT GTCGCCGGAA ATGGACCCTT ACTGGCTGCT GAAAGAGCTG GATCAATTCC CGACGCGTCC GCAGGACCGC TTTGCTATCA GCGAAGAAGA TAAACGTATC TACCGCGAAG AGTTGTTCCC GTACTGGGAA AAACGTTCGA TGAAAGATTT CATTAACGGG CAGATGACGG ATGAAGTAAA AGCCGCGACC AGCACGCAGA TTTTCAGCAT CAACCAGACG GATAAAGGCC AGGGGCACAT TATTATTGAT TACCCGCGCC TGCTGAATCA CGGGCTGGGG GAGCTGGTGG CACAGATGCA GCAACATTGT CAGCAACAGC CGGAGAATCA CTTTTATCAG GCAGCGCTGT TACTGCTGGA AGCCTCGCAG AAACATATTT TGCGTTACGC CGAACTGGCG GAAACGATGG CGGCAAGCTG CACTGATGGC CAGCGTCGCG AAGAACTGCT GACTATTGCG GAGATCTCCC GCCATAACGC CGAACATAAG CCGCAGACGT TCTGGCAGGC GTGCCAGTTA TTCTGGTACA TGAACATCAT TCTGCAATAC GAATCCAACG CCAGTTCGCT GTCTTTAGGG CGCTTCGACC AGTATATGTT GCCGTTCTAT CAGGCATCAT TAACCCAGGG CGAAGATCCG GCGTTCCTGA AAGAACTGCT CGAATCTTTA TGGGTGAAAT GCAACGACAT CGTGCTGTTG CGCTCCACCA GTAGCGCGCG TTATTTCGCC GGTTTCCCGA CCGGCTATAC CGCACTGCTC GGCGGGTTAA CCGAGAACGG ACGTAGCGCG GTGAACGTGC TTTCGTTCCT TTGCCTTGAC GCCTATCAAA GCGTGCAATT ACCGCAACCG AACCTGGGGG TGCGCACTAA CGCCTTGATC GACACGCCGT TCCTGATGAA AACCGCCGAA ACCATTCGCC TCGGCACCGG TATTCCGCAA ATCTTTAACG ATGAAGTAGT GGTGCCAGCG TTCCTCAACC GTGGCGTTTC GCTGGAAGAT GCACGCGACT ATTCCGTAGT GGGCTGTGTG GAATTATCTA TTCCCGGCAG AACCTACGGC TTGCATGACA TCGCGATGTT TAATCTGCTG AAAGTGATGG AAATCTGCCT GCATGAAAAT GAAGGCAATG CTGCGCTGAC TTATGAAGGT TTACTGGAGC AGATCCGCGC CAAGATCAGC CACTACATCA CCCTGATGGT TGAGGGCAGC AATATTTGCG ATATCGGTCA CCGTGACTGG GCACCTGTAC CGCTGCTCTC ATCGTTTATC AGCGATTGTC TGGAAAAAGG CCGCGATATT ACCGATGGCG GTGCGCGTTA TAACTTCTCC GGCGTACAGG GGATCGGTAT CGCCAACCTG AGCGATTCTC TCCATGCGTT GAAAGGTATG GTTTTTGAGC AACAGCGTTT AAGTTTTGAC GAATTGCTGT CGGTATTAAA AGCCAACTTC GCAACGCCAG AAGGCGAAAA AGTCCGTGCT CGCTTAATTA ACCGCTTCGA GAAATACGGT AACGATATCG ACGAGGTGGA TAACATCAGC GCCGAACTGT TACGCCACTA CTGCAAAGAA GTGGAAAAAT ACCAGAACCC GCGCGGCGGC TACTTCACGC CGGGATCGTA TACCGTTTCT GCTCACGTCC CGTTGGGATC GGTGGTTGGC GCGACGCCAG ACGGTCGTTT TGCCGGAGAA CAGCTGGCAG ACGGCGGCTT GTCACCCATG CTGGGCCAGG ACGCACAAGG GCCAACGGCG GTACTGAAGT CAGTCAGTAA GCTCGATAAT ACGCTGCTGT CTAACGGTAC GTTGCTGAAC GTGAAATTCA CTCCGGCGAC CCTGGAAGGT GAAGCGGGAT TACGCAAACT GGCCGACTTC TTACGGGCGT TTACCCAGCT TAAGTTGCAG CATATTCAGT TTAACGTGGT GAACGCCGAC ACGTTGCGGG AAGCGCAACA GCGCCCACAA GATTATGCCG GGCTGGTGGT GCGCGTTGCC GGATACAGCG CCTTCTTTGT CGAATTGTCG AAGGAGATCC AGGATGACAT CATCCGCCGG ACAGCGCATC AGCTGTAA
|
Protein sequence | MTNRISRLKT ALFANTREIS LERALLYTAS HRQTEGEPVI LRRAKATAYI LEHVEISIRD EELIAGNRTV KPRAGIMSPE MDPYWLLKEL DQFPTRPQDR FAISEEDKRI YREELFPYWE KRSMKDFING QMTDEVKAAT STQIFSINQT DKGQGHIIID YPRLLNHGLG ELVAQMQQHC QQQPENHFYQ AALLLLEASQ KHILRYAELA ETMAASCTDG QRREELLTIA EISRHNAEHK PQTFWQACQL FWYMNIILQY ESNASSLSLG RFDQYMLPFY QASLTQGEDP AFLKELLESL WVKCNDIVLL RSTSSARYFA GFPTGYTALL GGLTENGRSA VNVLSFLCLD AYQSVQLPQP NLGVRTNALI DTPFLMKTAE TIRLGTGIPQ IFNDEVVVPA FLNRGVSLED ARDYSVVGCV ELSIPGRTYG LHDIAMFNLL KVMEICLHEN EGNAALTYEG LLEQIRAKIS HYITLMVEGS NICDIGHRDW APVPLLSSFI SDCLEKGRDI TDGGARYNFS GVQGIGIANL SDSLHALKGM VFEQQRLSFD ELLSVLKANF ATPEGEKVRA RLINRFEKYG NDIDEVDNIS AELLRHYCKE VEKYQNPRGG YFTPGSYTVS AHVPLGSVVG ATPDGRFAGE QLADGGLSPM LGQDAQGPTA VLKSVSKLDN TLLSNGTLLN VKFTPATLEG EAGLRKLADF LRAFTQLKLQ HIQFNVVNAD TLREAQQRPQ DYAGLVVRVA GYSAFFVELS KEIQDDIIRR TAHQL
|
| |