Gene ECD_03836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03836 
SymbolpflD 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4052542 
End bp4054839 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content53% 
IMG OID 
Productpredicted formate acetyltransferase 2 (pyruvate formate lyase II) 
Protein accessionACT45629 
Protein GI253979959 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAATC GTATCTCTCG CCTCAAAACT GCACTGTTTG CCAATACCCG TGAAATCTCG 
CTGGAGCGGG CGCTGCTTTA TACCGCCAGC CATCGGCAAA CCGAAGGCGA ACCGGTGATA
TTGCGCCGGG CGAAAGCAAC AGCGTATATC CTTGAACATG TTGAAATTTC GATTCGTGAT
GAAGAACTGA TTGCCGGTAA CCGCACCGTA AAACCGCGCG CCGGGATTAT GTCGCCGGAA
ATGGACCCTT ACTGGCTGCT GAAAGAGCTG GATCAATTCC CGACGCGTCC GCAGGACCGC
TTTGCTATCA GCGAAGAAGA TAAACGTATC TACCGCGAAG AGTTGTTCCC GTACTGGGAA
AAACGTTCGA TGAAAGATTT CATCAACGGG CAGATGACAG ATGAAGTAAA AGCCGCGACC
AGCACGCAGA TTTTCAGCAT CAACCAGACA GATAAAGGCC AGGGGCACAT TATTATTGAT
TACCCACGCC TGCTGAATCA CGGGCTGGGG GAGCTGGTAG CACAGATGCA GCAACATTGT
CAGCAACAGC CGGAGAATCA CTTTTATCAG GCAGCGCTGT TACTGCTGGA AGCCTCGCAG
AAACACATTT TGCGTTACGC CGAACTGGCG GAAACGATGG CGGCAAACTG CACAGATGCC
CAGCGTCGCG AAGAGCTGCT GACTATTGCG GAGATCTCCC GCCATAACGC GCAACATAAG
CCGCAGACGT TCTGGCAGGC GTGCCAGTTA TTCTGGTACA TGAACATCAT TCTGCAATAC
GAATCCAACG CCAGTTCGCT ATCGTTGGGG CGCTTCGACC AGTATATGTT GCCGTTCTAT
CAGACATCAT TAACCCAGGG CGAAGATGCG GCGTTCCTGA AAGAACTGCT CGAATCTTTA
TGGGTGAAAT GCAACGACAT CGTGCTGTTG CGCTCCACCA GCAGCGCGCG TTATTTCGCC
GGTTTCCCGA CCGGCTATAC CGCACTGCTC GGCGGGTTAA CCGAGAACGG ACGTAGCGCG
GTGAACGTGC TTTCGTTCCT TTGCCTTGAC GCCTATCAAA GCGTGCAATT ACCGCAACCG
AACCTCGGCG TGCGCACTAA CGCCTTGATC GACACGCCGT TCCTGATGAA AACCGCCGAA
ACCATTCGCC TCGGCACCGG TATTCCGCAA ATCTTTAACG ATGAAGTGGT GGTGCCAGCG
TTCCTCAACC GTGGCGTTTC GCTGGAAGAT GCGCGCGACT ATTCCGTAGT GGGCTGTGTG
GAATTATCTA TTCCCGGCAG AACCTACGGC TTGCATGACA TCGCGATGTT TAACCTGCTG
AAAGTGATGG AAATCTGCCT GCATGAAAAT GAAGGCAATG CCGCGCTGAC TTATGAAGGT
TTACTGGAAC AGATCCGTGC CAAGATCAGC CACTACATCA CCCTGATGGT TGAAGGCAGT
AATATTTGCG ATATCGGCCA TCGCGACTGG GCACCTGTAC CGCTGCTCTC GTCTTTTATC
AGCGATTGTC TGGAAAAAGG CCGCGATATT ACCGATGGCG GCGCGCGTTA TAACTTCTCC
GGCGTACAGG GGATCGGTAT CGCCAACCTG AGCGATTCTC TCCATGCGTT GAAAGGGATG
GTTTTTGATC AACAGCGTTT AAGTTTTGAC GAATTGCTGT CGGTATTAAA AGCCAACTTT
GCAACGCCAG AAGGCGAAAA AGTCCGCGCT CGCTTAATTA ACCGCTTTGA GAAATACGGT
AACGATATCG ACGAGGTGGA TAACATTAGC GCCGAACTGT TGCGCCACTA CTGCAAAGAA
GTGGAAAAAT ACCAGAACCC GCGCGGCGGC TACTTCACGC CGGGATCGTA TACCGTTTCT
GCTCACGTTC CGTTGGGATC GGTGGTTGGC GCGACGCCAG ACGGTCGTTT TGCCGGAGAA
CAGCTGGCAG ACGGCGGCTT GTCACCTATG CTGGGTCAGG ACGCACAAGG GCCAACGGCG
GTACTGAAGT CAGTCAGTAA GCTCGATAAC ACACTGCTGT CTAACGGTAC ATTGCTGAAC
GTGAAATTCA CTCCGGCGAC CCTGGAAGGT GAAGCGGGAT TACGCAAACT GGCCGACTTC
TTACGGGCGT TTACCCAGCT TAAGTTACAA CATATTCAGT TTAACGTGGT GAACGCCGAC
ACGTTGCGGG AAGCGCAACA GCGCCCACAA GATTATGCCG GGCTGGTGGT GCGCGTTGCC
GGATACAGCG CCTTCTTTGT CGAACTGTCG AAGGAGATCC AGGATGACAT CATCCGCCGG
ACAGCGCATC AGCTGTAA
 
Protein sequence
MTNRISRLKT ALFANTREIS LERALLYTAS HRQTEGEPVI LRRAKATAYI LEHVEISIRD 
EELIAGNRTV KPRAGIMSPE MDPYWLLKEL DQFPTRPQDR FAISEEDKRI YREELFPYWE
KRSMKDFING QMTDEVKAAT STQIFSINQT DKGQGHIIID YPRLLNHGLG ELVAQMQQHC
QQQPENHFYQ AALLLLEASQ KHILRYAELA ETMAANCTDA QRREELLTIA EISRHNAQHK
PQTFWQACQL FWYMNIILQY ESNASSLSLG RFDQYMLPFY QTSLTQGEDA AFLKELLESL
WVKCNDIVLL RSTSSARYFA GFPTGYTALL GGLTENGRSA VNVLSFLCLD AYQSVQLPQP
NLGVRTNALI DTPFLMKTAE TIRLGTGIPQ IFNDEVVVPA FLNRGVSLED ARDYSVVGCV
ELSIPGRTYG LHDIAMFNLL KVMEICLHEN EGNAALTYEG LLEQIRAKIS HYITLMVEGS
NICDIGHRDW APVPLLSSFI SDCLEKGRDI TDGGARYNFS GVQGIGIANL SDSLHALKGM
VFDQQRLSFD ELLSVLKANF ATPEGEKVRA RLINRFEKYG NDIDEVDNIS AELLRHYCKE
VEKYQNPRGG YFTPGSYTVS AHVPLGSVVG ATPDGRFAGE QLADGGLSPM LGQDAQGPTA
VLKSVSKLDN TLLSNGTLLN VKFTPATLEG EAGLRKLADF LRAFTQLKLQ HIQFNVVNAD
TLREAQQRPQ DYAGLVVRVA GYSAFFVELS KEIQDDIIRR TAHQL