Gene EcSMS35_3410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3410 
SymbolpflB1 
ID6144961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3490352 
End bp3492646 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content53% 
IMG OID641618239 
Productformate acetyltransferase 
Protein accessionYP_001745388 
Protein GI170681495 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01255] formate acetyltransferase 1 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTAG ATATTGATAC CAGCGATAAG CTGTACGCCG ACGCATGGCT TGGCTTTAAA 
GGTACGGACT GGAAAAACGA AATTAATGTC CGCGATTTTA TTCAACATAA CTATACACCG
TATGAAGGTG ATGAATCTTT TCTCGCCGAA GCAACGCCAG CCACCACGGA ATTGTGGGAA
AAAGTGATGG AAGGCATCCG TATCGAAAAC GCAACCCACG CGCCGGTTGA TTTCGATACT
AATATTGCTA CCACCATTAC CGCTCATGAT GCGGGATATA TTAACCAGCC GCTGGAAAAA
ATAGTGGGCC TGCAAACGGA TGCGCCATTA AAGCGTGCGC TGCATCCGTT CGGTGGCATT
AATATGATTA AAAGTTCATT CCACGCCTAT GGTCGTGAAA TGGACAGTGA ATTTGAATAT
CTGTTTACCG ATCTGCGTAA AACCCATAAC CAGGGCGTAT TTGATGTTTA CTCACCGGAT
ATGCTGCGCT GCCGTAAATC TGGCGTGCTG ACCGGTTTAC CAGATGGCTA TGGTCGTGGG
CGCATTATCG GTGACTATCG CCGCGTGGCG CTATATGGCA TCAGTTATCT GGTGCGTGAA
CGCGAACTGC AATTTGCCGA TCTCCAGTCC CGTCTGGAAA AAGGCGAGGA TCTGGAAGCC
ACCATCCGTC TGCGTGAGGA GCTGGCGGAA CATCGTCGTG CGCTGTTGCA GATTCAGGAA
ATGGCGGCGA AATACGGTTT TGATATCTCC CGTCCGGCGC AGAATGCGCA GGAAGCGGTG
CAGTGGCTCT ACTTCGCTTA CCTGGCGGCA GTGAAATCAC AGAACGGCGG CGCAATGTCT
CTTGGCCGTA CGGCATCGTT CCTCGATATC TACATTGAGC GCGACTTTAA AGCTGGCGTA
CTCAATGAAC AGCAGGCACA GGAACTGATC GACCACTTCA TTATGAAGAT CCGTATGGTG
CGCTTCCTGC GTACGCCGGA ATTTGATTCG CTGTTCTCCG GCGACCCAAT CTGGGCGACG
GAAGTGATCG GCGGGATGGG GCTGGACGGT CGTACGCTGG TGACCAAAAA CTCTTTCCGT
TATCTGCACA CCCTGCACAC TATGGGGCCA GCACCGGAAC CAAACCTGAC CATTCTGTGG
TCGGAAGAAT TACCGATCGC CTTCAAAAAA TATGCCGCAC AGGTGTCTAT CGTCACCTCT
TCCTTGCAGT ATGAAAACGA CGATCTGATG CGTACTGACT TCAACAGCGA CGATTACGCG
ATTGCCTGCT GCGTTAGCCC GATGGTGATT GGTAAGCAAA TGCAGTTCTT TGGTGCACGC
GCTAACCTGG CGAAAACGCT GCTCTACGCA ATTAACGGCG GGGTGGATGA GAAGCTGAAG
ATTCAGGTCG GGCCGAAAAC AGCACCATTG ATGGACGATG TGCTGGACTA CGACAAAGTG
ATGGACAGCC TCGATCATTT CATGGATTGG CTGGCAGTGC AGTACATCAG CGCGCTGAAT
ATCATCCACT ACATGCACGA CAAGTACAGC TATGAAGCCT CGCTGATGGC GCTGCACGAT
CGTGATGTTT ATCGCACGAT GGCATGCGGC ATCGCGGGCC TGTCGGTGGC GACGGACTCC
CTGTCTGCCA TCAAATATGC CCGCGTGAAA CCAATCCGTG ACGAAAACGG CCTGGCGGTG
GACTTTGAAA TCGACGGCGA ATATCCGCAG TACGGTAACA ACGACGAGCG CGTAGACAGC
ATTGCCTGCG ACCTGGTTGA ACGCTTTATG AAGAAAATTA AAGCGCTGCC AACCTATCGC
AACGCCGTCC CTACCCAGTC GATTCTGACT ATCACTTCTA ACGTGGTGTA CGGCCAGAAA
ACCGGTAACA CGCCGGACGG TCGTCGCGCC GGAACACCGT TCGCACCGGG CGCTAACCCG
ATGCACGGTC GTGACCGCAA AGGTGCGGTA GCCTCGCTGA CGTCGGTGGC GAAACTGCCG
TTCACCTACG CCAAAGACGG TATTTCGTAC ACCTTCTCAA TTGTCCCGGC GGCGCTGGGC
AAAGAAGATC CAGTACGTAA AACCAACCTT GTCGGCCTGC TGGATGGGTA TTTCCACCAC
GAAGCGGATG TCGAAGGCGG TCAACACCTC AACGTCAACG TAATGAATCG GGAAATGCTG
CTGGATGCCA TCGAGCACCC GGAAAAATAT CCTAACCTGA CAATCCGTGT CTCTGGCTAC
GCCGTGCGCT TCAACGCACT GACCCGTGAA CAGCAGCAGG ATGTTATTTC ACGTACCTTT
ACCCAGGCGC TCTGA
 
Protein sequence
MKVDIDTSDK LYADAWLGFK GTDWKNEINV RDFIQHNYTP YEGDESFLAE ATPATTELWE 
KVMEGIRIEN ATHAPVDFDT NIATTITAHD AGYINQPLEK IVGLQTDAPL KRALHPFGGI
NMIKSSFHAY GREMDSEFEY LFTDLRKTHN QGVFDVYSPD MLRCRKSGVL TGLPDGYGRG
RIIGDYRRVA LYGISYLVRE RELQFADLQS RLEKGEDLEA TIRLREELAE HRRALLQIQE
MAAKYGFDIS RPAQNAQEAV QWLYFAYLAA VKSQNGGAMS LGRTASFLDI YIERDFKAGV
LNEQQAQELI DHFIMKIRMV RFLRTPEFDS LFSGDPIWAT EVIGGMGLDG RTLVTKNSFR
YLHTLHTMGP APEPNLTILW SEELPIAFKK YAAQVSIVTS SLQYENDDLM RTDFNSDDYA
IACCVSPMVI GKQMQFFGAR ANLAKTLLYA INGGVDEKLK IQVGPKTAPL MDDVLDYDKV
MDSLDHFMDW LAVQYISALN IIHYMHDKYS YEASLMALHD RDVYRTMACG IAGLSVATDS
LSAIKYARVK PIRDENGLAV DFEIDGEYPQ YGNNDERVDS IACDLVERFM KKIKALPTYR
NAVPTQSILT ITSNVVYGQK TGNTPDGRRA GTPFAPGANP MHGRDRKGAV ASLTSVAKLP
FTYAKDGISY TFSIVPAALG KEDPVRKTNL VGLLDGYFHH EADVEGGQHL NVNVMNREML
LDAIEHPEKY PNLTIRVSGY AVRFNALTRE QQQDVISRTF TQAL