Gene EcSMS35_2218 details


Gene Information

Locus tag: EcSMS35_2218
Symbol: pflB2
ID: 6144248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2235140
End bp: 2237422
Gene Length: 2283 bp
Protein Length: 760 aa
Translation table: 11
GC content: 51%
IMG OID: 641617094
Product: formate acetyltransferase
Protein accession: YP_001744268
Protein GI: 170681780
COG category: [C] Energy production and conversion
COG ID: [COG1882] Pyruvate-formate lyase
TIGRFAM ID: [TIGR01255] formate acetyltransferase 1
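
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose
    length includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(2235140, 2237422)
print(length)                  # 2283
print(protein_length(length))  # 760
```

Under these assumptions the start and end positions reproduce the listed gene length of 2283 bp and protein length of 760 aa.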


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.673985
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAGC TTAATGAAAA GTTAGCCACA GCCTGGGAAG GTTTTACCAA AGGTGACTGG 
CAGAATGAAG TAAACGTCCG TGACTTCATT CAGAAAAACT ACACTCCGTA CGAGGGTGAC
GAGTCCTTCC TGGCTGGCGC TACTGAAGCG ACCACCACCC TGTGGGACAA AGTAATGGAA
GGCGTTAAAC TGGAAAACCG CACTCACGCG CCAGTTGACT TTGACACCGC TGTTGCTTCC
ACCATCACCT CTCACGACGC TGGCTACATC AACAAAGCGT TGGAAAAAGT TGTTGGTCTG
CAGACTGAAG CTCCGCTGAA ACGTGCTCTT ATCCCGTTCG GTGGTATCAA AATGATCGAA
GGTTCCTGCA AAGCGTACAA CCGCGAACTG GACCCGATGA TCAAAAAAAT CTTCACTGAA
TACCGTAAAA CTCACAACCA GGGCGTGTTC GACGTTTACA CTCCGGACAT CCTGCGTTGC
CGTAAATCCG GTGTTCTGAC CGGTCTGCCA GATGCTTATG GCCGTGGTCG TATCATCGGT
GACTACCGTC GCGTTGCGCT GTACGGTATC GACTACCTGA TGAAAGACAA ATACGCTCAG
TTCACCTCTC TGCAGGCTGA TCTGGAAAAC GGCGTAAACC TGGAACAGAC TATCCGTCTG
CGCGAAGAAA TCGCTGAACA GCACCGCGCT CTGGGTCAGA TGAAAGAAAT GGCTGCGAAA
TACGGCTACG ACATCTCTGG TCCGGCTACC AACGCTCAGG AAGCTATCCA GTGGACTTAC
TTCGGCTACC TGGCTGCTGT TAAGTCTCAG AACGGTGCTG CAATGTCCTT CGGTCGTACC
TCCACCTTCC TGGATGTGTA CATCGAACGT GACCTGAAAG CTGGCAAGAT CACCGAACAA
GAAGCGCAGG AAATGGTTGA CCACCTGGTC ATGAAACTGC GTATGGTTCG CTTCCTGCGT
ACTCCGGAAT ACGATGAACT GTTCTCTGGC GACCCAATCT GGGCAACCGA ATCTATCGGT
GGTATGGGCC TCGACGGTCG TACCCTGGTT ACCAAAAACA GCTTCCGTTT CCTGAACACC
CTGTACACCA TGGGTCCGTC TCCGGAACCG AACATGACCA TTCTGTGGTC CGAAAAACTG
CCGCTGAACT TCAAGAAATT CGCTGCTAAA GTGTCCATCG ACACCTCTTC TCTGCAGTAT
GAGAACGATG ACCTGATGCG TCCGGACTTC AACAACGATG ACTACGCTAT CGCTTGCTGC
GTAAGCCCGA TGATCGTTGG TAAACAAATG CAGTTCTTCG GTGCGCGTGC AAACCTGGCG
AAAACCATGC TGTACGCAAT CAACGGCGGC GTTGACGAAA AACTGAAAAT GCAGGTTGGT
CCGAAGTCTG AACCGATCAA AGGCGATCTC CTGAACTACG ATGAAGTGAT GGAGCGCATG
GATCACTTCA TGGACTGGCT GGCTAAACAG TACATCACTG CACTGAACAT CATCCACTAC
ATGCACGACA AGTACAGCTA CGAAGCCTCT CTGATGGCGC TGCACGACCG TGACGTTATC
CGCACCATGG CGTGTGGTAT CGCTGGTCTG TCCGTTGCTG CTGACTCCCT GTCTGCAATC
AAATATGCGA AAGTTAAACC GATTCGTGAC GAAGACGGTC TGGCTATCGA CTTCGAAATC
GAAGGCGAAT ACCCGCAGTT TGGTAACAAC GATCCGCGTG TAGATGACCT GGCTGTTGAC
CTGGTAGAAC GTTTCATGAA GAAAATTCAG AAACTGCACA CCTACCGTGA CGCTATCCCG
ACTCAGTCTG TTCTGACCAT CACTTCTAAC GTTGTGTATG GTAAGAAAAC TGGTAACACC
CCAGACGGTC GTCGTGCTGG CGCGCCGTTC GGGCCGGGTG CTAACCCGAT GCACGGTCGT
GACCAGAAAG GTGCAGTAGC CTCTCTGACT TCCGTTGCTA AACTGCCGTT TGCTTACGCT
AAAGATGGTA TCTCCTACAC CTTCTCTATC GTTCCGAACG CACTGGGTAA AGACGACGAA
GTTCGTAAGA CCAACCTGGC TGGTCTGATG GATGGTTACT TCCACCACGA AGCGTCCATC
GAAGGTGGTC AGCACCTGAA CGTTAACGTG ATGAACCGTG AAATGCTGCT CGACGCGATG
GAAAACCCGG AAAAATATCC GCAGCTGACC ATCCGTGTAT CTGGCTACGC AGTACGTTTC
AACTCGCTGA CTAAAGAACA GCAGCAGGAC GTTATTACTC GTACCTTCAC TCAATCTATG
TAA
 
Protein sequence
MSELNEKLAT AWEGFTKGDW QNEVNVRDFI QKNYTPYEGD ESFLAGATEA TTTLWDKVME 
GVKLENRTHA PVDFDTAVAS TITSHDAGYI NKALEKVVGL QTEAPLKRAL IPFGGIKMIE
GSCKAYNREL DPMIKKIFTE YRKTHNQGVF DVYTPDILRC RKSGVLTGLP DAYGRGRIIG
DYRRVALYGI DYLMKDKYAQ FTSLQADLEN GVNLEQTIRL REEIAEQHRA LGQMKEMAAK
YGYDISGPAT NAQEAIQWTY FGYLAAVKSQ NGAAMSFGRT STFLDVYIER DLKAGKITEQ
EAQEMVDHLV MKLRMVRFLR TPEYDELFSG DPIWATESIG GMGLDGRTLV TKNSFRFLNT
LYTMGPSPEP NMTILWSEKL PLNFKKFAAK VSIDTSSLQY ENDDLMRPDF NNDDYAIACC
VSPMIVGKQM QFFGARANLA KTMLYAINGG VDEKLKMQVG PKSEPIKGDL LNYDEVMERM
DHFMDWLAKQ YITALNIIHY MHDKYSYEAS LMALHDRDVI RTMACGIAGL SVAADSLSAI
KYAKVKPIRD EDGLAIDFEI EGEYPQFGNN DPRVDDLAVD LVERFMKKIQ KLHTYRDAIP
TQSVLTITSN VVYGKKTGNT PDGRRAGAPF GPGANPMHGR DQKGAVASLT SVAKLPFAYA
KDGISYTFSI VPNALGKDDE VRKTNLAGLM DGYFHHEASI EGGQHLNVNV MNREMLLDAM
ENPEKYPQLT IRVSGYAVRF NSLTKEQQQD VITRTFTQSM
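
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the method used to produce the record. It assumes the elongation codon assignments of NCBI translation table 11, which match the standard genetic code (the two tables differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGTCCGAGC TTAATGAAAA G"))  # MSELNEK
```

The output matches the first seven residues of the protein sequence. Pasting the full gene sequence into `translate` would allow the same comparison over the whole record.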