Gene Information |
Locus tag | EcSMS35_2218 |
Symbol | pflB2 |
ID | 6144248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2235140 |
End bp | 2237422 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617094 |
Product | formate acetyltransferase |
Protein accession | YP_001744268 |
Protein GI | 170681780 |
COG category | [C] Energy production and conversion |
COG ID | [COG1882] Pyruvate-formate lyase |
TIGRFAM ID | [TIGR01255] formate acetyltransferase 1 |
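
The start and end coordinates, gene length, and protein length listed above agree with each other, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (EcSMS35_2218).
length = gene_length(2235140, 2237422)
print(length, protein_length(length))  # prints: 2283 760
```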
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.673985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGC TTAATGAAAA GTTAGCCACA GCCTGGGAAG GTTTTACCAA AGGTGACTGG CAGAATGAAG TAAACGTCCG TGACTTCATT CAGAAAAACT ACACTCCGTA CGAGGGTGAC GAGTCCTTCC TGGCTGGCGC TACTGAAGCG ACCACCACCC TGTGGGACAA AGTAATGGAA GGCGTTAAAC TGGAAAACCG CACTCACGCG CCAGTTGACT TTGACACCGC TGTTGCTTCC ACCATCACCT CTCACGACGC TGGCTACATC AACAAAGCGT TGGAAAAAGT TGTTGGTCTG CAGACTGAAG CTCCGCTGAA ACGTGCTCTT ATCCCGTTCG GTGGTATCAA AATGATCGAA GGTTCCTGCA AAGCGTACAA CCGCGAACTG GACCCGATGA TCAAAAAAAT CTTCACTGAA TACCGTAAAA CTCACAACCA GGGCGTGTTC GACGTTTACA CTCCGGACAT CCTGCGTTGC CGTAAATCCG GTGTTCTGAC CGGTCTGCCA GATGCTTATG GCCGTGGTCG TATCATCGGT GACTACCGTC GCGTTGCGCT GTACGGTATC GACTACCTGA TGAAAGACAA ATACGCTCAG TTCACCTCTC TGCAGGCTGA TCTGGAAAAC GGCGTAAACC TGGAACAGAC TATCCGTCTG CGCGAAGAAA TCGCTGAACA GCACCGCGCT CTGGGTCAGA TGAAAGAAAT GGCTGCGAAA TACGGCTACG ACATCTCTGG TCCGGCTACC AACGCTCAGG AAGCTATCCA GTGGACTTAC TTCGGCTACC TGGCTGCTGT TAAGTCTCAG AACGGTGCTG CAATGTCCTT CGGTCGTACC TCCACCTTCC TGGATGTGTA CATCGAACGT GACCTGAAAG CTGGCAAGAT CACCGAACAA GAAGCGCAGG AAATGGTTGA CCACCTGGTC ATGAAACTGC GTATGGTTCG CTTCCTGCGT ACTCCGGAAT ACGATGAACT GTTCTCTGGC GACCCAATCT GGGCAACCGA ATCTATCGGT GGTATGGGCC TCGACGGTCG TACCCTGGTT ACCAAAAACA GCTTCCGTTT CCTGAACACC CTGTACACCA TGGGTCCGTC TCCGGAACCG AACATGACCA TTCTGTGGTC CGAAAAACTG CCGCTGAACT TCAAGAAATT CGCTGCTAAA GTGTCCATCG ACACCTCTTC TCTGCAGTAT GAGAACGATG ACCTGATGCG TCCGGACTTC AACAACGATG ACTACGCTAT CGCTTGCTGC GTAAGCCCGA TGATCGTTGG TAAACAAATG CAGTTCTTCG GTGCGCGTGC AAACCTGGCG AAAACCATGC TGTACGCAAT CAACGGCGGC GTTGACGAAA AACTGAAAAT GCAGGTTGGT CCGAAGTCTG AACCGATCAA AGGCGATCTC CTGAACTACG ATGAAGTGAT GGAGCGCATG GATCACTTCA TGGACTGGCT GGCTAAACAG TACATCACTG CACTGAACAT CATCCACTAC ATGCACGACA AGTACAGCTA CGAAGCCTCT CTGATGGCGC TGCACGACCG TGACGTTATC CGCACCATGG CGTGTGGTAT CGCTGGTCTG TCCGTTGCTG CTGACTCCCT GTCTGCAATC AAATATGCGA AAGTTAAACC GATTCGTGAC GAAGACGGTC TGGCTATCGA CTTCGAAATC GAAGGCGAAT ACCCGCAGTT TGGTAACAAC GATCCGCGTG TAGATGACCT GGCTGTTGAC CTGGTAGAAC GTTTCATGAA GAAAATTCAG AAACTGCACA CCTACCGTGA CGCTATCCCG 
ACTCAGTCTG TTCTGACCAT CACTTCTAAC GTTGTGTATG GTAAGAAAAC TGGTAACACC CCAGACGGTC GTCGTGCTGG CGCGCCGTTC GGGCCGGGTG CTAACCCGAT GCACGGTCGT GACCAGAAAG GTGCAGTAGC CTCTCTGACT TCCGTTGCTA AACTGCCGTT TGCTTACGCT AAAGATGGTA TCTCCTACAC CTTCTCTATC GTTCCGAACG CACTGGGTAA AGACGACGAA GTTCGTAAGA CCAACCTGGC TGGTCTGATG GATGGTTACT TCCACCACGA AGCGTCCATC GAAGGTGGTC AGCACCTGAA CGTTAACGTG ATGAACCGTG AAATGCTGCT CGACGCGATG GAAAACCCGG AAAAATATCC GCAGCTGACC ATCCGTGTAT CTGGCTACGC AGTACGTTTC AACTCGCTGA CTAAAGAACA GCAGCAGGAC GTTATTACTC GTACCTTCAC TCAATCTATG TAA
|
Protein sequence | MSELNEKLAT AWEGFTKGDW QNEVNVRDFI QKNYTPYEGD ESFLAGATEA TTTLWDKVME GVKLENRTHA PVDFDTAVAS TITSHDAGYI NKALEKVVGL QTEAPLKRAL IPFGGIKMIE GSCKAYNREL DPMIKKIFTE YRKTHNQGVF DVYTPDILRC RKSGVLTGLP DAYGRGRIIG DYRRVALYGI DYLMKDKYAQ FTSLQADLEN GVNLEQTIRL REEIAEQHRA LGQMKEMAAK YGYDISGPAT NAQEAIQWTY FGYLAAVKSQ NGAAMSFGRT STFLDVYIER DLKAGKITEQ EAQEMVDHLV MKLRMVRFLR TPEYDELFSG DPIWATESIG GMGLDGRTLV TKNSFRFLNT LYTMGPSPEP NMTILWSEKL PLNFKKFAAK VSIDTSSLQY ENDDLMRPDF NNDDYAIACC VSPMIVGKQM QFFGARANLA KTMLYAINGG VDEKLKMQVG PKSEPIKGDL LNYDEVMERM DHFMDWLAKQ YITALNIIHY MHDKYSYEAS LMALHDRDVI RTMACGIAGL SVAADSLSAI KYAKVKPIRD EDGLAIDFEI EGEYPQFGNN DPRVDDLAVD LVERFMKKIQ KLHTYRDAIP TQSVLTITSN VVYGKKTGNT PDGRRAGAPF GPGANPMHGR DQKGAVASLT SVAKLPFAYA KDGISYTFSI VPNALGKDDE VRKTNLAGLM DGYFHHEASI EGGQHLNVNV MNREMLLDAM ENPEKYPQLT IRVSGYAVRF NSLTKEQQQD VITRTFTQSM
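
Both sequences are displayed in space-separated blocks of ten characters, so the spaces need to be stripped before any analysis. The sketch below shows one way to do that, compute GC content, and translate the coding sequence. The function names are hypothetical. The codon assignments are the standard ones, which translation table 11 shares with table 1 (the tables differ only in permitted start codons), so an alternative start codon would not be rendered as M here; this gene starts with ATG, so that case does not arise.

```python
# Amino-acid assignments in NCBI order, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(raw: str) -> str:
    """Remove the display spacing (and any line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon.

    Raises ValueError on any base other than A, C, G, or T.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = seq[i:i + 3]
        index = 16 * BASES.index(first) + 4 * BASES.index(second) + BASES.index(third)
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
raw = "ATGTCCGAGC TTAATGAAAA GTTAGCCACA"
print(translate(clean_sequence(raw)))  # prints: MSELNEKLAT
```

The translated prefix matches the first ten residues of the protein sequence in the record.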