Gene EcE24377A_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1000 
SymbolpflB2 
ID5590867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1017186 
End bp1019468 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content51% 
IMG OID640924707 
Productformate acetyltransferase 
Protein accessionYP_001462121 
Protein GI157157380 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01255] formate acetyltransferase 1 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.623819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAGC TTAATGAAAA GTTAGCCACA GCCTGGGAAG GTTTTACCAA AGGTGACTGG 
CAGAATGAAG TAAACGTCCG TGACTTCATT CAGAAAAACT ACACTCCGTA CGAGGGTGAC
GAGTCCTTCC TGGCTGGCGC TACTGAAGCG ACCACCACCC TGTGGGACAA AGTAATGGAA
GGCGTTAAAC TGGAAAACCG CACTCACGCG CCAGTTGACT TTGACACCGC TGTTGCTTCC
ACCATCACCT CTCACGACGC TGGCTACATC AACAAGCAGC TTGAGAAAAT CGTTGGTCTG
CAGACTGAAG CTCCGCTGAA ACGTGCTCTT ATCCCGTTCG GTGGTATCAA AATGATCGAA
GGTTCCTGCA AAGCGTACAA CCGCGAACTG GATCCGATGA TCAAAAAAAT CTTCACTGAA
TACCGTAAAA CTCACAACCA GGGCGTGTTC GACGTTTACA CTCCGGACAT CCTGCGTTGC
CGTAAATCTG GTGTTCTGAC CGGTCTGCCA GATGCATATG GCCGTGGCCG TATCATCGGT
GACTACCGTC GCGTTGCGCT GTACGGTATC GACTACCTGA TGAAAGACAA ACTGGCACAG
TTCACTTCTC TGCAGGCTGA TCTGGAAAAC GGCGTAAACC TGGAACAGAC TATCCGTCTG
CGCGAAGAAA TCGCTGAACA GCACCGCGCT CTGGGTCAGA TGAAAGAAAT GGCTGCGAAA
TACGGCTACG ACATCTCTGG TCCGGCTACC AACGCTCAGG AAGCTATCCA GTGGACTTAC
TTCGGCTACC TGGCTGCTGT TAAGTCTCAG AACGGTGCTG CAATGTCCTT CGGTCGTACC
TCCACCTTCC TGGATGTGTA CATCGAACGT GACCTGAAAG CTGGCAAGAT CACCGAACAA
GAAGCGCAGG AAATGGTTGA CCACCTGGTC ATGAAACTGC GTATGGTTCG CTTCCTGCGT
ACTCCGGAGT ACGATGAACT GTTCTCTGGC GACCCGATCT GGGCAACCGA ATCTATCGGT
GGTATGGGCC TCGACGGTCG TACTCTGGTT ACCAAAAACA GCTTCCGTTT CCTGAACACC
CTGTACACCA TGGGTCCGTC TCCGGAACCG AACATGACCA TTCTGTGGTC TGAAAAACTG
CCGCTGAACT TCAAGAAATT CGCCGCTAAA GTGTCCATCG ACACCTCTTC TCTGCAGTAT
GAGAACGATG ACCTGATGCG TCCGGACTTC AACAACGATG ACTACGCTAT TGCTTGCTGC
GTAAGCCCGA TGATCGTTGG TAAACAAATG CAGTTCTTCG GTGCGCGTGC AAACCTGGCG
AAAACCATGC TGTACGCAAT CAACGGCGGC GTTGACGAAA AACTGAAAAT GCAGGTTGGT
CCGAAGTCTG AACCGATCAA AGGCGATGTC CTGAACTATG ATGAAGTGAT GGAGCGCATG
GATCACTTCA TGGACTGGCT GGCTAAACAG TACATCACTG CACTGAACAT CATCCACTAC
ATGCACGACA AGTACAGCTA CGAAGCCTCT CTGATGGCGC TGCACGACCG TGACGTTATC
CGCACTATGG CGTGTGGTAT CGCTGGTCTG TCCGTTGCTG CTGACTCCCT GTCTGCAATC
AAATATGCGA AAGTTAAACC GATTCGTGAC GAAGACGGTC TGGCTATCGA CTTCGAAATC
GAAGGCGAAT ACCCGCAGTT TGGTAACAAT GATCCGCGTG TAGATGACCT GGCTGTTGAC
CTGGTAGAAC GTTTCATGAA GAAAATTCAG AAACTGCACA CCTACCGTGA CGCTATCCCG
ACTCAGTCTG TTCTGACCAT CACTTCTAAC GTTGTGTATG GTAAGAAAAC TGGTAACACC
CCAGACGGTC GTCGTGCTGG CGCGCCGTTC GGACCGGGTG CTAACCCGAT GCACGGTCGT
GACCAGAAAG GTGCTGTAGC GTCTCTGACT TCCGTTGCTA AACTGCCGTT TGCTTACGCT
AAAGATGGTA TCTCCTACAC CTTCTCTATC GTTCCGAACG CACTGGGTAA AGACGACGAA
GTTCGTAAGA CCAACCTGGC TGGTCTGATG GATGGTTACT TCCACCACGA AGCATCCATC
GAAGGTGGTC AGCACCTGAA CGTTAACGTG ATGAACCGTG AAATGCTGCT CGACGCGATG
GAAAACCCGG AAAAATATCC GCAGCTGACC ATCCGTGTAT CTGGCTACGC AGTACGTTTC
AACTCGCTGA CTAAAGAACA GCAGCAGGAC GTTATTACTC GTACCTTCAC TCAATCTATG
TAA
 
Protein sequence
MSELNEKLAT AWEGFTKGDW QNEVNVRDFI QKNYTPYEGD ESFLAGATEA TTTLWDKVME 
GVKLENRTHA PVDFDTAVAS TITSHDAGYI NKQLEKIVGL QTEAPLKRAL IPFGGIKMIE
GSCKAYNREL DPMIKKIFTE YRKTHNQGVF DVYTPDILRC RKSGVLTGLP DAYGRGRIIG
DYRRVALYGI DYLMKDKLAQ FTSLQADLEN GVNLEQTIRL REEIAEQHRA LGQMKEMAAK
YGYDISGPAT NAQEAIQWTY FGYLAAVKSQ NGAAMSFGRT STFLDVYIER DLKAGKITEQ
EAQEMVDHLV MKLRMVRFLR TPEYDELFSG DPIWATESIG GMGLDGRTLV TKNSFRFLNT
LYTMGPSPEP NMTILWSEKL PLNFKKFAAK VSIDTSSLQY ENDDLMRPDF NNDDYAIACC
VSPMIVGKQM QFFGARANLA KTMLYAINGG VDEKLKMQVG PKSEPIKGDV LNYDEVMERM
DHFMDWLAKQ YITALNIIHY MHDKYSYEAS LMALHDRDVI RTMACGIAGL SVAADSLSAI
KYAKVKPIRD EDGLAIDFEI EGEYPQFGNN DPRVDDLAVD LVERFMKKIQ KLHTYRDAIP
TQSVLTITSN VVYGKKTGNT PDGRRAGAPF GPGANPMHGR DQKGAVASLT SVAKLPFAYA
KDGISYTFSI VPNALGKDDE VRKTNLAGLM DGYFHHEASI EGGQHLNVNV MNREMLLDAM
ENPEKYPQLT IRVSGYAVRF NSLTKEQQQD VITRTFTQSM