Gene SeD_A1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1037 
SymbolpflB2 
ID6872972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1035069 
End bp1037351 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content52% 
IMG OID642784222 
Productformate acetyltransferase 
Protein accessionYP_002214896 
Protein GI198241907 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01255] formate acetyltransferase 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0101477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.620237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGC TTAATGAAAA GTTAGCCACA GCCTGGGAAG GTTTTACCAA AGGTGACTGG 
CAGAATGAAG TAAACGTCCG TGACTTCATT CAGAAAAACT ACACTCCGTA TGAGGGTGAC
GAGTCCTTCC TGGCTGGCGC TACTGACGCG ACCACCAAGC TGTGGGACAG CGTAATGGAA
GGCGTTAAAC AGGAAAACCG CACTCACGCG CCTGTTGACT TTGACACCTC CGTTGCTTCC
ACCATCACTT CTCACGACGC TGGCTACATC AACAAAGCGC TTGAGAAAAT TGTTGGTCTG
CAGACTGAAG CTCCGCTGAA GCGTGCGATT ATCCCGTTCG GCGGCATCAA AATGGTTGAA
GGTTCCTGCA AAGCGTACAA TCGCGAGCTG GACCCAATGA TCAAAAAAAT CTTCACCGAA
TACCGTAAGA CTCACAACCA GGGCGTGTTC GACGTTTATA CTCCGGACAT CCTGCGTTGC
CGTAAATCCG GCGTTCTGAC CGGTCTGCCG GATGCGTATG GCCGTGGCCG TATCATCGGT
GACTACCGTC GCGTAGCGCT GTACGGTATC GACTACCTGA TGAAAGACAA ATTCGCACAG
TTTACGTCTC TGCAATCCGA TCTGGAAAAC GGCGTAAATC TGGAAGCGAC TATCCGTCTG
CGTGAAGAAA TCGCTGAACA GCACCGCGCT CTGGGTCAGA TCAAAGAAAT GGCAGCTAAA
TACGGCTGCG ATATCTCTGG TCCGGCGACT AACGCTCAGG AAGCAATCCA GTGGACTTAC
TTCGGTTACC TGGCTGCGGT TAAATCTCAG AACGGCGCAG CAATGTCCTT CGGTCGTGTA
TCCACCTTCC TGGATGCGTA CATCGAACGT GACCTGAAAG CAGGCAAAAT CACCGAGCAA
GACGCACAGG AAATGATTGA CCACCTGGTC ATGAAACTGC GTATGGTTCG CTTCCTGCGT
ACTCCTGAAT ATGATGAACT GTTCTCCGGC GACCCGATTT GGGCAACCGA ATCTATCGGC
GGTATGGGCG TTGATGGCCG TACTCTGGTC ACCAAAAACA GCTTCCGTTT CCTGAACACC
CTGTACACCA TGGGGCCGTC TCCGGAGCCG AACATCACCG TTCTGTGGTC TGAAAAACTG
CCGCTGAACT TCAAGAAATT CGCCGCTAAA GTCTCCATCG ACACCTCTTC TCTGCAGTAC
GAGAACGATG ACCTGATGCG TCCGGACTTC AACAACGATG ACTACGCTAT CGCATGCTGC
GTAAGCCCGA TGATCGTTGG TAAACAAATG CAGTTCTTCG GCGCGCGTGC AAACCTGGCG
AAAACCATGC TGTACGCTAT CAACGGCGGC GTTGATGAAA AACTGAAAAT GCAGGTTGGT
CCGAAATCCG AACCGATCAA AGGCGATGTT CTGAACTTCG ACGAAGTGAT GGATCGCATG
GATCACTTCA TGGACTGGCT GGCTAAACAG TATGTCACCG CGCTGAACGT TATCCACTAC
ATGCACGACA AGTACAGCTA CGAAGCCTCT CTGATGGCGC TGCACGACCG TGACGTTATC
CGCACCATGG CGTGTGGTAT CGCAGGTCTG TCCGTTGCTG CTGACTCCCT GTCTGCCATC
AAATATGCGA AAGTTAAACC GATTCGTGAC GAAGATGGTC TGGCTATCGA CTTCGAAATC
GAAGGCGAAT ACCCGCAGTT TGGTAACAAC GACGCTCGTG TAGATGACAT GGCGGTTGAC
CTGGTAGAAC GTTTCATGAA GAAAATTCAG AAACTGACCA CCTACCGTGG CGCTATCCCG
ACGCAGTCTG TTCTGACCAT CACTTCTAAC GTTGTGTATG GTAAGAAAAC CGGTAACACC
CCGGATGGTC GTCGCGCTGG CGCGCCGTTC GGACCAGGTG CTAACCCGAT GCACGGTCGT
GACCAGAAAG GCGCTGTCGC TTCTCTGACC TCCGTTGCTA AACTGCCGTT TGCTTACGCG
AAAGATGGTA TTTCTTATAC CTTCTCTATC GTTCCGAACG CACTGGGTAA AGACGACGAA
GTTCGTAAGA CTAACCTGGC AGGTCTGATG GATGGTTACT TCCACCACGA AGCGTCCATC
GAAGGCGGTC AGCACCTGAA CGTCAACGTC ATGAACCGTG AAATGCTGCT GGACGCGATG
GAACATCCGG AAAAATATCC GCAGCTGACC ATCCGTGTAT CTGGTTACGC AGTACGTTTT
AACTCCCTGA CGAAAGAACA GCAGCAGGAC GTTATTACTC GTACCTTCAC GCAGACCATG
TAA
 
Protein sequence
MSELNEKLAT AWEGFTKGDW QNEVNVRDFI QKNYTPYEGD ESFLAGATDA TTKLWDSVME 
GVKQENRTHA PVDFDTSVAS TITSHDAGYI NKALEKIVGL QTEAPLKRAI IPFGGIKMVE
GSCKAYNREL DPMIKKIFTE YRKTHNQGVF DVYTPDILRC RKSGVLTGLP DAYGRGRIIG
DYRRVALYGI DYLMKDKFAQ FTSLQSDLEN GVNLEATIRL REEIAEQHRA LGQIKEMAAK
YGCDISGPAT NAQEAIQWTY FGYLAAVKSQ NGAAMSFGRV STFLDAYIER DLKAGKITEQ
DAQEMIDHLV MKLRMVRFLR TPEYDELFSG DPIWATESIG GMGVDGRTLV TKNSFRFLNT
LYTMGPSPEP NITVLWSEKL PLNFKKFAAK VSIDTSSLQY ENDDLMRPDF NNDDYAIACC
VSPMIVGKQM QFFGARANLA KTMLYAINGG VDEKLKMQVG PKSEPIKGDV LNFDEVMDRM
DHFMDWLAKQ YVTALNVIHY MHDKYSYEAS LMALHDRDVI RTMACGIAGL SVAADSLSAI
KYAKVKPIRD EDGLAIDFEI EGEYPQFGNN DARVDDMAVD LVERFMKKIQ KLTTYRGAIP
TQSVLTITSN VVYGKKTGNT PDGRRAGAPF GPGANPMHGR DQKGAVASLT SVAKLPFAYA
KDGISYTFSI VPNALGKDDE VRKTNLAGLM DGYFHHEASI EGGQHLNVNV MNREMLLDAM
EHPEKYPQLT IRVSGYAVRF NSLTKEQQQD VITRTFTQTM