Gene SbBS512_E4437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4437 
SymbolpflD 
ID6272494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4144587 
End bp4146884 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content53% 
IMG OID641728233 
Productputative formate acetyltransferase 2 
Protein accessionYP_001882646 
Protein GI187732803 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01774] pyruvate formate-lyase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAATC GTATCTCTCG CCTCAAAACT GCACTGTTTG CCAATACCCG TGAAATCTCG 
CTGGAGCGGG CGCTGCTTTA TACCGCCAGC CATCGGCAAA CCGAAGGCGA ACCGGTGATA
TTGCGCCGGG CGAAAGCAAC AGCGTATATC CTTGAACATG TTGAAATTTC GATTCGTGAT
GAAGAACTGA TTGCCGGTAA CCGCACCGTA AAACCGCGCG CCGGGATTAT GTCGCCGGAA
ATGGACCCTT ACTGGCTGCT GAAAGAGCTG GATCAATTCC CGACGCGTCC GCAGGACCGC
TTTGCTATCA GCGAAGAAGA TAAACGTATC TATCGTGAAG AGTTGTTCCC GTACTGGGAA
AAACGTTCGA TGAAAGATTT CATCAACGGG CAGATGACGG ATGAAGTAAA AGCCGCGACC
AGCACGCAGA TTTTCAGCAT TAACCAGACG GATAAAGGCC AGGGGCACAT TATTATTGAT
TACCCACGCC TGTTGAATCA CGGGCTGGGG GAGTTGGTGG CACAGATGCA GCAACATTGT
CAGCAACAGC CGGAGAATCA CTTTTATCAG GCAGCGCTGT TACTGCTGGA AGCCTCGCAG
AAACACATTT TGCGTTACGC CGTACTGGCG GAAACGATGG CGGCAAACTG CACAGATGCC
CAGCGTCGCG AAGAACTGCT GACTATTGCG GAGATCTCTC GCCATAACGC CGAACATAAG
CCGCAGACGT TCTGGCAGGC GTGCCAGTTA TTCTGGTACA TGAACATCAT TCTGCAATAC
GAATCCAACG CCAGTTCGCT GTCTTTAGGG CGCTTCGACC AGTATATGTT GCCGTTCTAT
CAGGCATCTT TAACCCAGGG CGAAGATCCG GCGTTCCTGA AAGAACTGCT CGAATCTTTA
TGGGTGAAAT GCAACGACAT CGTGCTGTTG CGCTCCACCA GCAGCGCGCG TTATTTCGCA
GGTTTCCCGA CCGGCTATAC CGCACTGCTC GGCGGGTTAA CCGAGAACGG ACGTAGCGCG
GTGAACGTGC TTTCGTTCCT TTGCCTTGAC GCCTATCAAA GCGTGCAATT ACCGCAACCG
AACCTCGGCG TGCGCACTAA CGCCTTGATC GACACGCCAT TCCTGATGAA AACCGCCGAA
ACCATTCGCC TCGGCACCGG TATTCCGCAA ATCTTTAACG ATGAAGTGGT GGTACCAGCG
TTCCTCAACC GTGGCGTTTC GCTGGAAGAT GCGCGCGACT ATTCCGTAGT GGGCTGTGTG
GAATTATCTA TTCCCGGCAG AACCTACGGC TTGCATGACA TCGCGATGTT TAATCTGCTG
AAAGTGATGG AAATCTGCCT GCATGAAAAT GAAGGCAATG CTGCGCTGAC TTATGAAGGT
TTACTGGAGC AGATCCGCGC CAAGATCAGC CACTACATCA CCCTGATGGT TGAGGGCAGC
AATATTTGTG ATATCGGCCA TCGCGACTGG GCACCTGTAC CGCTGCTCTC GTCTTTTATC
AGCGATTGTC TGGAAAAAGG TCGCGATATT ACCGATGGCG GCGCGCGTTA TAACTTCTCC
GGCGTACAGG GGATCGGTAT CGCCAACCTG AGCGATTCTC TCCATGCGTT GAAAGGGATG
GTTTTTGAGC AACAGCGTTT AAGTTTTGAC GAATTGCTGT CGGTATTAAA AGCCAACTTT
GCAACGCCAG AAGGCGAAAA AGTCCGCGCT CGCTTAATTA ACCGCTTTGA GAAATATGGT
AACGATATCG ACGAGGTGGA TAACATTAGC GCCGAACTGT TGCGCCACTA CTGCAAAGAA
GTGGAAAAAT ACCAGAACCC GCGCGGTGGT TACTTCACGC CGGGATCGTA TACCGTTTCT
GCTCACGTCC CGTTGGGATC GGTGGTTGGC GCGACGCCAG ACGGTCGTTT TGCCGGAGAA
CAGCTGGCAG ACGGCGGCTT GTCACCTATG CTGGGCCAGG ACGAACAAGG GCCAACAGCG
GTACTGAAGT CAGTCAGTAA GCTCGATAAT ACGCTGCTGT CTAACGGTAC GTTGCTGAAC
GTGAAATTCA CTCCGGCGAC CCTGGAAGGT GAAGCGGGAT TACGCAAACT GGCCGACTTC
TTACGGGCGT TTACCCAGCT TAAGTTGCAG CATATTCAGT TTAACGTGGT GAACGCCGAC
ACGTTGTGGG AAGCGCAACA GCGCCCACAA GATTATGCCG GGCTGGTGGT GCGCGTTGCC
GGATACAGCG CCTTCTTTGT CGAACTGTCG AAGGAGATCC AGGATGACAT CATCCGCCGG
ACAGCGCATC AGCTGTAA
 
Protein sequence
MTNRISRLKT ALFANTREIS LERALLYTAS HRQTEGEPVI LRRAKATAYI LEHVEISIRD 
EELIAGNRTV KPRAGIMSPE MDPYWLLKEL DQFPTRPQDR FAISEEDKRI YREELFPYWE
KRSMKDFING QMTDEVKAAT STQIFSINQT DKGQGHIIID YPRLLNHGLG ELVAQMQQHC
QQQPENHFYQ AALLLLEASQ KHILRYAVLA ETMAANCTDA QRREELLTIA EISRHNAEHK
PQTFWQACQL FWYMNIILQY ESNASSLSLG RFDQYMLPFY QASLTQGEDP AFLKELLESL
WVKCNDIVLL RSTSSARYFA GFPTGYTALL GGLTENGRSA VNVLSFLCLD AYQSVQLPQP
NLGVRTNALI DTPFLMKTAE TIRLGTGIPQ IFNDEVVVPA FLNRGVSLED ARDYSVVGCV
ELSIPGRTYG LHDIAMFNLL KVMEICLHEN EGNAALTYEG LLEQIRAKIS HYITLMVEGS
NICDIGHRDW APVPLLSSFI SDCLEKGRDI TDGGARYNFS GVQGIGIANL SDSLHALKGM
VFEQQRLSFD ELLSVLKANF ATPEGEKVRA RLINRFEKYG NDIDEVDNIS AELLRHYCKE
VEKYQNPRGG YFTPGSYTVS AHVPLGSVVG ATPDGRFAGE QLADGGLSPM LGQDEQGPTA
VLKSVSKLDN TLLSNGTLLN VKFTPATLEG EAGLRKLADF LRAFTQLKLQ HIQFNVVNAD
TLWEAQQRPQ DYAGLVVRVA GYSAFFVELS KEIQDDIIRR TAHQL