Gene EcHS_A0881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0881 
Symbol 
ID5594761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp888056 
End bp890488 
Gene Length2433 bp 
Protein Length810 aa 
Translation table11 
GC content55% 
IMG OID640920053 
Productformate C-acetyltransferas 
Protein accessionYP_001457620 
Protein GI157160302 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01774] pyruvate formate-lyase 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACAC TGAAACTGGA CACGCTCAGC GACCGCATTA AAGCGCACAA AAATGCGCTG 
GTGCATATTG TGAAACCGCC AGTCTGTACC GAGCGCGCGC AGCACTATAC CGAGATGTAT
CAACAACATC TCGATAAGCC GATCCCGGTA CGTCGCGCGC TGGCACTGGC GCATCACCTG
GCGAATCGCA CTATCTGGAT CAAGCACGAT GAGTTGATCA TTGGCAACCA GGCAAGCGAA
GTTCGCGCCG CGCCGATCTT CCCGGAATAT ACCGTCTCCT GGATCGAAAA AGAGATTGAT
GATCTGGCAG ATCGTCCGGG TGCTGGTTTT GCGGTGAGCG AAGAGAACAA ACGTGTTCTG
CATGAAGTGT GCCCTTGGTG GCGCGGTCAG ACGGTACAGG ATCGTTGCTA CGGCATGTTT
ACCGATGAGC AAAAAGGTCT GCTGGCGACC GGAATCATTA AAGCGGAGGG CAATATGACT
TCCGGCGATG CGCACCTGGC GGTCAATTTC CCGCTGCTGC TGGAAAAAGG GCTTGATGGT
CTGCGCGAGA AAGTGGCGGA ACGTCGCTCG CGCATCAACC TGACGGTGCT GGAAGATTTG
CACGGCGAGC AGTTCCTGAA AGCGATTGAT ATCGTGCTGG TGGCAGTCAG TGAACACATT
GAACGTTTCG CTGCCCTGGC GCGTGAAATG GCCGCGACCG AAACCCGCGA AAGCCGTCGC
GATGAACTGC TGGCGATAGC AGAAAACTGC GATCTTATCG CCCACCAGCC GCCGCAGACG
TTCTGGCAGG CGCTGCAACT GTGCTACTTC ATCCAGTTGA TTTTGCAGAT TGAATCTAAC
GGTCACTCGG TGTCGTTTGG TCGTATGGAC CAGTATCTCT ACCCGTATTA TCGCCGTGAT
GTTGAACTGA ACCAAATGCT GGAACGCGAA CACGCCATCG AATTGCTGCA TAGCTGCTGG
TTGAAATTGC TGGAAGTGAA CAAGATCCGC TCCGGCTCAC ACTCAAAAGC CTCTGCGGGT
AGCCCGCTGT ATCAGAACGT CACCATTGGC GGGCAAAATT TGATTGATGG TCAGCCGATG
GATGCGGTGA ATCCGCTCTC CTACGCGATC CTCGAATCCT GCGGTCGCCT GCGTTCGACG
CAGCCTAACC TCAGCGTGCG CTACCATGCG GGAATGAGTA ACGATTTCCT CGACGCCTGC
GTACAGGTGA TCCGGTGCGG CTTCGGGATG CCGGCCTTCA ACAACGACGA AATCGTGATC
CCGGAATTTA TTAAACTCGG TATTGAACCG CAGGACGCTT ACGACTACGC AGCGATTGGT
TGTATCGAAA CCGCCGTCGG TGGCAAATGG GGCTATCGCT GTACCGGCAT GAGCTTTATC
AACTTCGCCC GCGTGATGCT GGCTGCGCTG GAAGGCGGTC GTGATGCCAC CAGCGGCAAA
GTGTTCCTGG CACAAGAAAA AGCGTTGTCG GCAGGTAACT TCAACAACTT CGATGAAGTA
ATGGACGCGT GGGATACGCA AATCCGTTAC TACACCCGCA AATCAATCGA AATCGAATAT
GTCGTCGACA CCATGCTGGA AGAGAACGTG CACGATATTC TCTGCTCGGC GCTGGTGGAT
GACTGTATTG AGCGAGCAAA AAGTATCAAG CAAGGCGGCG CGAAATATGA CTGGGTTTCT
GGCCTGCAGG TCGGCATTGC CAACCTCGGC AACAGCCTGG CGGCAGTGAA GAAACTGGTG
TTTGAACAGG GCGCGATTGG TCAGCAACAG CTGGCTGCCG CACTGGCGGA TGACTTCGAC
GGCCTGACTC ATGAGCAACT GCGTCAGCGG CTGATTAACG GTGCGCCGAA GTACGGTAAC
GACGATGATA CTGTCGATAC GCTGCTGGCT CGCGCTTATC AGACCTATAT CGACGAACTG
AAACAGTACC ATAATCCGCG CTACGGTCGT GGTCCGGTTG GCGGCAACTA TTACGCGGGT
ACGTCGTCTA TCTCCGCTAA CGTACCGTTT GGCGCGCAGA CGATGGCAAC GCCGGACGGA
CGTAAAGCGC ATACCCCGCT GGCAGAAGGC GCAAGCCCGG CTTCCGGTAC TGACCATCTC
GGCCCAACGG CAGTCATTGG ATCGGTGGGT AAACTGCCTA CAGCAGCGAT TCTCGGCGGC
GTGTTGCTCA ACCAGAAACT GAATCCGGCG ACGCTGGAGA ACGAATCTGA CAAGCAGAAA
CTGATGATCC TGCTCCGTAC CTTCTTCGAG GTGCATAAAG GCTGGCATAT TCAGTACAAC
ATCGTTTCCC GCGAAACGCT GCTGGAGGCG AAAAAACATC CCGATCAGTA TCGCGATCTG
GTAGTGCGTG TCGCGGGCTA TTCCGCCTTC TTCACCGCGC TCTCTCCAGA CGCTCAGGAC
GATATCATCG CCCGTACTGA ACATATGCTG TAA
 
Protein sequence
MTTLKLDTLS DRIKAHKNAL VHIVKPPVCT ERAQHYTEMY QQHLDKPIPV RRALALAHHL 
ANRTIWIKHD ELIIGNQASE VRAAPIFPEY TVSWIEKEID DLADRPGAGF AVSEENKRVL
HEVCPWWRGQ TVQDRCYGMF TDEQKGLLAT GIIKAEGNMT SGDAHLAVNF PLLLEKGLDG
LREKVAERRS RINLTVLEDL HGEQFLKAID IVLVAVSEHI ERFAALAREM AATETRESRR
DELLAIAENC DLIAHQPPQT FWQALQLCYF IQLILQIESN GHSVSFGRMD QYLYPYYRRD
VELNQMLERE HAIELLHSCW LKLLEVNKIR SGSHSKASAG SPLYQNVTIG GQNLIDGQPM
DAVNPLSYAI LESCGRLRST QPNLSVRYHA GMSNDFLDAC VQVIRCGFGM PAFNNDEIVI
PEFIKLGIEP QDAYDYAAIG CIETAVGGKW GYRCTGMSFI NFARVMLAAL EGGRDATSGK
VFLAQEKALS AGNFNNFDEV MDAWDTQIRY YTRKSIEIEY VVDTMLEENV HDILCSALVD
DCIERAKSIK QGGAKYDWVS GLQVGIANLG NSLAAVKKLV FEQGAIGQQQ LAAALADDFD
GLTHEQLRQR LINGAPKYGN DDDTVDTLLA RAYQTYIDEL KQYHNPRYGR GPVGGNYYAG
TSSISANVPF GAQTMATPDG RKAHTPLAEG ASPASGTDHL GPTAVIGSVG KLPTAAILGG
VLLNQKLNPA TLENESDKQK LMILLRTFFE VHKGWHIQYN IVSRETLLEA KKHPDQYRDL
VVRVAGYSAF FTALSPDAQD DIIARTEHML