Gene EcSMS35_0848 details

Gene Information

Locus tag: EcSMS35_0848
Symbol:
ID: 6147098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 851853
End bp: 854285
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table: 11
GC content: 55%
IMG OID: 641615736
Product: formate C-acetyltransferase 3
Protein accession: YP_001742928
Protein GI: 170680406
COG category: [C] Energy production and conversion
COG ID: [COG1882] Pyruvate-formate lyase
TIGRFAM ID: [TIGR01774] pyruvate formate-lyase
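
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are chosen here for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 851853
end_bp = 854285
protein_length_aa = 810

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)            # 2433
print(gene_length_bp % 3)        # 0, the length is a whole number of codons
print(expected_protein_length)   # 810
```

Under these assumptions the computed values agree with the listed gene length of 2433 bp and protein length of 810 aa.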


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACAC TGAAACTGGA CACGCTCAGC GACCGCATTA AAGCGCACAA AAATGCGCTG 
GTGCATATTG TGAAACCACC AGTCTGTACC GAGCGCGCGC AGCACTATAC CGAGATGTAT
CAACAACATC TCGATAAGCC GATCCCGGTA CGTCGCGCGC TGGCACTGGC GCATCACCTG
GCGAATCGCA CCATCTGGAT CAAGCACGAT GAATTGATCA TTGGCAACCA GGCAAGCGAA
GTTCGCGCCG CGCCGATCTT CCCGGAATAT ACCGTCTCCT GGATCGAAAA AGAGATTGAT
GATCTGGCAG ATCGTCCCGG TGCTGGTTTT GCGGTGAGCG AAGAGAACAA ACGCGTTCTG
CATGAAGTGT GCCCGTGGTG GCGCGGTCAG ACCGTACAGG ATCGCTGCTA CGGCATGTTT
ACCGATGAGC AAAAAGGTCT GCTGGCGACC GGCATCATTA AAGCGGAGGG CAATATGACC
TCCGGCGATG CGCACCTGGC GGTCAATTTC CCGCTACTGC TGGAAAAAGG TCTTGATGGT
CTGCGTGAGA AAGTGGCGGA ACGTCGCTCG CGCATCAATC TGACGGTACT GGAAGATCTG
CACGGCGAGC AATTCCTGAA AGCGATTGAT ATCGTGCTGG TGGCAGTCAG TGAACACATT
GAACGTTTCG CTGCCCTGGC GCGTGAAATG GCCGCGACCG AAACCCGCGA AAGCCGTCGC
GATGAACTGC TGACGATAGC AGAAAACTGC GATCTTATCG CCCACCAGCC GCCGCAGACT
TTCTGGCAGG CGCTGCAACT GTGTTACTTC ATCCAGTTGA TTTTGCAGAT CGAATCTAAC
GGTCACTCAG TATCGTTTGG TCGTATGGAC CAGTATCTCT ACCCGTACTA TCGCCGCGAC
GTTGAACTGA ACCAGACACT GGATCGTGAA CACGCCATCG AGATGCTGCA TAGCTGCTGG
TTAAAATTGC TGGAAGTGAA CAAGATCCGC TCCGGCTCAC ACTCAAAAGC CTCTGCGGGA
AGTCCGCTGT ATCAGAACGT CACCATTGGC GGGCAAAATC TGGTTGATGG TCAACCAATG
GACGCGGTGA ATCCACTCTC TTACGCGCTC CTCGAATCCT GCGGTCGCCT GCGTTCCACT
CAGCCTAACC TCAGCGTGCG TTACCATGCA GGAATGAGCA ACGATTTCCT CGACGCCTGC
GTACAGGTGA TCCGTTGCGG CTTCGGGATG CCGGCGTTCA ACAACGACGA AATCGTGATC
CCGGAATTTA TTAAACTCGG TATTGAACCG CAGGACGCTT ACGACTATGC AGCGATTGGT
TGTATCGAAA CCGCCGTCGG TGGCAAATGG GGCTATCGCT GTACCGGCAT GAGCTTTATC
AACTTCGCCC GTGTGATGCT GGCGGCGCTG GAAGGCGGTC GTGATGCCAC CAGCGGCAAA
GTGTTCCTGC CACAAGAAAA AGCGTTGTCG GCAGGTAGCT TCAACAACTT CGATGAAGTG
ATGGACGCGT GGGATACGCA AATCCGTTAC TACACGCGTA AATCAATCGA AATCGAATAT
GTCGTCGACA CCATGCTGGA AGAGAACGTA CACGATATTC TCTGCTCGGC GCTGGTGGAT
GACTGCATTG AGCGAGCGAA AAGTATCAAA CAAGGCGGCG CGAAGTATGA CTGGGTGTCT
GGCTTGCAGG TCGGTATCGC CAACCTCGGC AACAGCCTGG CGGCAGTGAA GAAACTGGTA
TTTGAACAGG GCGCGATTGG TCAGCAACAG CTGGCTGCCG CACTGGCGGA TGACTTCGAC
GGCCTGACTC ACGAGCAGCT GCGTCAGCGG CTGATTAACG GTGCGCCTAA GTACGGCAAC
GACGATGATA CTGTCGATAC GCTGCTGGCT CGCGCTTATC AGACCTATAT CGACGAACTG
AAACAGTACC ATAATCCGCG CTACGGTCGT GGTCCGGTTG GCGGCAACTA TTACGCGGGT
ACGTCATCTA TCTCCGCTAA CGTACCTTTT GGCGCGCAGA CGATGGCAAC GCCGGACGGG
CGTAAAGCCC ACACCCCGCT GGCAGAAGGC GCAAGCCCGG CCTCCGGTAC TGACCATCTT
GGCCCTACCG CGGTCATTGG CTCGGTGGGT AAACTGCCTA CGGCAGCGAT TCTCGGCGGC
GTGTTGCTCA ACCAGAAACT GAATCCGGCA ACGCTGGAGA ACGAATCTGA CAAGCAGAAG
CTGATGATCC TGCTGCGCAC CTTCTTCGAG GTGCATAAAG GCTGGCATAT TCAGTACAAC
ATCGTTTCCC GCGAAACGCT GCTGGAGGCG AAAAAACATC CCGATCAGTA TCGCGATCTG
GTAGTGCGCG TCGCGGGCTA TTCCGCCTTC TTCACCGCGC TCTCTCCAGA CGCTCAGGAC
GATATCATCG CCCGTACTGA ACATATGCTG TAA
 
Protein sequence
MTTLKLDTLS DRIKAHKNAL VHIVKPPVCT ERAQHYTEMY QQHLDKPIPV RRALALAHHL 
ANRTIWIKHD ELIIGNQASE VRAAPIFPEY TVSWIEKEID DLADRPGAGF AVSEENKRVL
HEVCPWWRGQ TVQDRCYGMF TDEQKGLLAT GIIKAEGNMT SGDAHLAVNF PLLLEKGLDG
LREKVAERRS RINLTVLEDL HGEQFLKAID IVLVAVSEHI ERFAALAREM AATETRESRR
DELLTIAENC DLIAHQPPQT FWQALQLCYF IQLILQIESN GHSVSFGRMD QYLYPYYRRD
VELNQTLDRE HAIEMLHSCW LKLLEVNKIR SGSHSKASAG SPLYQNVTIG GQNLVDGQPM
DAVNPLSYAL LESCGRLRST QPNLSVRYHA GMSNDFLDAC VQVIRCGFGM PAFNNDEIVI
PEFIKLGIEP QDAYDYAAIG CIETAVGGKW GYRCTGMSFI NFARVMLAAL EGGRDATSGK
VFLPQEKALS AGSFNNFDEV MDAWDTQIRY YTRKSIEIEY VVDTMLEENV HDILCSALVD
DCIERAKSIK QGGAKYDWVS GLQVGIANLG NSLAAVKKLV FEQGAIGQQQ LAAALADDFD
GLTHEQLRQR LINGAPKYGN DDDTVDTLLA RAYQTYIDEL KQYHNPRYGR GPVGGNYYAG
TSSISANVPF GAQTMATPDG RKAHTPLAEG ASPASGTDHL GPTAVIGSVG KLPTAAILGG
VLLNQKLNPA TLENESDKQK LMILLRTFFE VHKGWHIQYN IVSRETLLEA KKHPDQYRDL
VVRVAGYSAF FTALSPDAQD DIIARTEHML
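
The gene and protein sequences above are listed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the original record, of how the listing might be cleaned, translated, and checked for GC content. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is the standard one, on the assumption that translation table 11 uses the same amino acid assignments and differs only in its permitted start codons.

```python
# Standard codon table, built in TCAG order.
# Assumption: translation table 11 shares these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C (dna must be non-empty)."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as listed in this record.
first_line = "ATGACCACAC TGAAACTGGA CACGCTCAGC GACCGCATTA AAGCGCACAA AAATGCGCTG"
dna = clean(first_line)

print(len(dna))             # 60
print(translate(dna[:30]))  # MTTLKLDTLS
```

The first ten codons translate to MTTLKLDTLS, which matches the first block of the listed protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the GC content field.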