Gene EcolC_2821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2821 
Symbol 
ID6065071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3084425 
End bp3086857 
Gene Length2433 bp 
Protein Length810 aa 
Translation table11 
GC content55% 
IMG OID641602227 
Productpyruvate formate-lyase 
Protein accessionYP_001725776 
Protein GI170020822 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01774] pyruvate formate-lyase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAC TGAAACTGGA CACGCTCAGC GACCGCATTA AAGCGCACAA AAATGCGCTG 
GTGCATATTG TGAAACCGCC AGTCTGTACC GAGCGCGCGC AGCACTATAC CGAGATGTAT
CAACAACATC TCGATAAGCC GATCCCGGTA CGTCGCGCGC TGGCACTGGC GCATCACCTG
GCGAATCGCA CTATCTGGAT CAAGCACGAT GAGTTGATCA TTGGCAACCA GGCAAGCGAA
GTTCGCGCCG CGCCGATCTT CCCGGAATAT ACCGTCTCCT GGATCGAAAA AGAGATTGAT
GATCTGGCAG ATCGTCCGGG TGCTGGTTTT GCGGTGAGCG AAGAGAACAA ACGTGTTCTG
CATGAAGTGT GCCCTTGGTG GCGCGGTCAG ACGGTACAGG ATCGTTGCTA CGGCATGTTT
ACCGATGAGC AAAAAGGTCT GCTGGCGACC GGAATCATTA AAGCGGAGGG CAATATGACT
TCCGGCGATG CGCACCTGGC GGTCAATTTC CCGCTGCTGC TGGAAAAAGG GCTTGATGGT
CTGCGCGAGA AAGTGGCGGA ACGTCGCTCG CGCATCAACC TGACGGTGCT GGAAGATTTG
CACGGCGAGC AGTTCCTGAA AGCGATTGAT ATCGTGCTGG TGGCAGTCAG TGAACACATT
GAACGTTTCG CTGCCCTGGC GCGTGAAATG GCCGCGACCG AAACCCGCGA AAGCCGTCGC
GATGAACTGC TGGCGATAGC AGAAAACTGC GATCTTATCG CCCACCAGCC GCCGCAGACT
TTCTGGCAGG CGCTGCAACT GTGTTACTTC ATCCAGTTGA TATTGCAGAT CGAATCTAAC
GGTCACTCGG TATCGTTTGG TCGTATGGAC CAGTATCTCT ACCCGTACTA TCGTCGTGAT
GTTGAACTGA AACAGACGCT GGATCGCGAA CACGCCATCG AATTACTGCA TAGCTGCTGG
TTGAAATTGC TGGAAGTGAA CAAAATCCGC TCCGGCTCGC ACTCGAAAGC CTCTGCGGGA
AGTCCTCTGT ATCAGAACGT CACCATTGGC GGACAAAATC TGGTTGATGG TCAACCAATG
GACGCGGTGA ATCCACTCTC TTACGCGATC CTTGAGTCCT GCGGCCGTCT GCGTTCGACT
CAGCCTAACC TCAGCGTGCG CTACCACGCG GGAATGAGCA ACGATTTCCT CGACGCCTGC
GTACAGGTGA TCCGCTGCGG CTTCGGGATG CCGGCGTTCA ACAACGACGA AATCGTGATC
CCGGAATTTA TTAAACTCGG TATTGAACCG CAGGACGCTT ATGACTACGC AGCGATTGGT
TGTATAGAAA CCGCCGTCGG TGGCAAATGG GGCTATCGCT GTACCGGCAT GAGCTTTATC
AACTTCGCCC GCGTGATGCT GGCGGCGCTG GAAGGCGGTC GTGATGCCAC CAGCGGCAAA
GTGTTCCTGC CACAAGAAAA AGCGTTGTCG GCAGGTAACT TCAACAACTT CGATGAAGTG
ATGGACGCGT GGGATACGCA AATCCGTTAC TACACCCGCA AATCAATCGA AATTGAGTAT
GTCGTTGACA CCATGCTGGA GGAGAACGTG CACGATATTC TCTGCTCGGC GCTGGTGGAT
GACTGTATTG AGCGAGCAAA AAGTATCAAG CAAGGCGGCG CGAAATATGA CTGGGTTTCT
GGCCTGCAGG TCGGCATTGC CAACCTCGGC AACAGCCTGG CGGCAGTGAA GAAACTGGTG
TTTGAACAGG GCGCGATTGG TCAGCAACAG CTGGCTGCCG CACTGGCGGA TGACTTCGAA
GGCCTGACTC ATGAGCAACT GCGTCAGCGG CTGATTAACG GTGCGCCGAA GTACGGCAAC
GACGATGATA CCGTCGATAT GCTGCTGGCT CGCGCTTATC AGACCTATAT CGACGAACTG
AAACAGTACC ATAACCCACG CTACGGTCGT GGTCCGGTTG GCGGCAACTA TTACGCGGGT
ACGTCGTCTA TCTCCGCTAA CGTACCGTTT GGCGCGCAGA CTATGGCAAC GCCGGACGGA
CGTAAAGCCC ACACCCCGCT GGCAGAAGGC GCAAGCCCGG CCTCCGGTAC TGACCATCTC
GGCCCTACCG CGGTCATTGG CTCAGTGGGT AAACTGCCTA CGGCAGCGAT CCTCGGCGGC
GTGTTGCTCA ACCAGAAACT GAATCCGGCA ACGCTGGAGA ACGAATCTGA CAAGCAGAAA
CTGATGATCC TGCTGCGTAC CTTCTTCGAG GTGCATAAAG GCTGGCATAT TCAGTACAAC
ATCGTTTCCC GCGAGACGCT GCTGGAGGCG AAAAAACATC CCGATCAGTA TCGCGATCTG
GTAGTGCGTG TCGCGGGCTA TTCCGCCTTC TTCACCGCGC TCTCTCCTGA CGCACAGGAC
GATATCATCG CCCGTACTGA ACATATGCTG TAA
 
Protein sequence
MTTLKLDTLS DRIKAHKNAL VHIVKPPVCT ERAQHYTEMY QQHLDKPIPV RRALALAHHL 
ANRTIWIKHD ELIIGNQASE VRAAPIFPEY TVSWIEKEID DLADRPGAGF AVSEENKRVL
HEVCPWWRGQ TVQDRCYGMF TDEQKGLLAT GIIKAEGNMT SGDAHLAVNF PLLLEKGLDG
LREKVAERRS RINLTVLEDL HGEQFLKAID IVLVAVSEHI ERFAALAREM AATETRESRR
DELLAIAENC DLIAHQPPQT FWQALQLCYF IQLILQIESN GHSVSFGRMD QYLYPYYRRD
VELKQTLDRE HAIELLHSCW LKLLEVNKIR SGSHSKASAG SPLYQNVTIG GQNLVDGQPM
DAVNPLSYAI LESCGRLRST QPNLSVRYHA GMSNDFLDAC VQVIRCGFGM PAFNNDEIVI
PEFIKLGIEP QDAYDYAAIG CIETAVGGKW GYRCTGMSFI NFARVMLAAL EGGRDATSGK
VFLPQEKALS AGNFNNFDEV MDAWDTQIRY YTRKSIEIEY VVDTMLEENV HDILCSALVD
DCIERAKSIK QGGAKYDWVS GLQVGIANLG NSLAAVKKLV FEQGAIGQQQ LAAALADDFE
GLTHEQLRQR LINGAPKYGN DDDTVDMLLA RAYQTYIDEL KQYHNPRYGR GPVGGNYYAG
TSSISANVPF GAQTMATPDG RKAHTPLAEG ASPASGTDHL GPTAVIGSVG KLPTAAILGG
VLLNQKLNPA TLENESDKQK LMILLRTFFE VHKGWHIQYN IVSRETLLEA KKHPDQYRDL
VVRVAGYSAF FTALSPDAQD DIIARTEHML