Gene ECH74115_2091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2091 
SymbolsfcA 
ID6966887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1989815 
End bp1991512 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content52% 
IMG OID643385992 
Productmalate dehydrogenase 
Protein accessionYP_002270481 
Protein GI209396539 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCAA AAACAAAAAA ACAGCGTTCG CTTTATATCC CTTACGCTGG CCCTGTACTG 
CTGGAATTTC CGTTGTTGAA TAAAGGCAGT GCCTTCAGCA TGGAAGAACG CCGTAACTTC
AACCTGCTGG GGTTACTGCC GGAAGTGGTC GAAACCATCG AAGAACAAGC GGAACGAGCA
TGGATCCAGT ATCAGGGATT CAAAACCGAA ATCGACAAAC ACATCTACCT GCGTAACATC
CAGGACACCA ACGAAACCCT CTTCTACCGT CTGGTAAACA ATCATCTTGA TGAGATGATG
CCTGTTATTT ATACCCCAAC CGTCGGCGCA GCCTGTGAGC GTTTTTCTGA GATCTACCGC
CGTTCACGCG GCGTGTTTAT CTCTTACCAG AACCGCCACA ATATGGACGA TATTCTGCAA
AACGTGCCGA ACCATAATAT TAAAGTGATT GTGGTGACTG ACGGTGAACG TATTCTGGGG
CTTGGTGACC AGGGCATCGG CGGGATGGGC ATTCCGATCG GTAAACTGTC GCTCTATACC
GCCTGTGGCG GCATCAGCCC GGCGTATACC CTTCCGGTGG TGCTGGATGT CGGAACGAAC
AACCAACAGC TGCTTAACGA TCCGCTGTAT ATGGGCTGGC GTAATCCGCG TATCACTGAC
GACGAATACT ATGAATTCGT TGATGAATTT ATCCAGGCTG TGAAACAACG CTGGCCAGAC
GTGCTGTTGC AGTTTGAAGA CTTTGCTCAA AAAAATGCGA TGCCGTTACT TAACCGCTAT
CGCAATGAAA TTTGTTCCTT TAACGATGAC ATTCAGGGCA CCGCGGCGGT AACAGTCGGC
ACCCTGATCG CAGCAAGCCG CGCGGCAGGT GGTCAGTTAA GCGAGAAAAA AATCGTCTTC
CTTGGCGCAG GTTCAGCGGG ATGCGGCATT GCCGAAATGA TCATCGCCCA GACCCAGCGC
GAAGGATTAA GCGAGGAAGC GGCGCGGCAG AAAGTCTTTA TGGTCGATCG CTTTGGCCTG
CTGACCGACA AGATGCCGAA CCTGCTGCCT TTCCAGACCA AACTGGTGCA GAAGCGCGAA
AACCTCAGTG ACAGGGATAC CGACAGCGAT GTGCTGTCAC TGCTGGATGT GGTGCGCAAT
GTAAAACCAG ATATTCTGAT CGGCGTCTCA GGACAGACCG GGCTGTTTAC GGAAGAGATC
ATCCGTGAGA TGCATAAACA CTGTCCGCGT CCGATCGTGA TGCCGCTGTC CAACCCGACG
TCACGCGTGG AAGCCACACC GCAGGACATT ATCGCCTGGA CTGAAGGTAA CGCGCTGGTC
GCCACGGGCA GTCCGTTTAA TCCAGTGGTA TGGAAAGATA AAATCTACCC TATCGCCCAG
TGTAACAACG CCTTTATTTT CCCGGGCATC GGGCTGGGTG TTATTGCTTC CGGCGCGTCA
CGTATCACCG ATGAGATGCT GATGTCGGCA AGTGAAACGC TTGCCCAGTA TTCGCCGCTG
GTCCTGAACG GCGAAGGTCT GGTACTACCG GAACTAAAAG ATATTCAGAA AGTCTCCCGC
GCAATTGCGT TTGCGGTTGG CAAAATGGCG CAGCAGCAAG GCGTGGCGGT GAAAACGTCT
GCCGAAGCTT TGCAACAAGC CATTGACGAT AATTTCTGGC AAGCCGAATA CCGCGACTAC
CGCCGTACCT CCATCTAA
 
Protein sequence
MEPKTKKQRS LYIPYAGPVL LEFPLLNKGS AFSMEERRNF NLLGLLPEVV ETIEEQAERA 
WIQYQGFKTE IDKHIYLRNI QDTNETLFYR LVNNHLDEMM PVIYTPTVGA ACERFSEIYR
RSRGVFISYQ NRHNMDDILQ NVPNHNIKVI VVTDGERILG LGDQGIGGMG IPIGKLSLYT
ACGGISPAYT LPVVLDVGTN NQQLLNDPLY MGWRNPRITD DEYYEFVDEF IQAVKQRWPD
VLLQFEDFAQ KNAMPLLNRY RNEICSFNDD IQGTAAVTVG TLIAASRAAG GQLSEKKIVF
LGAGSAGCGI AEMIIAQTQR EGLSEEAARQ KVFMVDRFGL LTDKMPNLLP FQTKLVQKRE
NLSDRDTDSD VLSLLDVVRN VKPDILIGVS GQTGLFTEEI IREMHKHCPR PIVMPLSNPT
SRVEATPQDI IAWTEGNALV ATGSPFNPVV WKDKIYPIAQ CNNAFIFPGI GLGVIASGAS
RITDEMLMSA SETLAQYSPL VLNGEGLVLP ELKDIQKVSR AIAFAVGKMA QQQGVAVKTS
AEALQQAIDD NFWQAEYRDY RRTSI