Gene EcSMS35_1695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1695 
SymbolsfcA 
ID6146449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1696479 
End bp1698176 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content52% 
IMG OID641616571 
Productmalate dehydrogenase 
Protein accessionYP_001743749 
Protein GI170682113 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.488975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCAA AAACAAAAAA ACAGCGTTCG CTTTATATCC CTTACGCTGG CCCTGTATTG 
CTGGAATTTC CGTTGTTGAA TAAAGGCAGC GCCTTCAGCA TGGAAGAACG CCGTAACTTC
AACCTGCTGG GGTTACTGCC GGAAGTGGTC GAAACCATCG AAGAACAAGC GGAACGAGCA
TGGATCCAGT ATCAGGGATT CAAAACCGAA ATCGACAAAC ACATCTACCT GCGTAACATC
CAGGACACTA ACGAAACCCT CTTCTACCGT CTGGTAAACA ATCATCTTGA TGAGATGATG
CCAGTCATTT ACACCCCAAC CGTCGGCGCA GCCTGTGAGC GTTTTTCTGA GATCTACCGC
CGTTCACGCG GTGTGTTTAT CTCTTACCAG AACCGCCACA ATATGGACGA TATTCTGCAA
AACGTACCAA ACCATAATAT CAAAGTGATT GTGGTGACTG ACGGCGAACG TATTCTGGGG
CTTGGTGACC AGGGCATCGG AGGGATGGGC ATTCCGATCG GTAAACTGTC ACTCTATACC
GCCTGTGGCG GCATCAGCCC GGCGTATACC CTTCCGGTGG TGCTGGATGT CGGGACGAAC
AACCAACAGC TGCTTAACGA TCCGCTGTAT ATGGGCTGGC GTAATCCGCG TATCACTGAC
GACGAATACT ATGAATTCGT TGATGAATTT ATCCAGGCTG TGAAACAACG CTGGCCGGAC
GTACTGTTGC AGTTTGAAGA CTTCGCACAA AAAAATGCGA TGCCGTTACT TAACCGCTAT
CGCAATGAAA TTTGTTCTTT TAACGATGAC ATTCAGGGCA CCGCGGCGGT AACGGTCGGC
ACACTGATCG CAGCCAGCCG CGCGGCGGGT GGTCAACTGA GCGAGAAAAA GATTGTCTTC
CTCGGCGCAG GCTCTGCGGG GTGCGGGATT GCCGAAATGA TCATTGCCCA GACTCAGCGT
GAAGGATTAA GCGAGGAAAC GGCGCGGCAG AAAGTCTTTA TGGTCGATCG CTTTGGCCTG
CTGACCGACA AGATGCCGAA CCTGCTGCCT TTCCAGACCA AACTGGTGCA GAAACGCGAA
AACCTCAGTG ACTGGGATAC CGACAGCGAT GTGCTGTCAC TGCTGGATGT GGTGCGCAAT
GTAAAACCAG ATATTCTGAT TGGCGTCTCA GGACAGACCG GGCTGTTTAC GGAAGAGATC
ATCCGTGAGA TGCATAAACA CTGTCCGCGT CCGATCGTGA TGCCGCTGTC TAACCCGACT
TCACGCGTCG AAGCAACACC GCAGGACATT ATCGCCTGGA CTGAAGGTAA CGCGCTGGTC
GCCACTGGCA GTCCATTTAA TCCAGTGGTA TGGAAAGATA AAATCTACCC TATCGCCCAG
TGTAATAACG CCTTTATTTT CCCGGGCATC GGGCTGGGTG TTATTGCTTC CGGCGCGTCA
CGTATCACCG ATGAAATGCT GATGTCGGCA AGTGAAACGC TTGCTCAGTA TTCGCCGCTG
GTCCTGAACG GCGAAGGTCT GGTACTACCG GAACTGAAAG ATATACAGAA AGTCTCCCGC
GCAATTGCGT TTGCGGTTGG CAAAATGGCG CAGCAGCAAG GCGTGGCGGT GAAAACCTCC
GCCGAAGCTC TGCAACAGGC CATTGATGAT AATTTCTGGC AAGCCGAATA CCGCGACTAC
CGCCGTACCT CCATCTAA
 
Protein sequence
MEPKTKKQRS LYIPYAGPVL LEFPLLNKGS AFSMEERRNF NLLGLLPEVV ETIEEQAERA 
WIQYQGFKTE IDKHIYLRNI QDTNETLFYR LVNNHLDEMM PVIYTPTVGA ACERFSEIYR
RSRGVFISYQ NRHNMDDILQ NVPNHNIKVI VVTDGERILG LGDQGIGGMG IPIGKLSLYT
ACGGISPAYT LPVVLDVGTN NQQLLNDPLY MGWRNPRITD DEYYEFVDEF IQAVKQRWPD
VLLQFEDFAQ KNAMPLLNRY RNEICSFNDD IQGTAAVTVG TLIAASRAAG GQLSEKKIVF
LGAGSAGCGI AEMIIAQTQR EGLSEETARQ KVFMVDRFGL LTDKMPNLLP FQTKLVQKRE
NLSDWDTDSD VLSLLDVVRN VKPDILIGVS GQTGLFTEEI IREMHKHCPR PIVMPLSNPT
SRVEATPQDI IAWTEGNALV ATGSPFNPVV WKDKIYPIAQ CNNAFIFPGI GLGVIASGAS
RITDEMLMSA SETLAQYSPL VLNGEGLVLP ELKDIQKVSR AIAFAVGKMA QQQGVAVKTS
AEALQQAIDD NFWQAEYRDY RRTSI