Gene EcSMS35_3304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3304 
Symbol 
ID6147435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3380059 
End bp3381069 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content47% 
IMG OID641618134 
Productmalate/L-lactate dehydrogenase family protein 
Protein accessionYP_001745284 
Protein GI170680163 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0489391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACTG TTTATGTCAG TGAAGAAAAT TTAAAATCAC TGGTGCACCA CAAATTACAT 
ACCGCCGGGC TGGATACCGA CACCACACAA CAGGTGACAG ATGTTTTAGT ACATGCCGAT
ATTACAGGTG TGCATTCACA TGGTGTTATG CGTGTCGAAC ATTATTGCAC CCGCCTTGCT
GCCGGAGGGT TAAATAAAGC CCCGCAGTTT AGCATTGAGC AAATTTCACC GTCAGTGGCC
ATTCTCGACT CTGACGACGG AATGGGGCAT TCCGCATTAA TTAGCGCCAC TGAGCACGCT
ATTAAACTGG CCCAGCAAGA GGGCCTCGGT TTTGTTAGCG TTAAAAACAC GTCTCACTGT
GGTGCGTTAT CTTATTTTGC AGAGATGATC ACCAACAAAG GGTTGGTTGC TATCGTAATG
ACGCAAACCG ATACCTGTGT GGCTCCCCAT GGCGGCGCCG AGCGCTTTTT GGGAACTAAC
CCCATCGCCT TTGGTTTCCC GGTGGAAAAC AGCCATCCGA TGATTGTTGA TATGGCGACC
AGTGCCACGG CTTTCGGCAA AATACTTCAT GCAAAGGAAA CCGGAAAACA TATTGGTGAA
GGGCTGGCGA TAGATAAAAA CGGTTATGGC ACCACTGATC CGTATAAGAT TGAAAACCTG
CTACCTTTCG GCCAACACAA AGGTTCAGGC ATTGCACTGG CTATTGATGC ACTGACTGGC
GTGCTGATGA ATGCGAATTT TGGCAACCAT ATTGTTCGCA TGTATGGTGA TTATGACAAG
ATGCGTAAGC TGGCAAGTTT GGTTATTGCC ATCGATCCGA AAAAGCTCGG CAATCCTGTT
TTTGCAAAAA CCATGGCGAA AATGGTCACG GAGCTGCATG CCGTTAAACC GGCTCCCGGT
GTCGAAAAAG TGTTAGCGCC GAACGATCCG CAAATACACT ACAAAGAAAA ATGCCAACAG
GAAGGTATTC CGGTTCCGGC AGGAATATTC CATTATCTGG CAGAGAATTA A
 
Protein sequence
MTTVYVSEEN LKSLVHHKLH TAGLDTDTTQ QVTDVLVHAD ITGVHSHGVM RVEHYCTRLA 
AGGLNKAPQF SIEQISPSVA ILDSDDGMGH SALISATEHA IKLAQQEGLG FVSVKNTSHC
GALSYFAEMI TNKGLVAIVM TQTDTCVAPH GGAERFLGTN PIAFGFPVEN SHPMIVDMAT
SATAFGKILH AKETGKHIGE GLAIDKNGYG TTDPYKIENL LPFGQHKGSG IALAIDALTG
VLMNANFGNH IVRMYGDYDK MRKLASLVIA IDPKKLGNPV FAKTMAKMVT ELHAVKPAPG
VEKVLAPNDP QIHYKEKCQQ EGIPVPAGIF HYLAEN