Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3304 |
Symbol | |
ID | 6147435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3380059 |
End bp | 3381069 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641618134 |
Product | malate/L-lactate dehydrogenase family protein |
Protein accession | YP_001745284 |
Protein GI | 170680163 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0489391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACTG TTTATGTCAG TGAAGAAAAT TTAAAATCAC TGGTGCACCA CAAATTACAT ACCGCCGGGC TGGATACCGA CACCACACAA CAGGTGACAG ATGTTTTAGT ACATGCCGAT ATTACAGGTG TGCATTCACA TGGTGTTATG CGTGTCGAAC ATTATTGCAC CCGCCTTGCT GCCGGAGGGT TAAATAAAGC CCCGCAGTTT AGCATTGAGC AAATTTCACC GTCAGTGGCC ATTCTCGACT CTGACGACGG AATGGGGCAT TCCGCATTAA TTAGCGCCAC TGAGCACGCT ATTAAACTGG CCCAGCAAGA GGGCCTCGGT TTTGTTAGCG TTAAAAACAC GTCTCACTGT GGTGCGTTAT CTTATTTTGC AGAGATGATC ACCAACAAAG GGTTGGTTGC TATCGTAATG ACGCAAACCG ATACCTGTGT GGCTCCCCAT GGCGGCGCCG AGCGCTTTTT GGGAACTAAC CCCATCGCCT TTGGTTTCCC GGTGGAAAAC AGCCATCCGA TGATTGTTGA TATGGCGACC AGTGCCACGG CTTTCGGCAA AATACTTCAT GCAAAGGAAA CCGGAAAACA TATTGGTGAA GGGCTGGCGA TAGATAAAAA CGGTTATGGC ACCACTGATC CGTATAAGAT TGAAAACCTG CTACCTTTCG GCCAACACAA AGGTTCAGGC ATTGCACTGG CTATTGATGC ACTGACTGGC GTGCTGATGA ATGCGAATTT TGGCAACCAT ATTGTTCGCA TGTATGGTGA TTATGACAAG ATGCGTAAGC TGGCAAGTTT GGTTATTGCC ATCGATCCGA AAAAGCTCGG CAATCCTGTT TTTGCAAAAA CCATGGCGAA AATGGTCACG GAGCTGCATG CCGTTAAACC GGCTCCCGGT GTCGAAAAAG TGTTAGCGCC GAACGATCCG CAAATACACT ACAAAGAAAA ATGCCAACAG GAAGGTATTC CGGTTCCGGC AGGAATATTC CATTATCTGG CAGAGAATTA A
|
Protein sequence | MTTVYVSEEN LKSLVHHKLH TAGLDTDTTQ QVTDVLVHAD ITGVHSHGVM RVEHYCTRLA AGGLNKAPQF SIEQISPSVA ILDSDDGMGH SALISATEHA IKLAQQEGLG FVSVKNTSHC GALSYFAEMI TNKGLVAIVM TQTDTCVAPH GGAERFLGTN PIAFGFPVEN SHPMIVDMAT SATAFGKILH AKETGKHIGE GLAIDKNGYG TTDPYKIENL LPFGQHKGSG IALAIDALTG VLMNANFGNH IVRMYGDYDK MRKLASLVIA IDPKKLGNPV FAKTMAKMVT ELHAVKPAPG VEKVLAPNDP QIHYKEKCQQ EGIPVPAGIF HYLAEN
|
| |