Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0950 |
Symbol | |
ID | 6972208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 964156 |
End bp | 965241 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384971 |
Product | hypothetical protein |
Protein accession | YP_002269471 |
Protein GI | 209397893 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAGTG GTCATCGCTT TGATGCTCAG ACGCTGCACA GCTTTATTCA GGCTGTATTT CGTCAGATGG GTAGCGAGGA ACAAGAAGCG AAATTAGTTG CCGATCATTT AATCGCGGCA AACCTGGCAG GGCATGATTC ACATGGTATT GGCATGATCC CAAGCTATGT ACGCTCCTGG AGTCAGGGGC ACCTGCAAAT TAACCATCAT GCCAAAATCG TTAAAGAGGC GGGGGCGGCG GTCACGCTCG ATGGCGATCG CGCATTTGGT CAGGTCGCGG CACATGAAGC GATGGCGCTG GGGATTGAGA AAGCGCATCA GCACGGTATT GCTGCCGTGG CGCTACATAA CTCGCATCAT ATCGGCCGTA TCGGTTACTG GGCGGAGCAG TGTGCAGCGG CGGGGTTTGT CTCTATCCAC TTTGTTAGCG TGGTCGGTAT TCCAATGGTC GCACCGTTCC ACGGTCGCGA CAGCCGCTTT GGCACGAATC CGTTCTGTGT GGTTTTCCCA CGTAAAGATG ATTTTCCGCT GTTGCTTGAT TACGCCACCA GCGCCATTGC ATTTGGCAAA ACCCGCGTCG CCTGGCATAA AGGCGTCCCC GTGCCGCCAG GTTGCCTGAT TGACGTTAAC GGCGTGCCGA CGACCAATCC GGCGGTAATG CAGGAGTCGC CGTTGGGTTC GCTGTTGACC TTTGCCGAAC ATAAAGGCTA CGCCCTGGCG GCAATGTGTG AAATTCTTGG CGGGGCGCTT TCTGGCGGTA AAACGACGCA TCAGGAAACG TTACAAACCA GTCCCGATGC CATTCTTAAC TGCATGACCA CTATCATCAT CAACCCGGAA CTGTTCGGCG CGCCGGATTG TAGCGCGCAG ACCGAAGCCT TTGCCGAGTG GGTGAAAGCC TCGCCGCATG ATGACGATAA GCCGATTTTG CTCCCGGGCG AGTGGGAAGT GAACACGCGT CGCGAACGGC AGGAGCAGGG GATTCCATTG GATGCGGGAA GCTGGCAGGC CATTTGTGAT GCGGCGCGGC AGATTGGTAT GCCGGAAGAG ACGTTGCAGG CTTTCTGTCA GCAGTTAGCC AGCTAA
|
Protein sequence | MESGHRFDAQ TLHSFIQAVF RQMGSEEQEA KLVADHLIAA NLAGHDSHGI GMIPSYVRSW SQGHLQINHH AKIVKEAGAA VTLDGDRAFG QVAAHEAMAL GIEKAHQHGI AAVALHNSHH IGRIGYWAEQ CAAAGFVSIH FVSVVGIPMV APFHGRDSRF GTNPFCVVFP RKDDFPLLLD YATSAIAFGK TRVAWHKGVP VPPGCLIDVN GVPTTNPAVM QESPLGSLLT FAEHKGYALA AMCEILGGAL SGGKTTHQET LQTSPDAILN CMTTIIINPE LFGAPDCSAQ TEAFAEWVKA SPHDDDKPIL LPGEWEVNTR RERQEQGIPL DAGSWQAICD AARQIGMPEE TLQAFCQQLA S
|
| |