Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0499 |
Symbol | thiL |
ID | 6967794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 503475 |
End bp | 504452 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643384547 |
Product | thiamine monophosphate kinase |
Protein accession | YP_002269061 |
Protein GI | 209398517 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATGTG GCGAGTTCTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGTTCTCGT CTTGATGTCG AACTGGGCAT CGGCGACGAT TGCGCTCTTC TCAATATCCC CGAGAAACAG ACCCTGGCGA TCAGCACTGA TACGCTGGTG GCGGGCAACC ATTTCCTCCC TGATATCGAT CCTGCTGATC TGGCTTATAA AGCACTGGCG GTGAACCTAA GCGATCTGGC AGCGATGGGG GCCGATCCGG CCTGGCTGAC GCTGGCATTA ACCTTACCGG ACGTAGACGA AGCGTGGCTT GAGTCCTTCA GCGACAGTTT GTTTGATCTT CTCAATTATT ACGATATGCA ACTCATTGGC GGCGATACCA CGCGTGGGCC ATTATCAATG ACGTTGGGTA TCCACGGCTT TGTTCCGATG GGACGAGCCT TAACGCGCTC TGGGGCGAAA CCGGGTGACT GGATCTATGT GACCGGTACA CCGGGCGATA GCGCCGCCGG GCTGGCGATT TTGCAAAACC GTTTGCAGGT TGCCGATGCT AAAGATGCGG ACTACTTGAT CAAACGTCAT CTCCGTCCAT CGCCGCGTAT TTTACAGGGG CAGGCACTGC GCGATCTGGC AAATTCAGCT ATCGATCTCT CTGACGGTCT GATTTCCGAT CTCGGGCATA TCGTGAAAGC CAGCGACTGC GGCGCACGTA TTGACCTGGC ATTGCTGCCG TTTTCTGATG CGCTTTCTCG CCATGTTGTA CCGGAACAGG CGCTGCGCTG GGCGCTCTCT GGCGGTGAAG ATTACGAGTT GTGTTTCACG GTGCCGGAAC TGAACCGTGG CGCGCTGGAT GTGGCTCTCG GTCATCTGGG CGTACCGTTT ACCTGTATCG GGCAAATGAC CGCCGATATC GAAGGGCTTT GTTTTATTCG TGACGGCGAA CCTGTCACGT TAGACTGGAA AGGATATGAC CATTTTGCCA CGCCATAA
|
Protein sequence | MACGEFSLIA RYFDRVRSSR LDVELGIGDD CALLNIPEKQ TLAISTDTLV AGNHFLPDID PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL ESFSDSLFDL LNYYDMQLIG GDTTRGPLSM TLGIHGFVPM GRALTRSGAK PGDWIYVTGT PGDSAAGLAI LQNRLQVADA KDADYLIKRH LRPSPRILQG QALRDLANSA IDLSDGLISD LGHIVKASDC GARIDLALLP FSDALSRHVV PEQALRWALS GGEDYELCFT VPELNRGALD VALGHLGVPF TCIGQMTADI EGLCFIRDGE PVTLDWKGYD HFATP
|
| |