Gene ECH74115_0499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0499 
SymbolthiL 
ID6967794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp503475 
End bp504452 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content55% 
IMG OID643384547 
Productthiamine monophosphate kinase 
Protein accessionYP_002269061 
Protein GI209398517 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATGTG GCGAGTTCTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGTTCTCGT 
CTTGATGTCG AACTGGGCAT CGGCGACGAT TGCGCTCTTC TCAATATCCC CGAGAAACAG
ACCCTGGCGA TCAGCACTGA TACGCTGGTG GCGGGCAACC ATTTCCTCCC TGATATCGAT
CCTGCTGATC TGGCTTATAA AGCACTGGCG GTGAACCTAA GCGATCTGGC AGCGATGGGG
GCCGATCCGG CCTGGCTGAC GCTGGCATTA ACCTTACCGG ACGTAGACGA AGCGTGGCTT
GAGTCCTTCA GCGACAGTTT GTTTGATCTT CTCAATTATT ACGATATGCA ACTCATTGGC
GGCGATACCA CGCGTGGGCC ATTATCAATG ACGTTGGGTA TCCACGGCTT TGTTCCGATG
GGACGAGCCT TAACGCGCTC TGGGGCGAAA CCGGGTGACT GGATCTATGT GACCGGTACA
CCGGGCGATA GCGCCGCCGG GCTGGCGATT TTGCAAAACC GTTTGCAGGT TGCCGATGCT
AAAGATGCGG ACTACTTGAT CAAACGTCAT CTCCGTCCAT CGCCGCGTAT TTTACAGGGG
CAGGCACTGC GCGATCTGGC AAATTCAGCT ATCGATCTCT CTGACGGTCT GATTTCCGAT
CTCGGGCATA TCGTGAAAGC CAGCGACTGC GGCGCACGTA TTGACCTGGC ATTGCTGCCG
TTTTCTGATG CGCTTTCTCG CCATGTTGTA CCGGAACAGG CGCTGCGCTG GGCGCTCTCT
GGCGGTGAAG ATTACGAGTT GTGTTTCACG GTGCCGGAAC TGAACCGTGG CGCGCTGGAT
GTGGCTCTCG GTCATCTGGG CGTACCGTTT ACCTGTATCG GGCAAATGAC CGCCGATATC
GAAGGGCTTT GTTTTATTCG TGACGGCGAA CCTGTCACGT TAGACTGGAA AGGATATGAC
CATTTTGCCA CGCCATAA
 
Protein sequence
MACGEFSLIA RYFDRVRSSR LDVELGIGDD CALLNIPEKQ TLAISTDTLV AGNHFLPDID 
PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL ESFSDSLFDL LNYYDMQLIG
GDTTRGPLSM TLGIHGFVPM GRALTRSGAK PGDWIYVTGT PGDSAAGLAI LQNRLQVADA
KDADYLIKRH LRPSPRILQG QALRDLANSA IDLSDGLISD LGHIVKASDC GARIDLALLP
FSDALSRHVV PEQALRWALS GGEDYELCFT VPELNRGALD VALGHLGVPF TCIGQMTADI
EGLCFIRDGE PVTLDWKGYD HFATP