Gene EcSMS35_0453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0453 
SymbolthiL 
ID6145338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp461790 
End bp462767 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content55% 
IMG OID641615347 
Productthiamine monophosphate kinase 
Protein accessionYP_001742554 
Protein GI170681628 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATGTG GCGAGTTCTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGTTCTCGT 
CTTGATGTCG AACTGGGCAT CGGCGACGAT TGCGCACTTC TCAATATCCC CGAGAAGCAG
ACCCTGGCGA TCAGCACTGA TACGCTGGTG GCGGGCAACC ACTTCCTCCC TGATATCGAT
CCTGCTGATC TGGCGTATAA AGCACTGGCG GTGAACCTAA GCGATCTGGC AGCGATGGGG
GCCGATCCAG CCTGGCTGAC GCTGGCATTA ACCTTACCGG ACGTCGACGA AGCGTGGCTT
GAGTCCTTCA GCGACAGTTT GTTTGATCTT CTCAATTATT ACGATATGCA ACTCATTGGC
GGCGATACCA CGCGTGGGCC ATTATCAATG ACGCTGGGTA TCCACGGCTT TGTTCCGATG
GGACGAGCCT TAACGCGCGC TGGGGCGAAA CCGGGTGACT GGATCTATGT GACCGGTACA
CCGGGCGATA GCGCCGCCGG GCTGGCGATT TTGCAAAACC GTTTGCAGGT TGCCGATGCT
AAAGATGCGG ACTACTTGAT CAAACGTCAT CTCCGTCCAT CGCCGCGTAT TTTACAGGGG
CAGGCACTGC GCGATCTGGC AAATTCAGCT ATCGATCTCT CTGACGGTCT GATTTCCGAT
CTCGGGCATA TCGTGAAAGC CAGCGACTGC GGCGCACGTA TTGACCTGGC ATTGCTGCCG
TTTTCTGATG CGCTTTCTCG CCATGTTGAA CCTGAACAGG CGTTGCGCTG GGCGCTCTCT
GGCGGTGAAG ATTACGAGTT GTGTTTCACT GTGCCGGAAC TGAACCGTGG CGCGCTGGAT
GTGGCTCTCG GACACCTGGG CGTACCGTTT ACCTGTATCG GGCAAATGAC CGCCGATATC
GAAGGGCTTT GTTTTATTCG TGACGGCGAA CCTGTCACGT TAGACTGGAA AGGATATGAC
CATTTTGCCA CGCCATAA
 
Protein sequence
MACGEFSLIA RYFDRVRSSR LDVELGIGDD CALLNIPEKQ TLAISTDTLV AGNHFLPDID 
PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL ESFSDSLFDL LNYYDMQLIG
GDTTRGPLSM TLGIHGFVPM GRALTRAGAK PGDWIYVTGT PGDSAAGLAI LQNRLQVADA
KDADYLIKRH LRPSPRILQG QALRDLANSA IDLSDGLISD LGHIVKASDC GARIDLALLP
FSDALSRHVE PEQALRWALS GGEDYELCFT VPELNRGALD VALGHLGVPF TCIGQMTADI
EGLCFIRDGE PVTLDWKGYD HFATP