Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0453 |
Symbol | thiL |
ID | 6145338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 461790 |
End bp | 462767 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615347 |
Product | thiamine monophosphate kinase |
Protein accession | YP_001742554 |
Protein GI | 170681628 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATGTG GCGAGTTCTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGTTCTCGT CTTGATGTCG AACTGGGCAT CGGCGACGAT TGCGCACTTC TCAATATCCC CGAGAAGCAG ACCCTGGCGA TCAGCACTGA TACGCTGGTG GCGGGCAACC ACTTCCTCCC TGATATCGAT CCTGCTGATC TGGCGTATAA AGCACTGGCG GTGAACCTAA GCGATCTGGC AGCGATGGGG GCCGATCCAG CCTGGCTGAC GCTGGCATTA ACCTTACCGG ACGTCGACGA AGCGTGGCTT GAGTCCTTCA GCGACAGTTT GTTTGATCTT CTCAATTATT ACGATATGCA ACTCATTGGC GGCGATACCA CGCGTGGGCC ATTATCAATG ACGCTGGGTA TCCACGGCTT TGTTCCGATG GGACGAGCCT TAACGCGCGC TGGGGCGAAA CCGGGTGACT GGATCTATGT GACCGGTACA CCGGGCGATA GCGCCGCCGG GCTGGCGATT TTGCAAAACC GTTTGCAGGT TGCCGATGCT AAAGATGCGG ACTACTTGAT CAAACGTCAT CTCCGTCCAT CGCCGCGTAT TTTACAGGGG CAGGCACTGC GCGATCTGGC AAATTCAGCT ATCGATCTCT CTGACGGTCT GATTTCCGAT CTCGGGCATA TCGTGAAAGC CAGCGACTGC GGCGCACGTA TTGACCTGGC ATTGCTGCCG TTTTCTGATG CGCTTTCTCG CCATGTTGAA CCTGAACAGG CGTTGCGCTG GGCGCTCTCT GGCGGTGAAG ATTACGAGTT GTGTTTCACT GTGCCGGAAC TGAACCGTGG CGCGCTGGAT GTGGCTCTCG GACACCTGGG CGTACCGTTT ACCTGTATCG GGCAAATGAC CGCCGATATC GAAGGGCTTT GTTTTATTCG TGACGGCGAA CCTGTCACGT TAGACTGGAA AGGATATGAC CATTTTGCCA CGCCATAA
|
Protein sequence | MACGEFSLIA RYFDRVRSSR LDVELGIGDD CALLNIPEKQ TLAISTDTLV AGNHFLPDID PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL ESFSDSLFDL LNYYDMQLIG GDTTRGPLSM TLGIHGFVPM GRALTRAGAK PGDWIYVTGT PGDSAAGLAI LQNRLQVADA KDADYLIKRH LRPSPRILQG QALRDLANSA IDLSDGLISD LGHIVKASDC GARIDLALLP FSDALSRHVE PEQALRWALS GGEDYELCFT VPELNRGALD VALGHLGVPF TCIGQMTADI EGLCFIRDGE PVTLDWKGYD HFATP
|
| |