Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0448 |
Symbol | thiL |
ID | 5586076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 468509 |
End bp | 469486 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640924172 |
Product | thiamine monophosphate kinase |
Protein accession | YP_001461599 |
Protein GI | 157157291 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATGTG GCGAGTTCTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGTTCTCGT CTTGATGTCG AACTGGGCAT CGGCGACGAT TGCGCACTTC TCAATATCCC CGAGAAACAG ACCCTGGCGA TCAGCACTGA TACGCTGGTG GCGGGTAACC ATTTCCTCCC TGATATCGAT CCTGCTGATC TGGCTTATAA AGCACTGGCG GTGAACCTAA GCGATCTGGC AGCGATGGGG GCCGATCCGG CCTGGCTGAC GCTGGCATTA ACCTTACCGG ACGTAGACGA AGCGTGGCTT GAGTCCTTCA GCGACAGTTT GTTTGATCTT CTCAATTATT ACGATATGCA ACTCATTGGC GGCGATACTA CGCGTGGGCC ATTATCAATG ACGTTGGGTA TCCACGGCTT TGTTCCGATG GGACGAGCCT TAACGCGCTC TGGGGCGAAA CCGGGTGACT GGATCTATGT GACCGGTACA CCGGGCGATA GCGCCGCCGG GCTGGCGATT TTGCAAAACC GTTTGCAGGT TGCCGATGCT AAAGATGCTG ACTACTTGAT CAAACGTCAT CTCCGTCCAT CGCCGCGTAT TTTACAGGGG CAGGCACTGC GCGATCTGGC AAATTCAGCT ATCGATCTCT CTGACGGTCT GATTTCCGAT CTCGGGCATA TCGTGAAAGC CAGCGACTGC GGCGCACGTA TTGACCTGGC ATTGCTGCCG TTTTCTGATG CGCTTTCTCG CCATGTTGAA CCGGAACAGG CGCTGCGCTG GGCGCTCTCT GGCGGTGAAG ATTACGAGTT GTGTTTCACG GTGCCGGAAC TGAACCGTGG CGCGCTGGAT GTTGCTCTCG GACACCTTGG CGTACCGTTT ACCTGTATCG GGCAAATGAC CGCCGATATC GAAGGGCTTT GTTTTATTCG TGACGGCGAA CCTGTCACGT TTGACTGGAA AGGATATGAC CATTTTGCCA CGCCATAA
|
Protein sequence | MACGEFSLIA RYFDRVRSSR LDVELGIGDD CALLNIPEKQ TLAISTDTLV AGNHFLPDID PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL ESFSDSLFDL LNYYDMQLIG GDTTRGPLSM TLGIHGFVPM GRALTRSGAK PGDWIYVTGT PGDSAAGLAI LQNRLQVADA KDADYLIKRH LRPSPRILQG QALRDLANSA IDLSDGLISD LGHIVKASDC GARIDLALLP FSDALSRHVE PEQALRWALS GGEDYELCFT VPELNRGALD VALGHLGVPF TCIGQMTADI EGLCFIRDGE PVTFDWKGYD HFATP
|
| |