Gene EcE24377A_0448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0448 
SymbolthiL 
ID5586076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp468509 
End bp469486 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content54% 
IMG OID640924172 
Productthiamine monophosphate kinase 
Protein accessionYP_001461599 
Protein GI157157291 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATGTG GCGAGTTCTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGTTCTCGT 
CTTGATGTCG AACTGGGCAT CGGCGACGAT TGCGCACTTC TCAATATCCC CGAGAAACAG
ACCCTGGCGA TCAGCACTGA TACGCTGGTG GCGGGTAACC ATTTCCTCCC TGATATCGAT
CCTGCTGATC TGGCTTATAA AGCACTGGCG GTGAACCTAA GCGATCTGGC AGCGATGGGG
GCCGATCCGG CCTGGCTGAC GCTGGCATTA ACCTTACCGG ACGTAGACGA AGCGTGGCTT
GAGTCCTTCA GCGACAGTTT GTTTGATCTT CTCAATTATT ACGATATGCA ACTCATTGGC
GGCGATACTA CGCGTGGGCC ATTATCAATG ACGTTGGGTA TCCACGGCTT TGTTCCGATG
GGACGAGCCT TAACGCGCTC TGGGGCGAAA CCGGGTGACT GGATCTATGT GACCGGTACA
CCGGGCGATA GCGCCGCCGG GCTGGCGATT TTGCAAAACC GTTTGCAGGT TGCCGATGCT
AAAGATGCTG ACTACTTGAT CAAACGTCAT CTCCGTCCAT CGCCGCGTAT TTTACAGGGG
CAGGCACTGC GCGATCTGGC AAATTCAGCT ATCGATCTCT CTGACGGTCT GATTTCCGAT
CTCGGGCATA TCGTGAAAGC CAGCGACTGC GGCGCACGTA TTGACCTGGC ATTGCTGCCG
TTTTCTGATG CGCTTTCTCG CCATGTTGAA CCGGAACAGG CGCTGCGCTG GGCGCTCTCT
GGCGGTGAAG ATTACGAGTT GTGTTTCACG GTGCCGGAAC TGAACCGTGG CGCGCTGGAT
GTTGCTCTCG GACACCTTGG CGTACCGTTT ACCTGTATCG GGCAAATGAC CGCCGATATC
GAAGGGCTTT GTTTTATTCG TGACGGCGAA CCTGTCACGT TTGACTGGAA AGGATATGAC
CATTTTGCCA CGCCATAA
 
Protein sequence
MACGEFSLIA RYFDRVRSSR LDVELGIGDD CALLNIPEKQ TLAISTDTLV AGNHFLPDID 
PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL ESFSDSLFDL LNYYDMQLIG
GDTTRGPLSM TLGIHGFVPM GRALTRSGAK PGDWIYVTGT PGDSAAGLAI LQNRLQVADA
KDADYLIKRH LRPSPRILQG QALRDLANSA IDLSDGLISD LGHIVKASDC GARIDLALLP
FSDALSRHVE PEQALRWALS GGEDYELCFT VPELNRGALD VALGHLGVPF TCIGQMTADI
EGLCFIRDGE PVTFDWKGYD HFATP