Gene EcHS_A0488 details

Gene Information

Locus tag: EcHS_A0488
Symbol: thiL
ID: 5594310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 499426
End bp: 500403
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 55%
IMG OID: 640919671
Product: thiamine monophosphate kinase
Protein accession: YP_001457256
Protein GI: 157159938
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
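
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 499426
END_BP = 500403
GENE_LENGTH_BP = 978
PROTEIN_LENGTH_AA = 325


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 500403 - 499426 + 1 gives 978 bp, and 978 / 3 - 1 gives 325 aa, in agreement with the listed values.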


Plasmid Coverage information

Num covering plasmid clones: 73
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATGTG GCGAGTTCTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGTTCTCGT 
CTTGATGTCG AACTGGGCAT CGGCGACGAT TGCGCTCTTC TCAATATCCC CGAGAAACAG
ACCCTGGCGA TCAGCACTGA TACGCTGGTG GCGGGCAACC ACTTCCTCCA TGATATCGAT
CCTGCTGATC TGGCGTATAA AGCACTGGCG GTGAACCTAA GCGATCTGGC AGCGATGGGG
GCCGATCCGG CCTGGCTGAC GCTGGCATTA ACCTTACCGG ACGTAGACGA AGCGTGGCTT
GAGTCCTTCA GCGACAGTTT GTTTGATCTT CTCAATTATT ACGATATGCA ACTCATTGGC
GGCGATACCA CGCGTGGGCC ATTATCAATG ACGTTGGGTA TCCACGGCTT TGTTCCGATG
GGACGAGCCT TAACGCGCTC TGGAGCGAAA CCGGGTGACT GGATCTATGT GACCGGTACA
CCGGGCGATA GCGCCGCCGG GCTGGCGATT TTGCAAAACC GTTTGCAGGT TGCCGATGCT
AAAGATGCGG ACTACTTGAT CAAACGTCAT CTCCGTCCAT CGCCGCGTAT TTTACAGGGA
CAGGCACTGC GCGATCTGGC AAATTCAGCT ATCGATCTCT CTGACGGTCT GATTTCCGAT
CTCGGGCATA TCGTGAAAGC CAGCGACTGC GGCGCACGTA TTGACCTGGC ATTGCTGCCG
TTTTCTGATG CGCTTTCTCG CCATGTTGAA CCGGAACAGG CGCTGCGCTG GGCGCTCTCT
GGCGGTGAAG ATTACGAGTT GTGTTTCACT GTGCCGGAAC TGAACCGTGG CGCGCTGGAT
GTGGCTCTCG GACACCTGGG CGTACCGTTT ACCTGTATCG GGCAAATGAC CGCCGATATC
GAAGGGCTTT GTTTTATTCG TGACGGCGAA CCTGTCACGT TTGACTGGAA AGGATATGAC
CATTTTGCCA CGCCATAA
 
Protein sequence
MACGEFSLIA RYFDRVRSSR LDVELGIGDD CALLNIPEKQ TLAISTDTLV AGNHFLHDID 
PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL ESFSDSLFDL LNYYDMQLIG
GDTTRGPLSM TLGIHGFVPM GRALTRSGAK PGDWIYVTGT PGDSAAGLAI LQNRLQVADA
KDADYLIKRH LRPSPRILQG QALRDLANSA IDLSDGLISD LGHIVKASDC GARIDLALLP
FSDALSRHVE PEQALRWALS GGEDYELCFT VPELNRGALD VALGHLGVPF TCIGQMTADI
EGLCFIRDGE PVTFDWKGYD HFATP
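
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch rather than the tool used to produce the record: it builds the codon assignments of NCBI translation table 11 by hand (the amino acid assignments are the same as in the standard code), strips the spacing used in the listing, and stops at the first stop codon. The names `CODON_TABLE`, `clean`, and `translate` are illustrative. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as grouped in the listing.
    print(translate("ATGGCATGTG GCGAGTTCTC CCTGATTGCC"))  # MACGEFSLIA
    # Final 18 bases, which are in frame and end in the TAA stop codon.
    print(translate("CATTTTGCCA CGCCATAA"))  # HFATP
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above.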