Gene EcolC_3216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3216 
Symbol 
ID6066699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3523036 
End bp3524013 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content54% 
IMG OID641602631 
Productthiamine monophosphate kinase 
Protein accessionYP_001726165 
Protein GI170021211 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000150062 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCATGTG GCGAGTTCTC CCTGATTGCC CGTTATTTTG ACCGTGTAAG AAGTTCTCGT 
CTTGATGTCG AACTGGGCAT CGGCGACGAT TGCGCACTTC TCAATATCCC CGAGAAACAG
ACCCTGGCGA TCAGCACTGA TACGCTGGTG GCGGGTAACC ATTTCCTCCC TGATATCGAT
CCTGCTGATC TGGCTTATAA AGCACTGGCG GTGAACCTAA GCGATCTGGC AGCGATGGGG
GCCGATCCGG CCTGGCTGAC GCTGGCATTA ACCTTACCGG ACGTAGACGA AGCGTGGCTT
GAGTCCTTCA GCGACAGTTT GTTTGATCTT CTCAATTATT ACGATATGCA ACTCATTGGC
GGCGATACCA CGCGTGGGCC ATTATCAATG ACGTTGGGTA TCCACGGCTT TGTTCCGATG
GGACGAGCCT TAACGCGCTC TGGGGCGAAA CCGGGTGACT GGATCTATGT GACCGGTACA
CCGGGCGATA GCGCCGCCGG GCTGGCGATT TTGCAAAACC GTTTGCAGGT TGCCGATGCT
AAAGATGCTG ACTACTTGAT CAAACGTCAT CTCCGTCCAT CGCCGCGTAT TTTACAGGGG
CAGGCACTGC GCGATCTGGC AAATTCAGCT ATCGATCTCT CTGACGGTCT GATTTCCGAT
CTCGGGCATA TCGTGAAAGC CAGCGACTGC GGCGCACGTA TTGACCTGGC ATTGCTGCCG
TTTTCTGATG CGCTTTCTCG CCATGTTGAA CCGGAACAGG CGCTGCGCTG GGCGCTCTCT
GGCGGTGAAG ATTACGAGTT GTGTTTCACG GTGCCGGAAC TGAACCGTGG CGCGCTGGAT
GTTGCTCTCG GACACCTTGG CGTACCGTTT ACCTGTATCG GGCAAATGAC CGCCGATATC
GAAGGGCTTT GTTTTATTCG TGACGGCGAA CCTGTCACGT TTGACTGGAA AGGATATGAC
CATTTTGCCA CGCCATAA
 
Protein sequence
MACGEFSLIA RYFDRVRSSR LDVELGIGDD CALLNIPEKQ TLAISTDTLV AGNHFLPDID 
PADLAYKALA VNLSDLAAMG ADPAWLTLAL TLPDVDEAWL ESFSDSLFDL LNYYDMQLIG
GDTTRGPLSM TLGIHGFVPM GRALTRSGAK PGDWIYVTGT PGDSAAGLAI LQNRLQVADA
KDADYLIKRH LRPSPRILQG QALRDLANSA IDLSDGLISD LGHIVKASDC GARIDLALLP
FSDALSRHVE PEQALRWALS GGEDYELCFT VPELNRGALD VALGHLGVPF TCIGQMTADI
EGLCFIRDGE PVTFDWKGYD HFATP