Gene EcSMS35_2020 details

Gene Information

Locus tag: EcSMS35_2020
Symbol: thiK
ID: 6142954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2042443
End bp: 2043267
Gene Length: 825 bp
Protein Length: 274 aa
Translation table: 11
GC content: 53%
IMG OID: 641616896
Product: thiamine kinase
Protein accession: YP_001744072
Protein GI: 170684187
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0510] Predicted choline kinase involved in LPS biosynthesis
TIGRFAM ID: [TIGR02721] thiamine kinase
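
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2042443
END_BP = 2043267
GENE_LENGTH_BP = 825
PROTEIN_LENGTH_AA = 274


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2043267 - 2042443 + 1 gives 825 bp, and 825 / 3 - 1 gives 274 aa, which agrees with the listed values.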


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.379564
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0000784787
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
GTGCCGTTTC GCAGCAATAA TCCCCTCACG CGCGACGAAT TGCTGTCGCG CTTTTTCCCG 
CAGTTTCATC CCGTCACGAC GTTTAATAGT GGGCTTAGTG GCGGGAGTTT TCTCATTGAA
CATCAGGGCC AGCGTTTTGT TGTGCGTCAG CCGCACGATC CTGATGCGCC GCAGTCCGCG
TTCTTGCGCC AGTATCGGGC TTTATCACAA CTACCCGCAT GCATTGCACC GAAGCCGCAT
TTATATCTCC GTGACTGGAT GGTAGTCGAC TATCTGCCCG GCGAGGTAAA AACGTATTTG
CCGGATACTA ACGAACTGGC AGGCTTGCTG TATTATCTAC ATCAACAACC GCGTTTTGGC
TGGCGAATAA CGCTGTTGCC GTTACTGGAA CTGTACTGGC AGCAAAGCGA TCCGGCGCGG
CGGACAGTGG GTTGGCTGCG AAGGTTAAAA CGTCTGCACA AAGCGCGGGA ACCACGGCCT
TTACGCTTAA GTCCATTGCA TATGGATGTC CACGCCGGAA ATTTAGTGCA TAGCGCGTCA
GGGTTAAAAC TCATCGACTG GGAGTATGCC GGAGATGGTG ATATCGCGCT GGAACTGGCG
GCGGTCTGGG TGGAAAATAC TGACCAGCAC CGGCAATTGG TCAATGACTA TGCCACTCGC
GCGAAGATTT ATCCGGCGCA ATTATGGCGT CAGGTCAGGC GATGGTTTCC CTGGCTGCTG
ATGCTCAAAG CAGGGTGGTT TGAGTACCGC TGGCGACAAA CCGGCGATCA ACAATTTATC
AGGCTGGCCG ATGACACCTG GCGGCAGCTA TTAATAAAAC AATAA
 
Protein sequence
MPFRSNNPLT RDELLSRFFP QFHPVTTFNS GLSGGSFLIE HQGQRFVVRQ PHDPDAPQSA 
FLRQYRALSQ LPACIAPKPH LYLRDWMVVD YLPGEVKTYL PDTNELAGLL YYLHQQPRFG
WRITLLPLLE LYWQQSDPAR RTVGWLRRLK RLHKAREPRP LRLSPLHMDV HAGNLVHSAS
GLKLIDWEYA GDGDIALELA AVWVENTDQH RQLVNDYATR AKIYPAQLWR QVRRWFPWLL
MLKAGWFEYR WRQTGDQQFI RLADDTWRQL LIKQ
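
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (bacterial, archaeal and plant plastid code) produces when GTG is used as an initiation codon. The sketch below shows one way to reproduce the protein from the gene sequence and to compute a GC fraction. It is an illustration only: `clean`, `translate_cds` and `gc_fraction` are hypothetical helper names, the codon table is built from the standard NCBI amino-acid string in TCAG order, and the set of alternative start codons is listed here as an assumption about table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same elongation code as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiation codons for table 11; any of them is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a recognised start codon")
    protein = ["M"]  # the initiation codon is translated as Met
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGCCGTTTC GCAGCAATAA TCCCCTCACG"))  # MPFRSNNPLT
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence (after `clean`) is a quick consistency check; `gc_fraction` on the same input can be compared with the GC content field.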