Gene ECH74115_5462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5462 
SymbolthiC 
ID6969670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5106142 
End bp5108037 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content56% 
IMG OID643389109 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002273510 
Protein GI209399239 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.416651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCAA CAAAACTGAC CCGTCGCGAA CAACGCGCCC AGGCCCAACA TTTTATCGAC 
ACCCTGGAAG GCACCGCCTT TCCCAACTCA AAACGCATTT ATATCACTGG CACACAACCC
GGCGTGCGCG TGCCGATGCG TGAGATCCAG CTTAGCCCGA CGCTAATTGG CGGTAGCAAA
GAACAGCCGC AGTACGAAGA AAACGAAGCG ATTCCGGTCT ACGACACCTC CGGCCCGTAT
GGCGATCCGC AGATCGCCAT TAACGTGCAG CAAGGGCTGG CAAAACTACG CCAGCCGTGG
ATCGATGCGC GCGGCGATAC CGAAGAACTT ACCGTGCGCA GTTCCGATTA CACTAAAGCG
CGGCTGGCAG ATGATGGCCT CGACGAGCTG CGTTTTAGCG GCGTATTAAC GCCAAAACGC
GCCAAAGCAG GACGCCGCGT CACACAACTG CACTACGCCC GCCAGGGCAT CATCACACCG
GAAATGGAAT TCATCGCCAT CCGCGAGAAT ATGGGCCGCG AGCGCATCCG TAGCGAAGTT
TTACGCCACC AGCATCCGGG AATGAGCTTT GGCGCACGTC TGCCGGAAAA TATCACTGCG
GAATTTGTCC GTGATGAAGT TGCTGCCGGA CGTGCGATTA TCCCGGCCAA CATTAATCAT
CCGGAATCGG AGCCGATGAT TATTGGTCGC AATTTCCTGG TAAAAGTTAA CGCCAATATC
GGCAACTCGG CGGTCACCTC TTCCATCGAA GAAGAAGTGG AAAAGCTGGT ATGGTCCACG
CGCTGGGGAG CGGATACGGT GATGGATCTC TCCACCGGTC GCTATATTCA CGAAACCCGC
GAGTGGATTT TGCGTAACAG CCCGGTGCCG ATTGGTACAG TGCCGATCTA CCAGGCGCTG
GAGAAGGTTA ATGGGATCGC CGAAGATCTT ACCTGGGAAG CGTTCCGCGA CACGCTGCTG
GAACAGGCCG AGCAAGGTGT GGATTACTTC ACTATCCATG CGGGGGTGCT GCTGCGCTAT
GTGCCGATGA CCGCGAAACG CCTGACCGGT ATCGTCTCGC GCGGCGGTTC GATTATGGCG
AAATGGTGCC TCTCCCATCA TCAGGAAAAT TTCCTCTATC AACACTTCCG CGAAATTTGT
GAAATCTGTG CCGCTTATGA CGTTTCGCTG TCGCTGGGCG ACGGTCTGCG CCCCGGTTCT
ATTCAGGACG CCAACGATGA AGCGCAGTTT GCCGAGCTGC ATACGCTGGG CGAACTGACA
AAAATCGCCT GGGAATATGA TGTGCAGGTG ATGATTGAAG GCCCAGGCCA CGTGCCGATG
CAGATGATCC GCCGCAATAT GACCGAGGAG TTAGAGCACT GCCACGAAGC GCCGTTTTAC
ACTCTGGGGC CGCTAACTAC CGATATTGCG CCGGGCTATG ACCACTTCAC GTCGGGGATT
GGTGCGGCGA TGATTGGCTG GTTTGGCTGC GCGATGCTCT GTTACGTAAC GCCAAAAGAG
CATCTGGGTC TGCCTAATAA AGAAGATGTT AAGCAGGGGC TTATCACCTA TAAGATTGCC
GCCCACGCCG CTGACCTGGC GAAAGGGCAT CCGGGCGCAC AAATTCGCGA TAACGCCATG
TCGAAAGCCC GCTTCGAATT TCGCTGGGAA GACCAGTTTA ATCTGGCCCT CGACCCGTTT
ACCGCCCGCG CTTATCACGA TGAAACCCTG CCGCAAGAGT CAGGTAAAGT CGCCCATTTT
TGCTCCATGT GTGGGCCGAA ATTCTGCTCG ATGAAAATCA GCCAGGAAGT GCGTGATTAC
GCCGCCACGC AAACTATTGA AATGGGAATA GCGGATATGT CGGAGAACTT CCGTGCCAGA
GGCGGAGAAA TCTACCTGCG TAAGGAGGAA GCGTGA
 
Protein sequence
MSATKLTRRE QRAQAQHFID TLEGTAFPNS KRIYITGTQP GVRVPMREIQ LSPTLIGGSK 
EQPQYEENEA IPVYDTSGPY GDPQIAINVQ QGLAKLRQPW IDARGDTEEL TVRSSDYTKA
RLADDGLDEL RFSGVLTPKR AKAGRRVTQL HYARQGIITP EMEFIAIREN MGRERIRSEV
LRHQHPGMSF GARLPENITA EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI
GNSAVTSSIE EEVEKLVWST RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL
EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA
KWCLSHHQEN FLYQHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF AELHTLGELT
KIAWEYDVQV MIEGPGHVPM QMIRRNMTEE LEHCHEAPFY TLGPLTTDIA PGYDHFTSGI
GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM
SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY
AATQTIEMGI ADMSENFRAR GGEIYLRKEE A