Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5462 |
Symbol | thiC |
ID | 6969670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5106142 |
End bp | 5108037 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643389109 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002273510 |
Protein GI | 209399239 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.416651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCAA CAAAACTGAC CCGTCGCGAA CAACGCGCCC AGGCCCAACA TTTTATCGAC ACCCTGGAAG GCACCGCCTT TCCCAACTCA AAACGCATTT ATATCACTGG CACACAACCC GGCGTGCGCG TGCCGATGCG TGAGATCCAG CTTAGCCCGA CGCTAATTGG CGGTAGCAAA GAACAGCCGC AGTACGAAGA AAACGAAGCG ATTCCGGTCT ACGACACCTC CGGCCCGTAT GGCGATCCGC AGATCGCCAT TAACGTGCAG CAAGGGCTGG CAAAACTACG CCAGCCGTGG ATCGATGCGC GCGGCGATAC CGAAGAACTT ACCGTGCGCA GTTCCGATTA CACTAAAGCG CGGCTGGCAG ATGATGGCCT CGACGAGCTG CGTTTTAGCG GCGTATTAAC GCCAAAACGC GCCAAAGCAG GACGCCGCGT CACACAACTG CACTACGCCC GCCAGGGCAT CATCACACCG GAAATGGAAT TCATCGCCAT CCGCGAGAAT ATGGGCCGCG AGCGCATCCG TAGCGAAGTT TTACGCCACC AGCATCCGGG AATGAGCTTT GGCGCACGTC TGCCGGAAAA TATCACTGCG GAATTTGTCC GTGATGAAGT TGCTGCCGGA CGTGCGATTA TCCCGGCCAA CATTAATCAT CCGGAATCGG AGCCGATGAT TATTGGTCGC AATTTCCTGG TAAAAGTTAA CGCCAATATC GGCAACTCGG CGGTCACCTC TTCCATCGAA GAAGAAGTGG AAAAGCTGGT ATGGTCCACG CGCTGGGGAG CGGATACGGT GATGGATCTC TCCACCGGTC GCTATATTCA CGAAACCCGC GAGTGGATTT TGCGTAACAG CCCGGTGCCG ATTGGTACAG TGCCGATCTA CCAGGCGCTG GAGAAGGTTA ATGGGATCGC CGAAGATCTT ACCTGGGAAG CGTTCCGCGA CACGCTGCTG GAACAGGCCG AGCAAGGTGT GGATTACTTC ACTATCCATG CGGGGGTGCT GCTGCGCTAT GTGCCGATGA CCGCGAAACG CCTGACCGGT ATCGTCTCGC GCGGCGGTTC GATTATGGCG AAATGGTGCC TCTCCCATCA TCAGGAAAAT TTCCTCTATC AACACTTCCG CGAAATTTGT GAAATCTGTG CCGCTTATGA CGTTTCGCTG TCGCTGGGCG ACGGTCTGCG CCCCGGTTCT ATTCAGGACG CCAACGATGA AGCGCAGTTT GCCGAGCTGC ATACGCTGGG CGAACTGACA AAAATCGCCT GGGAATATGA TGTGCAGGTG ATGATTGAAG GCCCAGGCCA CGTGCCGATG CAGATGATCC GCCGCAATAT GACCGAGGAG TTAGAGCACT GCCACGAAGC GCCGTTTTAC ACTCTGGGGC CGCTAACTAC CGATATTGCG CCGGGCTATG ACCACTTCAC GTCGGGGATT GGTGCGGCGA TGATTGGCTG GTTTGGCTGC GCGATGCTCT GTTACGTAAC GCCAAAAGAG CATCTGGGTC TGCCTAATAA AGAAGATGTT AAGCAGGGGC TTATCACCTA TAAGATTGCC GCCCACGCCG CTGACCTGGC GAAAGGGCAT CCGGGCGCAC AAATTCGCGA TAACGCCATG TCGAAAGCCC GCTTCGAATT TCGCTGGGAA GACCAGTTTA ATCTGGCCCT CGACCCGTTT ACCGCCCGCG CTTATCACGA TGAAACCCTG CCGCAAGAGT CAGGTAAAGT CGCCCATTTT TGCTCCATGT GTGGGCCGAA ATTCTGCTCG ATGAAAATCA GCCAGGAAGT GCGTGATTAC GCCGCCACGC AAACTATTGA AATGGGAATA GCGGATATGT CGGAGAACTT CCGTGCCAGA GGCGGAGAAA TCTACCTGCG TAAGGAGGAA GCGTGA
|
Protein sequence | MSATKLTRRE QRAQAQHFID TLEGTAFPNS KRIYITGTQP GVRVPMREIQ LSPTLIGGSK EQPQYEENEA IPVYDTSGPY GDPQIAINVQ QGLAKLRQPW IDARGDTEEL TVRSSDYTKA RLADDGLDEL RFSGVLTPKR AKAGRRVTQL HYARQGIITP EMEFIAIREN MGRERIRSEV LRHQHPGMSF GARLPENITA EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI GNSAVTSSIE EEVEKLVWST RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA KWCLSHHQEN FLYQHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF AELHTLGELT KIAWEYDVQV MIEGPGHVPM QMIRRNMTEE LEHCHEAPFY TLGPLTTDIA PGYDHFTSGI GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY AATQTIEMGI ADMSENFRAR GGEIYLRKEE A
|
| |