Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4494 |
Symbol | thiC |
ID | 6492374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 4375891 |
End bp | 4377786 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642744569 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002048149 |
Protein GI | 194449579 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0642711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.0188033 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT ACGCTGGAAG GCACCGCGTT TCCCAACTCG AAACGCATCT ACGTGACCGG TTCGCAGCAT GATATTCGCG TACCGATGCG CGAAATTCAA CTTAGCCCCA CGCTCATCGG CGGCAGTAAA GACAACCCGC AGTTTGAAGA GAACGAAGCC GTACCGGTAT ACGACACCTC CGGCCCCTAT GGCGATCCTG AGGTGGCGAT TAACGTCCAG CAGGGTCTGG CGAAACTGCG CCAGCCATGG ATTGACGCAC GTAACGATAG CGAAGAATTA GACGACCGTA GCTCGGCTTA TACCAGAGAA CGTCTGGCCG ACGATGGCCT GGACGATCTG CGCTTTACCG GCCTGCTGAC GCCAAAACGC GCTAAAGCGG GCAAGCGCGT CACCCAGTTA CACTACGCCC GCAAGGGGAT CGTCACTCCC GAAATGGAGT TCATCGCCAT CCGTGAAAAC ATGGGCCGCG AACGCATTCG TAGTGAAGTG CTGCGCCACC AGCATCCGGG GATGAGCTTT GGCGCGCGCC TGCCGGAAAA CATTACGCCG GAATTCGTGC GTGATGAAGT CGCCGCGGGC CGCGCGATTA TTCCCGCCAA CATCAACCAC CCGGAATCGG AGCCGATGAT TATCGGCCGC AACTTCCTGG TCAAAGTGAA TGCCAACATC GGTAACTCGG CGGTGACCTC CTCTATCGAA GAAGAGGTGG AAAAACTGGT GTGGTCAACC CGCTGGGGCG CGGATACGGT TATGGACCTC TCCACCGGCC GCTATATCCA CGAAACCCGC GAATGGATCC TGCGTAACAG CCCGGTACCG ATCGGCACCG TCCCGATCTA CCAGGCGCTG GAGAAGGTCA ACGGGATCGC CGAAGATCTT ACCTGGGAAG CGTTCCGCGA CACGCTGCTG GAGCAGGCCG AACAGGGCGT CGACTACTTC ACCATTCACG CGGGCGTGCT GTTGCGCTAC GTGCCGATGA CCGCCAAACG CCTGACCGGG ATTGTCTCGC GCGGCGGTTC GATCATGGCG AAGTGGTGCC TCTCGCATCA CAAAGAGAAC TTCCTGTTCG AACATTTCCG CGAGATCTGC GAAATCTGCG CCGCCTACGA CGTTTCCCTG TCGTTAGGCG ACGGCCTGCG CCCCGGCTCC ATTCAGGACG CCAACGACGA AGCGCAGTTC TCCGAGCTGC ATACGCTGGG CGAGCTGACC AAAATCGCCT GGGAATACGA CGTGCAGGTG ATGATTGAAG GCCCGGGTCA TGTACCCATG CATATGATTC AGCGCAACAT GACCGAAGAG CTGGAGAGCT GCCATGAAGC ACCGTTCTAC ACCTTAGGGC CATTGACCAC CGATATCGCG CCGGGCTATG ACCACTTCAC CTCCGGCATT GGCGCCGCGA TGATCGGCTG GTTCGGTTGT GCGATGCTGT GTTACGTCAC GCCGAAAGAG CATCTCGGCC TGCCGAACAA AGAAGATGTG AAGCAGGGGC TGATCACCTA CAAAATAGCC GCCCACGCCG CTGACTTAGC GAAAGGCCAT CCGGGCGCGC AGATCCGCGA TAACGCCATG TCGAAAGCGC GCTTCGAATT CCGCTGGGAA GATCAGTTTA ACCTCGCGCT CGACCCGTTC ACCGCCCGCG CTTATCACGA TGAAACCCTA CCGCAGGAGT CCGGTAAGGT CGCTCACTTC TGTTCCATGT GCGGGCCGAA GTTCTGCTCG ATGAAAATCA GCCAGGAAGT CCGCGACTAT GCCGCCGCAC AAACCATCGA AGTCGGCATG GCGGATATGT CGGAAAACTT CCGCGCCAAA GGCGGCGAAA TCTATCTCAA GCGGGAGGAA GCCTGA
|
Protein sequence | MSTTTLTRRE QRAKAQHFID TLEGTAFPNS KRIYVTGSQH DIRVPMREIQ LSPTLIGGSK DNPQFEENEA VPVYDTSGPY GDPEVAINVQ QGLAKLRQPW IDARNDSEEL DDRSSAYTRE RLADDGLDDL RFTGLLTPKR AKAGKRVTQL HYARKGIVTP EMEFIAIREN MGRERIRSEV LRHQHPGMSF GARLPENITP EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI GNSAVTSSIE EEVEKLVWST RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA KWCLSHHKEN FLFEHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF SELHTLGELT KIAWEYDVQV MIEGPGHVPM HMIQRNMTEE LESCHEAPFY TLGPLTTDIA PGYDHFTSGI GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY AAAQTIEVGM ADMSENFRAK GGEIYLKREE A
|
| |