Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A4374 |
Symbol | thiC |
ID | 6515514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 4246493 |
End bp | 4248388 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642749326 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002117065 |
Protein GI | 194734581 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.701948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT ACGCTGGAAG GCACCGCATT TCCCAACTCG AAACGCATCT ACGTGACCGG TTCGCAGCAT GATATTCGCG TACCGATGCG CGAAATTCAA CTTAGCCCAA CGCTCATCGG CGGCAGCAAA GACAACCCGC AGTTTGAAGA GAACGAAGCC GTGCCGGTGT ACGACACCTC CGGCCCCTAT GGCGATCCTG AGGTGGCGAT TAACGTCCAG CAGGGTCTGG CGAAACTGCG CCAGCCATGG ATTGACGCAC GTAACGATAG CGAAGAATTA GACGACCGTA GCTCGGCTTA TACCAGAGAA CGTCTGGCCG ACGATGGCCT GGACGATCTG CGTTTTACCG GCCTGCTGAC GCCAAAACGC GCTAAAGCGG GCAAGCGCGT CACCCAGTTA CACTACGCCC GCAACGGGAT CGTCACTCCC GAAATGGAGT TCATCGCCAT CCGTGAAAAT ATGGGCCGCG AGCGCATTCG CAGTGAAGTA CTGCGCCACC AGCATCCGGG GATGAGCTTT GGCGCACGCC TGCCGGAAAA CATCACCCCG GAATTCGTGC GTGATGAAGT CGCCGCCGGA CGCGCCATCA TCCCCGCCAA CATCAACCAC CCGGAATCGG AGCCGATGAT TATTGGCCGC AACTTCCTGG TGAAGGTCAA CGCCAATATC GGCAACTCGG CGGTGACCTC CTCTATCGAA GAAGAGGTGG AAAAACTGGT GTGGGCGACG CGCTGGGGGG CGGATACGGT AATGGACCTT TCCACCGGGC GCTATATTCA CGAAACCCGT GAGTGGATCC TGCGTAATAG CCCAGTACCA ATCGGCACGG TGCCGATCTA CCAGGCGCTG GAAAAGGTCA ACGGGATCGC CGAGGATCTT ACCTGGGAAG CCTTCCGCGA CACGCTGCTG GAGCAGGCCG AACAGGGCGT CGACTACTTC ACCATTCACG CGGGCGTGCT GCTGCGTTAC GTGCCGATGA CCGCCAAGCG CCTGACCGGC ATTGTCTCGC GCGGCGGTTC GATCATGGCG AAGTGGTGTC TCTCCCATCA CAAAGAGAAC TTCCTGTTCG AACATTTCCG CGAGATTTGC GAAATCTGCG CCGCCTACGA CGTTTCCCTG TCGCTGGGCG ACGGCCTGCG ACCGGGATCG ATTCAGGACG CGAACGACGA AGCGCAGTTC TCCGAGCTGC ATACGCTGGG CGAGCTGACC AAAATCGCCT GGGAATACGA CGTGCAGGTG ATGATTGAAG GCCCGGGCCA CGTGCCGATG CATATGATTC AGCGCAACAT GACCGAAGAG CTGGAGCACT GCCATGAAGC GCCGTTCTAC ACCTTAGGGC CGCTGACTAC CGATATCGCG CCGGGCTATG ACCACTTCAC CTCCGGCATT GGCGCGGCGA TGATCGGCTG GTTCGGCTGT GCGATGCTGT GTTACGTGAC GCCGAAAGAG CATCTCGGCC TGCCGAACAA AGAAGATGTG AAGCAGGGGC TGATCACCTA CAAGATCGCC GCCCACGCCG CTGACTTAGC CAAAGGGCAT CCGGGCGCGC AGATTCGCGA TAACGCGATG TCGAAAGCGC GCTTTGAATT CCGCTGGGAG GATCAGTTTA ACCTCGCGCT CGACCCGTTC ACCGCCCGCG CCTGGCACGA TGAAACCCTG CCACAGGAGT CCGGCAAAGT GGCGCACTTT TGCTCGATGT GCGGGCCGAA ATTCTGCTCA ATGAAAATCA GCCAGGAAGT TCGTGATTAC GCCGCCGCGC AAACTATTGA AGTCGGGATG GCGGATATGT CGGAAAACTT CCGCGCCAAA GGCGGCGAAA TCTATCTCAA GCGGGAGGAA GTCTGA
|
Protein sequence | MSTTTLTRRE QRAKAQHFID TLEGTAFPNS KRIYVTGSQH DIRVPMREIQ LSPTLIGGSK DNPQFEENEA VPVYDTSGPY GDPEVAINVQ QGLAKLRQPW IDARNDSEEL DDRSSAYTRE RLADDGLDDL RFTGLLTPKR AKAGKRVTQL HYARNGIVTP EMEFIAIREN MGRERIRSEV LRHQHPGMSF GARLPENITP EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI GNSAVTSSIE EEVEKLVWAT RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA KWCLSHHKEN FLFEHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF SELHTLGELT KIAWEYDVQV MIEGPGHVPM HMIQRNMTEE LEHCHEAPFY TLGPLTTDIA PGYDHFTSGI GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM SKARFEFRWE DQFNLALDPF TARAWHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY AAAQTIEVGM ADMSENFRAK GGEIYLKREE V
|
| |