Gene Information |
Locus tag | SeAg_B4407 |
Symbol | thiC |
ID | 6793012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 4298680 |
End bp | 4300575 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642778502 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002149072 |
Protein GI | 197250591 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
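The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon, which produces no amino acid. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4298680, 4300575)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the values reproduce the Gene Length (1896 bp) and Protein Length (631 aa) fields of the record.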
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.679063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT ACGCTGGAAG GGACTGCCTT TCCCAACTCG AAACGCATCT ACGTGACCGG TTCGCAGCAT GATATTCGCG TACCGATGCG CGAAATTCAA CTTAGCCCAA CGCTCATCGG CGGCAGCAAA GACAACCCGC AGTTTGAAGA GAACGAAGCC GTGCCGGTGT ACGACACCTC CGGCCCCTAT GGCGATCCTG AGGTGGCGAT TAACGTCCAG CAGGGTCTGG CAAAGCTACG CCAGCCGTGG ATTGAAGCGC GGGCTGATGT AGAAACGCTT GCTGACCGCA GTTCTGCCTA TACCCGCGAA CGCTTGACAG ATGAAGGGCT TGACGCATTA CGCTTTACCG GTCTGTTAAC GCCAAAACGC GCCAAAGCCG GACACCGCGT GACGCAACTT CATTATGCCC GCCAGGGGAT CGTCACTCCC GAAATGGAGT TCATCGCCAT CCGTGAAAAT ATGGGCCGCG AGCGCATTCG CAGTGAAGTA CTGCGCCACC AGCATCCGGG GATGAGCTTT GGCGCGCGCC TGCCGGAAAA CATTACCCCG GAATTCGTGC GTGATGAAGT CGCCGCGGGC CGCGCGATTA TTCCCGCCAA CATCAACCAC CCGGAATCCG AGCCGATGAT TATCGGCCGC AACTTCCTGG TCAAAGTGAA TGCCAACATC GGCAACTCGG CGGTGACCTC CTCCATCGAA GAAGAGGTGG AAAAACTGGT GTGGTCGACC CGCTGGGGCG CGGACACGGT AATGGACCTT TCCACCGGCC GCTATATTCA CGAAACCCGT GAGTGGATCC TGCGTAACAG CCCGGTACCG ATCGGCACCG TCCCGATCTA CCAGGCGCTG GAAAAGGTCA ACGGGATCGC CGAAGATCTT ACCTGGGAAG CGTTCCGCGA CACGCTGCTG GAGCAGGCCG AACAGGGCGT CGACTACTTC ACCATCCACG CGGGCGTGCT GCTGCGCTAC GTGCCGATGA CCGCCAAGCG CCTGACCGGC ATTGTCTCAC GCGGCGGTTC AATCATGGCG AAATGGTGCC TTTCTCATCA CAAAGAGAAC TTCCTGTTCG AACATTTCCG CGAGATTTGT GAAATCTGCG CCGCCTACGA CGTTTCCCTG TCGCTGGGCG ACGGCCTGCG TCCCGGCTCC ATTCAGGACG CCAACGACGA AGCGCAGTTC TCCGAGCTGC ATACGCTGGG CGAATTGACC AAAATCGCCT GGGAATACGA CGTGCAGGTG ATGATTGAAG GCCCGGGCCA CGTACCCATG CATATGATCC AGCGCAATAT GACCGAAGAG CTGGAGAGCT GCCATGAAGC ACCGTTCTAC ACCTTAGGGC CATTGACCAC CGATATCGCG CCGGGCTATG ACCACTTCAC CTCCGGGATC GGTGCCGCGA TGATCGGCTG GTTTGGCTGC GCGATGCTGT GTTATGTGAC GCCGAAAGAA CACCTCGGCC TGCCGAACAA AGAAGATGTG AAGCAGGGGC TAATCACCTA CAAAATCGCC GCCCACGCCG CGGATTTAGC CAAAGGACAT CCGGGCGCAC AGATCCGCGA TAACGCCATG TCGAAAGCGC GCTTTGAATT CCGCTGGGAA GATCAGTTTA ACCTCGCGCT CGACCCGTTC ACCGCCCGCG CTTATCACGA TGAAACCCTG CCGCAGGAGT CCGGTAAGGT CGCTCACTTC TGTTCCATGT GCGGGCCGAA GTTCTGCTCG ATGAAAATCA GCCAGGAAGT CCGTGACTAC 
GCCGCCGCGC AAACCATTGA AGTGGGGATG GCGAATATGT CGGAAAACTT CCGCGCCAAA GGCGGCGAAA TTTATCTCAA GCGGGAGGAA GTCTGA
|
Protein sequence | MSTTTLTRRE QRAKAQHFID TLEGTAFPNS KRIYVTGSQH DIRVPMREIQ LSPTLIGGSK DNPQFEENEA VPVYDTSGPY GDPEVAINVQ QGLAKLRQPW IEARADVETL ADRSSAYTRE RLTDEGLDAL RFTGLLTPKR AKAGHRVTQL HYARQGIVTP EMEFIAIREN MGRERIRSEV LRHQHPGMSF GARLPENITP EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI GNSAVTSSIE EEVEKLVWST RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA KWCLSHHKEN FLFEHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF SELHTLGELT KIAWEYDVQV MIEGPGHVPM HMIQRNMTEE LESCHEAPFY TLGPLTTDIA PGYDHFTSGI GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY AAAQTIEVGM ANMSENFRAK GGEIYLKREE V
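The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with the stop codon TGA, so no reverse complement is applied even though the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may initiate translation; alternative start codons are not handled here, which does not matter for this ATG-initiated gene. The helper names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard table and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as grouped in the record.
head = clean_sequence("ATGTCTACAA CAACGTTAAC CCGCCGCGAG")
print(translate(head))
```

Applied to the first 30 bases, the sketch yields the first ten residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.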