Gene Information |
Locus tag | CCC13826_1278 |
Symbol | thiH |
ID | 5596332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Campylobacter concisus 13826 |
Kingdom | Bacteria |
Replicon accession | NC_009802 |
Strand | + |
Start bp | 1013843 |
End bp | 1014997 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640929573 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001466864 |
Protein GI | 157165154 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
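
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 1013843
END_BP = 1014997


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    # Prints: 1155 384
    print(length, expected_protein_length(length))
```

Under these assumptions the result agrees with the listed Gene Length (1155 bp) and Protein Length (384 aa).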
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.173544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAATTTA CAAGAAACGA CCACATGAAG CTGCTACCTC ACATGCAGGA TGTTGGCAGC GACATTATGG ATGAGATTTT AAAAGAGTGC GCAAGCTACA AGCCAGAAAT TTACAGCGAA GCAGACGTAA AAGCAGCTCT TAATGCAAAG CACTGCTCGC TTGAAAATTT AAAAGCCCTG CTCTCGCCTG CTGCAGCACC ATTTTTAGAG CCAATAGCCC AGCTTGCTCA AGCAAAAACA AGGGCAAATT TTGGCTCAAA CATAACGCTT TTTACCCCGC TTTACATAGC AAACTACTGC GATAATCTCT GCGTTTATTG CGGTTTTAAC GCTAAAAATA ATATAAAAAG AGCAAAGCTG AGCGACGAGG AGATCACAAG AGAGTTAAGA GAAATTTCAA AGAGTGGCTT AGAGGAAATT TTAATTCTAA CTGGCGAGAG CGAGACAAAC TCAAGTGTCG CTTACATCGC AAACGCCTGC GCTTTGGCAA AGAAATTTTT TAAAGTCGTT GGAGTTGAAA TTTACCCATT AAACTCAGAG GGCTACGCCC TGCTTCACAA AAGTGGCGCA GACTACGTGA CCGTTTTTCA AGAGACCTAC AATCCCACAA AATACGAAAA AATCCACCTT GGCGGCAATA AAAGGATATT CCCATACCGC TTAAATGCGC AAGAGCGAGC GCTTCTTGGA GGCATGAGAG GAGTTGGCTT TGCCGCACTT CTTGGCATAG ATGACTTTAG ACTTGATGCC TTTGCGACCG CACTTCACGC AAGCTTAGTT CAAAAAAAGT ATCCGCACGC CGAGATCGCA TTTTCATGCC CAAGACTTCG CCCTATCATA AACAACGACC GCATCAATCC ACGCGACGTG GGCGAGCGCG AGCTTTTGCA AGTGATCTGT GCTTATAGAA TTTTCATGCC AACAGCTAGC ATAACGATCT CAACTAGAGA AAAGGCGAAA TTTCGCGACA ACGCCGTAAA GATCGCCGCA AATAAGATAA GCGCTGGCGT AAAAGTGAGC ATCGGCGCTC ACGGCGAAGA GAAAAAGGGC GACGAGCAGT TTGAGATAAG TGATAGCAGA AGCGTGGATG AGATAAAAGC AATGATAAAA GCAAACGGCC TAGAGCCCTT GATGAGTGAG TATGTCTATG TTTAA
Protein sequence | MKFTRNDHMK LLPHMQDVGS DIMDEILKEC ASYKPEIYSE ADVKAALNAK HCSLENLKAL LSPAAAPFLE PIAQLAQAKT RANFGSNITL FTPLYIANYC DNLCVYCGFN AKNNIKRAKL SDEEITRELR EISKSGLEEI LILTGESETN SSVAYIANAC ALAKKFFKVV GVEIYPLNSE GYALLHKSGA DYVTVFQETY NPTKYEKIHL GGNKRIFPYR LNAQERALLG GMRGVGFAAL LGIDDFRLDA FATALHASLV QKKYPHAEIA FSCPRLRPII NNDRINPRDV GERELLQVIC AYRIFMPTAS ITISTREKAK FRDNAVKIAA NKISAGVKVS IGAHGEEKKG DEQFEISDSR SVDEIKAMIK ANGLEPLMSE YVYV
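
The sequence fields can be checked in a similar way. The sketch below strips the spaces between 10-base blocks, computes the GC fraction, and translates the gene sequence codon by codon. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
import re

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return re.sub(r"\s+", "", seq).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGAAATTTA CAAGAAACGA CCACATGAAG"))  # MKFTRNDHMK
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields; the results of that comparison are not asserted here.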