Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cla_1147 |
Symbol | thiH |
ID | 7410881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Campylobacter lari RM2100 |
Kingdom | Bacteria |
Replicon accession | NC_012039 |
Strand | - |
Start bp | 1096142 |
End bp | 1097275 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643718273 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002575720 |
Protein GI | 222824146 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0000413421 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAT ATCCTCATAT GCAAAGCATT GAAAGTGAAA TTTTAACTAA GGCTTTAAAA GAAGTTGAAG AATTTGATGA AAGTAAATTT AGCGCTTTTG ATGTAAAGCA AGCTTTAGTT AAAGATTATC TTAACTTAGA TGATTTAAAA GCCTTGCTTT CAAGTGCAGC AGAAGATTTC ATAGAAGATT TGGCACAAAA ATCAAGCTAT GTTACTCGAA GACACTTTGG AAATTCCATC TCTCTTTTTA CGCCTTTATA TCTTTCTAAT TTTTGCAATA GTAAATGTAC TTATTGTGGA TTTCAAAAAG GAAATAATAT TAAAAGAGCT AAGCTAAATG AGGCCGAAAT TCACAAAGAA ATGCAAGAGA TTAAAAAAAG TGGTTTAGAA GAAATTTTGC TTCTAACAGG CGAAGGTAGG GAGTATGCTA GCGTAGAATA CCTTGCTAAA GCTTGTGAGA TAGCAAAAGA GTATTTTAAA GTTGTGGGGA TTGAAGTTTA TCCTATGAAT ATAGATGAGT ATGCATTGCT TCATGAAAAG GGTTGTGAGT ATGTAAGTGT TTATCAAGAA ACTTATAATC AAAAAACTTA TTCTAAAATT CATATCGAAG GTGAAAAAAG TGTATTTGAG TATCGCTTTT ATGCACAAGA AAGGGCTTTG AAAGCAGGTA TGAGAGGAGT GGCTTTTGGA GCTTTACTTG GCGTGGATGA TTTTAGGAAA GATGCTTTTG CTACGGCTTT ACATGCGTAT TTTTTACAGC AAAAATACCC TCATGCTGAA ATAGCTTTAT CTATCCCTAG GCTAAGACCT ATTATCAATA ATAAAAAAAT TCATCCAAAA GATGTAAGTG AAACAAGACT TTTGCAAGTT TTGTGTGCTT ATAGATTGTT TTTACCTTTT GCAAGTATTA CGATTTCTAC GCGTGAAAGA GCAGGATTTA GAGACGGAGT TATTAAACTA GGAGCTACTA AAATGAGTGC TGGAGTGAGT GTGGGAGTGG GTGAGCATCA AGGTGATAAA AAAGGTGATG ATCAGTTTCA AATTAGTGAT ATGCGTAGTG TTGATGAGGT TTTGGCTATG TTGAAAAATG CAAATTTACA AGCTGTAATG AGCGATAGTA TTTATGTGGG GTAA
|
Protein sequence | MQKYPHMQSI ESEILTKALK EVEEFDESKF SAFDVKQALV KDYLNLDDLK ALLSSAAEDF IEDLAQKSSY VTRRHFGNSI SLFTPLYLSN FCNSKCTYCG FQKGNNIKRA KLNEAEIHKE MQEIKKSGLE EILLLTGEGR EYASVEYLAK ACEIAKEYFK VVGIEVYPMN IDEYALLHEK GCEYVSVYQE TYNQKTYSKI HIEGEKSVFE YRFYAQERAL KAGMRGVAFG ALLGVDDFRK DAFATALHAY FLQQKYPHAE IALSIPRLRP IINNKKIHPK DVSETRLLQV LCAYRLFLPF ASITISTRER AGFRDGVIKL GATKMSAGVS VGVGEHQGDK KGDDQFQISD MRSVDEVLAM LKNANLQAVM SDSIYVG
|
| |