Gene Cla_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCla_1147 
SymbolthiH 
ID7410881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCampylobacter lari RM2100 
KingdomBacteria 
Replicon accessionNC_012039 
Strand
Start bp1096142 
End bp1097275 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content34% 
IMG OID643718273 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002575720 
Protein GI222824146 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0000413421 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAT ATCCTCATAT GCAAAGCATT GAAAGTGAAA TTTTAACTAA GGCTTTAAAA 
GAAGTTGAAG AATTTGATGA AAGTAAATTT AGCGCTTTTG ATGTAAAGCA AGCTTTAGTT
AAAGATTATC TTAACTTAGA TGATTTAAAA GCCTTGCTTT CAAGTGCAGC AGAAGATTTC
ATAGAAGATT TGGCACAAAA ATCAAGCTAT GTTACTCGAA GACACTTTGG AAATTCCATC
TCTCTTTTTA CGCCTTTATA TCTTTCTAAT TTTTGCAATA GTAAATGTAC TTATTGTGGA
TTTCAAAAAG GAAATAATAT TAAAAGAGCT AAGCTAAATG AGGCCGAAAT TCACAAAGAA
ATGCAAGAGA TTAAAAAAAG TGGTTTAGAA GAAATTTTGC TTCTAACAGG CGAAGGTAGG
GAGTATGCTA GCGTAGAATA CCTTGCTAAA GCTTGTGAGA TAGCAAAAGA GTATTTTAAA
GTTGTGGGGA TTGAAGTTTA TCCTATGAAT ATAGATGAGT ATGCATTGCT TCATGAAAAG
GGTTGTGAGT ATGTAAGTGT TTATCAAGAA ACTTATAATC AAAAAACTTA TTCTAAAATT
CATATCGAAG GTGAAAAAAG TGTATTTGAG TATCGCTTTT ATGCACAAGA AAGGGCTTTG
AAAGCAGGTA TGAGAGGAGT GGCTTTTGGA GCTTTACTTG GCGTGGATGA TTTTAGGAAA
GATGCTTTTG CTACGGCTTT ACATGCGTAT TTTTTACAGC AAAAATACCC TCATGCTGAA
ATAGCTTTAT CTATCCCTAG GCTAAGACCT ATTATCAATA ATAAAAAAAT TCATCCAAAA
GATGTAAGTG AAACAAGACT TTTGCAAGTT TTGTGTGCTT ATAGATTGTT TTTACCTTTT
GCAAGTATTA CGATTTCTAC GCGTGAAAGA GCAGGATTTA GAGACGGAGT TATTAAACTA
GGAGCTACTA AAATGAGTGC TGGAGTGAGT GTGGGAGTGG GTGAGCATCA AGGTGATAAA
AAAGGTGATG ATCAGTTTCA AATTAGTGAT ATGCGTAGTG TTGATGAGGT TTTGGCTATG
TTGAAAAATG CAAATTTACA AGCTGTAATG AGCGATAGTA TTTATGTGGG GTAA
 
Protein sequence
MQKYPHMQSI ESEILTKALK EVEEFDESKF SAFDVKQALV KDYLNLDDLK ALLSSAAEDF 
IEDLAQKSSY VTRRHFGNSI SLFTPLYLSN FCNSKCTYCG FQKGNNIKRA KLNEAEIHKE
MQEIKKSGLE EILLLTGEGR EYASVEYLAK ACEIAKEYFK VVGIEVYPMN IDEYALLHEK
GCEYVSVYQE TYNQKTYSKI HIEGEKSVFE YRFYAQERAL KAGMRGVAFG ALLGVDDFRK
DAFATALHAY FLQQKYPHAE IALSIPRLRP IINNKKIHPK DVSETRLLQV LCAYRLFLPF
ASITISTRER AGFRDGVIKL GATKMSAGVS VGVGEHQGDK KGDDQFQISD MRSVDEVLAM
LKNANLQAVM SDSIYVG