Gene Information |
Locus tag | CCV52592_1868 |
Symbol | thiH |
ID | 5407043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Campylobacter curvus 525.92 |
Kingdom | Bacteria |
Replicon accession | NC_009715 |
Strand | + |
Start bp | 1006226 |
End bp | 1007380 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640872446 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001408265 |
Protein GI | 154174598 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.386281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTA GCAAAACCGA CCATATGACC TTGCTGCCGC ATATGCAAGA CATCGGTGAT GAGATAATGA ACGAAATTTT GCAAGAAAGA GCAAAATTTA AGCCAGAAAA TTTTACCGCC GAGGACGTGA GAGCCGCTTT AAACGCTAAA ATTTGCACGT TAGAAAATTT TAAAGCTCTG CTTAGCCCGG CGGCTAGCGA TTTTATAGAA GAGATCGCGC ACCTTAGCAT GCAAAAGACG CGAGCGAATT TCGGAGCCAA CATCAACCTC TTCACACCCC TTTACATAGC CAACCACTGC GACAACCTAT GCGTCTACTG CGGCTTTAAC TCGCAAAATA AGATAAAGCG AGCCAAGCTT GATGAGGATG AAATTTTAAG CGAACTTGCA GAAATTTCAA AAAGCGGGCT TGAGGAAATT TTGATATTAA CGGGCGAGAG CGAGGAAAAT TCGAGCGTCG CTTATATCGC GAGGGCTTGC ACCTTGGCAA AGCGCTATTT TAAAGTAGTC GGCGTCGAAG TCTATCCGCT AAATTCCAGC GACTATGCCC TGCTTCATGA AAGCGGCGTC GACTATGTCA CGGTTTTTCA AGAGACCTAC AACCCCGCAA AATACGAGCG CTTACACCTT GCGGGCAACA AGCGGATATT CCCCTACCGT CTAAACGCGC AAGAACGTGC ATTGATGGGT GGCATGAGGG GTGTGGGCTT TGCCGCGCTT TTGGGACTTG ACGACTTTAG ACTAGACGCC TTTTCGACGG GACTTCACGC CTCGCTCGTG CAAAAGAAAT ATCCGCACGC AGAGATCGCC TTTTCCTGCC CGAGACTGCG CCCTATCATC AACAACTCCC GTATAAATCC GCGCGATGTG CACGAGAGGG AACTTTTGCA AGTCATTTGC GCTTATAGGA TATTCATGCC AACAGCCAGC ATCACGATCT CCACGCGTGA GCGAGCGCTC TTTCGCGATA ACGCCATAAA GATCGCCGCA AATAAGATCA GCGCCGGCGT AAATGTCGGC ATCGGTGCGC ACTCGAAAGA AAAAAAGGGC GACGAGCAGT TTGAGATCGA GGACGCTCGC TGCGTAGATG AGATCTACAA CATGATAAAA GCCCAAGGGC TCGAACCGCT GATGAGCGAA TACATCTATG TTTAA
|
Protein sequence | MKFSKTDHMT LLPHMQDIGD EIMNEILQER AKFKPENFTA EDVRAALNAK ICTLENFKAL LSPAASDFIE EIAHLSMQKT RANFGANINL FTPLYIANHC DNLCVYCGFN SQNKIKRAKL DEDEILSELA EISKSGLEEI LILTGESEEN SSVAYIARAC TLAKRYFKVV GVEVYPLNSS DYALLHESGV DYVTVFQETY NPAKYERLHL AGNKRIFPYR LNAQERALMG GMRGVGFAAL LGLDDFRLDA FSTGLHASLV QKKYPHAEIA FSCPRLRPII NNSRINPRDV HERELLQVIC AYRIFMPTAS ITISTRERAL FRDNAIKIAA NKISAGVNVG IGAHSKEKKG DEQFEIEDAR CVDEIYNMIK AQGLEPLMSE YIYV
|
| |
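The fields in this record can be cross-checked against each other. The sketch below is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the 10-base spacing in the sequence fields is formatting only. The helper names (`gene_length`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in the permitted start codons, so a standard codon table is used here.

```python
from itertools import product

# Codon table in NCBI "TCAG" order. Table 11 (bacterial) assigns the same
# amino acids as the standard code; only the allowed start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the whitespace that groups the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the record: 1007380 - 1006226 + 1 = 1155 bp.
length = gene_length(1006226, 1007380)
print(length)            # 1155
# 1155 / 3 = 385 codons; the last one is the stop codon, leaving 384 aa.
print(length // 3 - 1)   # 384
# First three 10-base blocks of the gene sequence.
print(translate("ATGAAATTTA GCAAAACCGA CCATATGACC"))  # MKFSKTDHMT
```

Passing the full Gene sequence value to `translate` and comparing the result with the Protein sequence value (after `clean`) gives an end-to-end check, and `gc_fraction` on the same input can be compared with the GC content field.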