Gene CCV52592_1868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCCV52592_1868 
SymbolthiH 
ID5407043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCampylobacter curvus 525.92 
KingdomBacteria 
Replicon accessionNC_009715 
Strand
Start bp1006226 
End bp1007380 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content49% 
IMG OID640872446 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001408265 
Protein GI154174598 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.386281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTA GCAAAACCGA CCATATGACC TTGCTGCCGC ATATGCAAGA CATCGGTGAT 
GAGATAATGA ACGAAATTTT GCAAGAAAGA GCAAAATTTA AGCCAGAAAA TTTTACCGCC
GAGGACGTGA GAGCCGCTTT AAACGCTAAA ATTTGCACGT TAGAAAATTT TAAAGCTCTG
CTTAGCCCGG CGGCTAGCGA TTTTATAGAA GAGATCGCGC ACCTTAGCAT GCAAAAGACG
CGAGCGAATT TCGGAGCCAA CATCAACCTC TTCACACCCC TTTACATAGC CAACCACTGC
GACAACCTAT GCGTCTACTG CGGCTTTAAC TCGCAAAATA AGATAAAGCG AGCCAAGCTT
GATGAGGATG AAATTTTAAG CGAACTTGCA GAAATTTCAA AAAGCGGGCT TGAGGAAATT
TTGATATTAA CGGGCGAGAG CGAGGAAAAT TCGAGCGTCG CTTATATCGC GAGGGCTTGC
ACCTTGGCAA AGCGCTATTT TAAAGTAGTC GGCGTCGAAG TCTATCCGCT AAATTCCAGC
GACTATGCCC TGCTTCATGA AAGCGGCGTC GACTATGTCA CGGTTTTTCA AGAGACCTAC
AACCCCGCAA AATACGAGCG CTTACACCTT GCGGGCAACA AGCGGATATT CCCCTACCGT
CTAAACGCGC AAGAACGTGC ATTGATGGGT GGCATGAGGG GTGTGGGCTT TGCCGCGCTT
TTGGGACTTG ACGACTTTAG ACTAGACGCC TTTTCGACGG GACTTCACGC CTCGCTCGTG
CAAAAGAAAT ATCCGCACGC AGAGATCGCC TTTTCCTGCC CGAGACTGCG CCCTATCATC
AACAACTCCC GTATAAATCC GCGCGATGTG CACGAGAGGG AACTTTTGCA AGTCATTTGC
GCTTATAGGA TATTCATGCC AACAGCCAGC ATCACGATCT CCACGCGTGA GCGAGCGCTC
TTTCGCGATA ACGCCATAAA GATCGCCGCA AATAAGATCA GCGCCGGCGT AAATGTCGGC
ATCGGTGCGC ACTCGAAAGA AAAAAAGGGC GACGAGCAGT TTGAGATCGA GGACGCTCGC
TGCGTAGATG AGATCTACAA CATGATAAAA GCCCAAGGGC TCGAACCGCT GATGAGCGAA
TACATCTATG TTTAA
 
Protein sequence
MKFSKTDHMT LLPHMQDIGD EIMNEILQER AKFKPENFTA EDVRAALNAK ICTLENFKAL 
LSPAASDFIE EIAHLSMQKT RANFGANINL FTPLYIANHC DNLCVYCGFN SQNKIKRAKL
DEDEILSELA EISKSGLEEI LILTGESEEN SSVAYIARAC TLAKRYFKVV GVEVYPLNSS
DYALLHESGV DYVTVFQETY NPAKYERLHL AGNKRIFPYR LNAQERALMG GMRGVGFAAL
LGLDDFRLDA FSTGLHASLV QKKYPHAEIA FSCPRLRPII NNSRINPRDV HERELLQVIC
AYRIFMPTAS ITISTRERAL FRDNAIKIAA NKISAGVNVG IGAHSKEKKG DEQFEIEDAR
CVDEIYNMIK AQGLEPLMSE YIYV