Gene CCC13826_1278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCCC13826_1278 
SymbolthiH 
ID5596332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCampylobacter concisus 13826 
KingdomBacteria 
Replicon accessionNC_009802 
Strand
Start bp1013843 
End bp1014997 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content45% 
IMG OID640929573 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001466864 
Protein GI157165154 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.173544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTA CAAGAAACGA CCACATGAAG CTGCTACCTC ACATGCAGGA TGTTGGCAGC 
GACATTATGG ATGAGATTTT AAAAGAGTGC GCAAGCTACA AGCCAGAAAT TTACAGCGAA
GCAGACGTAA AAGCAGCTCT TAATGCAAAG CACTGCTCGC TTGAAAATTT AAAAGCCCTG
CTCTCGCCTG CTGCAGCACC ATTTTTAGAG CCAATAGCCC AGCTTGCTCA AGCAAAAACA
AGGGCAAATT TTGGCTCAAA CATAACGCTT TTTACCCCGC TTTACATAGC AAACTACTGC
GATAATCTCT GCGTTTATTG CGGTTTTAAC GCTAAAAATA ATATAAAAAG AGCAAAGCTG
AGCGACGAGG AGATCACAAG AGAGTTAAGA GAAATTTCAA AGAGTGGCTT AGAGGAAATT
TTAATTCTAA CTGGCGAGAG CGAGACAAAC TCAAGTGTCG CTTACATCGC AAACGCCTGC
GCTTTGGCAA AGAAATTTTT TAAAGTCGTT GGAGTTGAAA TTTACCCATT AAACTCAGAG
GGCTACGCCC TGCTTCACAA AAGTGGCGCA GACTACGTGA CCGTTTTTCA AGAGACCTAC
AATCCCACAA AATACGAAAA AATCCACCTT GGCGGCAATA AAAGGATATT CCCATACCGC
TTAAATGCGC AAGAGCGAGC GCTTCTTGGA GGCATGAGAG GAGTTGGCTT TGCCGCACTT
CTTGGCATAG ATGACTTTAG ACTTGATGCC TTTGCGACCG CACTTCACGC AAGCTTAGTT
CAAAAAAAGT ATCCGCACGC CGAGATCGCA TTTTCATGCC CAAGACTTCG CCCTATCATA
AACAACGACC GCATCAATCC ACGCGACGTG GGCGAGCGCG AGCTTTTGCA AGTGATCTGT
GCTTATAGAA TTTTCATGCC AACAGCTAGC ATAACGATCT CAACTAGAGA AAAGGCGAAA
TTTCGCGACA ACGCCGTAAA GATCGCCGCA AATAAGATAA GCGCTGGCGT AAAAGTGAGC
ATCGGCGCTC ACGGCGAAGA GAAAAAGGGC GACGAGCAGT TTGAGATAAG TGATAGCAGA
AGCGTGGATG AGATAAAAGC AATGATAAAA GCAAACGGCC TAGAGCCCTT GATGAGTGAG
TATGTCTATG TTTAA
 
Protein sequence
MKFTRNDHMK LLPHMQDVGS DIMDEILKEC ASYKPEIYSE ADVKAALNAK HCSLENLKAL 
LSPAAAPFLE PIAQLAQAKT RANFGSNITL FTPLYIANYC DNLCVYCGFN AKNNIKRAKL
SDEEITRELR EISKSGLEEI LILTGESETN SSVAYIANAC ALAKKFFKVV GVEIYPLNSE
GYALLHKSGA DYVTVFQETY NPTKYEKIHL GGNKRIFPYR LNAQERALLG GMRGVGFAAL
LGIDDFRLDA FATALHASLV QKKYPHAEIA FSCPRLRPII NNDRINPRDV GERELLQVIC
AYRIFMPTAS ITISTREKAK FRDNAVKIAA NKISAGVKVS IGAHGEEKKG DEQFEISDSR
SVDEIKAMIK ANGLEPLMSE YVYV