Gene CPF_1853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1853 
SymbolthiH 
ID4203863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2089404 
End bp2090507 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content31% 
IMG OID638082723 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_696287 
Protein GI110800869 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.287617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTTTTT ATGATGTAGT AGAAAAATAT AGAGATTTTG ATTTTTATGG ATATTTTGAT 
TCTGTAAAAA AGGAAGATGT ATTAAGAAGT ATTTATGAAA GAAATAAGAG ACCAGAGGAT
TTACTTAATT TAATATCTCC TATGGGAGAA GAGCTTTTAG AAGAAATGGC TCAAGAAGCA
AGAAATCTCT CTTTAAAATA TTTTGGAAGA ACAATATTAC TTTATACACC TATGTATATC
TCAAATTATT GTGTAAATAA GTGTTCATAT TGTGGGTATA ATGTAGAAAA TAAAATATGT
AGGAAAAAAT TAAATCAAGA AGAAATAGAA AAAGAGGGGA AAGCTATTTC AAAGGATGGA
TTTAAACATA TTCTAATATT AACAGGGGAA AGTGAATATC ATACTCCAGT AGAGTATATA
GAGGAGAGCA TTAAAACTTT GAAAGAGAAA TTTCCTTCCA TAACTATTGA AATATATCCA
ATGACAGAAG AGGGATATAA AAAGGTGGTA GAAGCAGGTG CTGAAGGGCT TACTGTATAT
CAAGAGACCT ATGATGAAAA AGTATATGAT AGGGTTCATG TGGCTGGTCC AAAGAAAAAT
TATAAATTCA GATTAGAAGC TCCAGAGAGA GGAGCAGAAG CTGGAATGAG AAGCATAAGT
ATAGGAGCCT TATTAGGATT AGCTGATTTT AGAATAGATG CCTTCTTTAC AGCAATGCAT
GGAAAATATT TAAGAGATAA GTATCCTCAT ATAGATATAA GTTATTCGGT GCCAAGAATA
AGACCCTGTG AAGGAGGGCT TAAAAAGTTA AATGAAGTTG ATGATAGGGA ACTAGTACAA
ATACTTTTAG CCTATAGACT ATTTGATCCT CAAGGAGGAA TAAATATATC TACTAGAGAA
GGAAAGGATT TTAGAAGAAA TTTAATTCCT TTAGGAGTAA GTAAAATTAG TGCTGGAGTT
TCTACTGAAG TTGGAGGCCA TTCTTTAAAA GAAAAAGGTA CAAGTCAATT TGATATAAAT
GATGAAAGTT CTGTAAGTGA AGTTAAGGAA TTAATAAAAA GTCAAGGTTA TCAACCTATA
TTTAAGGATT GGCATAGATT TTAA
 
Protein sequence
MSFYDVVEKY RDFDFYGYFD SVKKEDVLRS IYERNKRPED LLNLISPMGE ELLEEMAQEA 
RNLSLKYFGR TILLYTPMYI SNYCVNKCSY CGYNVENKIC RKKLNQEEIE KEGKAISKDG
FKHILILTGE SEYHTPVEYI EESIKTLKEK FPSITIEIYP MTEEGYKKVV EAGAEGLTVY
QETYDEKVYD RVHVAGPKKN YKFRLEAPER GAEAGMRSIS IGALLGLADF RIDAFFTAMH
GKYLRDKYPH IDISYSVPRI RPCEGGLKKL NEVDDRELVQ ILLAYRLFDP QGGINISTRE
GKDFRRNLIP LGVSKISAGV STEVGGHSLK EKGTSQFDIN DESSVSEVKE LIKSQGYQPI
FKDWHRF