Gene Pmob_1864 details


Gene Information

Locus tag: Pmob_1864
Symbol: thiH
ID: 5756947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Petrotoga mobilis SJ95
Kingdom: Bacteria
Replicon accession: NC_010003
Strand:
Start bp: 2053909
End bp: 2055324
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 35%
IMG OID: 641303060
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001568876
Protein GI: 160903295
COG category: [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
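
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2053909
end_bp = 2055324

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last of which is a stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1416 bp
print(protein_length, "aa")  # 471 aa
```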


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.804214
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTTGGA TAAGAGATAA GGAGAACCAA AAACCATTCA TAAAAGAAGA TGAAATATTT 
AACATATTGG AAGGAACAAA ATCCCCAAGT AAATTAAAAG TCAGAGATAT TATTCAAAAA
TCTTTATCAA AAGAAAGATT GAATCCAGAC GAAGTAGCAA CACTTTTAAA CGTTGAAGAT
GATGAAACGT TAGAAGAGAT TTTTGAAGGA GCAAGAACAT TAAAAAGAAA TGTATATGGG
AATAGAATAG TCTTTTTTGC TCCTCTTTAT ATTGGAAACA AATGTATAAA CAATTGTGAG
TACTGTGGTT TTAGATCAAG CAACACGGAA ATTTATAGAA ACTCTCTTAG TTTTGAACAA
TTAGAGAAAG AGGTGAAAGC ACTTGAAGAC AAAGGCCATA AAAGATTAAT ATTAGTTTAT
GGCGAACATC CTGATTACGA TGCAGATTTC ATAGCCAAAA CCGTTGAAAC GGTATACAAA
ACTAAAAACA GAAACGGTGA GATCAGAAGG GTGAATATCA ATGCTGCCCC TCAGACAATT
GATGATTACA AAAAAATAAA AGAAGTTGGA ATAGGAACAT TTCAAATTTT TCAAGAAACT
TATCATTTTG ATACTTACAA GAAAGTTCAT CCAAAAGGGC CTAAATCTAG TTATATATGG
AGGCTATATG GATTAGACAG AGCTGTAGCT GCTGGGATTG ACGATGTTGG GATCGGTGCT
TTATTTGGAC TTTACGATTA TAAATTCGAG GTCATGGGCC TTTTATACCA TACAATACAT
CTTGAAGAAC GCTTTGGATT TGGTCCCCAT ACAATTTCAT TTCCACGAAT AGAACCAGCT
CTAAACACCC CTTTATCTGA GCAACCTCCA TACCTTGTGA ATAATAATGA ATTCCAAAAG
ATAGTGGCGA TTTTAAGATT AGCTGTACCC TACACGGGAT TAATTTTAAC AGCCAGAGAA
CCTTCCCATA TAAGAAACGA GGTTTTAAAG TTAGGAGTTT CACAAATAGA TGCTGGTTCT
AATATTGGAA TTGGTGCATA TTCAACAGAA GATCAACAAG CTTATAAGAA AAGTCAATTC
ACCTTAGGTG ACCAAAGGAG TTTAGACACT GTAATAAACG AATTAGCAAT CGAAGGTTAT
CTTCCTTCAT TTTGTACCGC ATGCTATCGT ATGGGGAGAA CTGGCGAGCA CTTCATGGAG
TTTGCAATAC CTGGATTTGT GAAGAGGTTT TGCACTCCCA ACGCCATTTT AACTCTTTTA
GAATATGCCC AGGATTATGC CCCAGAAAAT ACTAGGATAT CTATCGAAAA GAGGATCGAA
GAAGAGTTAA AGGTTATGAA TGAAGGGCCT CTAAAAGAGA AATTATTAGA AAGAATGGAT
CTTGTTAAGG CTGGAAAAAG AGATCTATAC TTTTAA
 
Protein sequence
MFWIRDKENQ KPFIKEDEIF NILEGTKSPS KLKVRDIIQK SLSKERLNPD EVATLLNVED 
DETLEEIFEG ARTLKRNVYG NRIVFFAPLY IGNKCINNCE YCGFRSSNTE IYRNSLSFEQ
LEKEVKALED KGHKRLILVY GEHPDYDADF IAKTVETVYK TKNRNGEIRR VNINAAPQTI
DDYKKIKEVG IGTFQIFQET YHFDTYKKVH PKGPKSSYIW RLYGLDRAVA AGIDDVGIGA
LFGLYDYKFE VMGLLYHTIH LEERFGFGPH TISFPRIEPA LNTPLSEQPP YLVNNNEFQK
IVAILRLAVP YTGLILTARE PSHIRNEVLK LGVSQIDAGS NIGIGAYSTE DQQAYKKSQF
TLGDQRSLDT VINELAIEGY LPSFCTACYR MGRTGEHFME FAIPGFVKRF CTPNAILTLL
EYAQDYAPEN TRISIEKRIE EELKVMNEGP LKEKLLERMD LVKAGKRDLY F
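
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such text could be processed, the Python below joins the blocks, computes a GC fraction, and translates a coding sequence. It assumes the input begins at the annotated start codon, so the first codon is read as Met (under translation table 11, alternative start codons such as GTG are permitted). The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
# Standard codon assignments in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid mapping; it differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: seq starts at the annotated start codon, so the first
    residue is reported as M even when the codon is GTG or TTG.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append("M" if i == 0 else residue)
    return "".join(protein)


# Sample input: the first line of the gene sequence listed above.
sample = clean("GTGTTTTGGA TAAGAGATAA GGAGAACCAA AAACCATTCA TAAAAGAAGA TGAAATATTT")
print(len(sample))
print(round(gc_fraction(sample), 3))
print(translate_cds(sample))  # MFWIRDKENQKPFIKEDEIF
```

The translated sample matches the first 20 residues of the protein sequence in the record. Running the same functions on the complete gene sequence would allow the GC content and protein length fields to be compared against the listed values.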