Gene Pcar_0608 details

Gene Information

Locus tag: Pcar_0608
Symbol: thiH
ID: 3724062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelobacter carbinolicus DSM 2380
Kingdom: Bacteria
Replicon accession: NC_007498
Strand:
Start bp: 739855
End bp: 740967
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 61%
IMG OID: 637750193
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_356037
Protein GI: 77918222
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
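
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue; the function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(739855, 740967)
print(length, protein_length_aa(length))  # 1113 370
```

Both values match the Gene Length (1113 bp) and Protein Length (370 aa) fields of this record.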


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value: 0.499964
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATTTTC TCGACGAATT CAACAGCTAC GATCGCAGCG AGCTTGCGGA ACGGATCATG 
TCATGCCAGG CCGCCGATGT GGAACGGGCG CTGACGGCGG AACATCTGCG AAGTGCCGAT
TTCATGGCGC TGCTGTCGCC GATGGCGCAC GGTTACCTGG AGTCGATGGC ACAAAAAGCC
CACCGTCTGA CCCAGCAGCG TTTCGGCAAG ATCATCCAGC TCTATGCGCC GCTGTACATC
TCCAACGAAT GCAGCAACGG TTGTCTGTAC TGCGGCTTCA ACGCCGCCAA CAAGGTCGCG
CGGCGCACCT TGAGCCTGGA CGAAGTCGAA GCCGAGGCCC GCATCCTGCG CCAGCGCGGT
TTCCGCCATG TGCAGTTACT GACCGGCGAG GCACCGCAGG CGGTCGATGT CGATTTTCTG
GAAAATGTTG TCAAACGCGT ACGACCGTTT TTTTCTTCCA TCAGCATCGA AGTCTTCCCC
ATGGACGAGG CCGGTTATCG CCAACTGGTG GCGGCCGGCG TCGACAACCT GACCGTCTAT
CAGGAGACGT ACGACCGTGA CCTGTACGAC AAACTGCATC CCTTCGGTCG CAAGAAAGAT
TTCAATTGGC GACTGACCAC TCCCGATCGT GGCGGCGCGG CGGGACTGCG CTCGATTGGC
ATCGGTAGCC TGCTGGGGCT GAGTGACTGG CGCATCGAGG GCTACCTGGT CGGCATGCAT
GCCCGCCACC TGGTACGCAC CTGGTGGCGC AGCCGGGTGA ATGTATCCTT CCCGCGCATG
CGACCTGCCG ACGGCGGTTT TCAGCCACCG AAACCGGTAT CCGACAGTGC CCTGGTGCAA
CTGATCTGCG CGCTGCGGCT GTTGATACCC GACGCCGGAC TGGTACTGTC GACGCGCGAA
AGCGCCAGTT TGCGCGATCA TCTGCTGCCT TTGGGTATCA CCCAGCTGAG TGCCGGGTCC
AGCACCGCGC CAGGCGGATA CGGACATCAG CAGGATGGCA GCGAACAGTT TGCTATCGAC
GATGATCGTA ACGCCGAACA GATTTGCGCC ATGCTGCGCG CCCAGGGATA CGAGCCGGTA
TGGAAGGACT GGGACGGCGC CTTTGTGAAA TAG
 
Protein sequence
MNFLDEFNSY DRSELAERIM SCQAADVERA LTAEHLRSAD FMALLSPMAH GYLESMAQKA 
HRLTQQRFGK IIQLYAPLYI SNECSNGCLY CGFNAANKVA RRTLSLDEVE AEARILRQRG
FRHVQLLTGE APQAVDVDFL ENVVKRVRPF FSSISIEVFP MDEAGYRQLV AAGVDNLTVY
QETYDRDLYD KLHPFGRKKD FNWRLTTPDR GGAAGLRSIG IGSLLGLSDW RIEGYLVGMH
ARHLVRTWWR SRVNVSFPRM RPADGGFQPP KPVSDSALVQ LICALRLLIP DAGLVLSTRE
SASLRDHLLP LGITQLSAGS STAPGGYGHQ QDGSEQFAID DDRNAEQICA MLRAQGYEPV
WKDWDGAFVK
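
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration rather than the pipeline that produced this record. It assumes the listed gene sequence is the coding strand read 5' to 3', uses the standard codon assignments (translation table 11 shares them and differs only in which codons may act as starts), and reads the first codon as methionine without checking that it is a permitted start codon. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order. Translation table 11 uses
# the same codon-to-residue mapping; it differs in the set of start codons.
BASES = "TCAG"
RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), RESIDUES)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna):
    """Translate a CDS up to the first stop codon.

    Simplification: the first codon is always emitted as M, as for a
    bacterial initiator codon such as GTG; it is not validated.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAATTTTC TCGACGAATT CAACAGCTAC"))  # MNFLDEFNSY
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.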