Gene Pcar_0340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcar_0340 
SymbolthiH 
ID3723473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelobacter carbinolicus DSM 2380 
KingdomBacteria 
Replicon accessionNC_007498 
Strand
Start bp425109 
End bp426233 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content62% 
IMG OID637749924 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_355770 
Protein GI77917955 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones66 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATTTTC TCGACGAATT CAACAGCTAC GATCGCGGCG AGCTTGCGCA ACGGATTATG 
TCATGCCGGG CTGCCGATGT GGAACGAGCG CTGACGGCGG AACATCTGCG CAGTGCCGAT
TTTATGGCGC TGCTGTCGCC GGCGGCGCAC GGTTACCTGG AGTCGATGGC ACAAAAAGCC
CACCGTCTGA CGCAGCAGCG TTTCGGCAAA ACCATCCAGC TCTTTGCGCC GCTGTACATC
TCCAACGAAT GCAGCAACGG CTGCCTGTAC TGCGGCTTCA ACGCCGCCAA CAAGGTCGCG
CGGCGCACCT TGAGCCTGGA CGAAGTCGAA GCCGAGGCCC GCATCCTGCG CCAGCGCGGT
TTCCGTCATG TGCAGATTCT TACCGGTGAA GCTCCCCGGG CTGTGGATAA CGATATGCTG
GCAGCGGTGG TCCGCCGGAT TCGGCCATTG TTTTCCGCCA TCAGCATCGA AGTCTATCCC
ATGGAAGAAG CCGGCTACCG ACAGATGGTC GATGCCGGCG TCGACAACCT GACCGTTTAT
CAGGAGACGT ACGATCGCGA TCTGTACGAC AAGCTGCATC CCTTCGGTCG TAAAAAGGAT
TTTGACTGGC GTCTGACCAC TCCCGACCGC GGCGGTGCGG CGGGACTGCG TTCCATCGGT
ATCGGTGCCT TGTTGGGGCT GAGCGACTGG CGCGTCGAAG GTGTGCTGGT CGGGTTGCAC
GCGCGACACC TGGCGCGTAC CTGGTGGCGC AGTCGGGTGA ATGTATCCTT TCCGCGCATG
CGGCCCGCCG GGGGCGGGTT CAATCCGCTG GCGCCGGTAT CCGACAGTGC CCTGGTGCAA
CTGATCTGCG CGCTGCGACT GTTGATACCC GATGCCGGGC TGGTGCTGTC GACCCGCGAA
AGCTCCAGTT TGCGCGATCA TCTGCTGCCT TTGGGTATCA CCCAGCTGAG TGCCGGCTCC
TGTACTGCCC CGGGCGGGTA TGGCGACGAG GGGCACGGTA GCGAGCAGTT TGCCATTGAC
GACGACCGCG ACGCCGAACA GGTTTGCGCC ATGCTGCGCG CCCAGGGATA TGAGCCGGTA
TGGAAGGATT GGGATCGCAC CTTTATGGAT CGGCAGGCCG TTTGA
 
Protein sequence
MNFLDEFNSY DRGELAQRIM SCRAADVERA LTAEHLRSAD FMALLSPAAH GYLESMAQKA 
HRLTQQRFGK TIQLFAPLYI SNECSNGCLY CGFNAANKVA RRTLSLDEVE AEARILRQRG
FRHVQILTGE APRAVDNDML AAVVRRIRPL FSAISIEVYP MEEAGYRQMV DAGVDNLTVY
QETYDRDLYD KLHPFGRKKD FDWRLTTPDR GGAAGLRSIG IGALLGLSDW RVEGVLVGLH
ARHLARTWWR SRVNVSFPRM RPAGGGFNPL APVSDSALVQ LICALRLLIP DAGLVLSTRE
SSSLRDHLLP LGITQLSAGS CTAPGGYGDE GHGSEQFAID DDRDAEQVCA MLRAQGYEPV
WKDWDRTFMD RQAV