Gene Ppro_1492 details


Gene Information

Locus tag: Ppro_1492
Symbol: thiH
ID: 4572167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelobacter propionicus DSM 2379
Kingdom: Bacteria
Replicon accession: NC_008609
Strand:
Start bp: 1600358
End bp: 1601779
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 60%
IMG OID: 639755537
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_901166
Protein GI: 118579916
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
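
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon not counted in the protein length; the record itself states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1600358
end_bp = 1601779
gene_length_bp = 1422
protein_length_aa = 473

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match the gene length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match the protein
```

Under these assumptions all three checks print True for the values listed here.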


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCCC TGCCATCCAT GAAACTGAGC GCAGGCGCCG CCGACTTCAT CGACGAGGCG 
TATCTGTTCG GCCTTCTGGG AGGCAAGCGT CCCGAAGCGG CACGGGTTCG CGACATAATC
GCCAAGAGCC TTGAAAAGCA GGCACTGGGT GTGGAGGAAA CCGCGCAGCT TCTGCTGGCG
GATACGCCTG AGCTGGTCGA GGAGATCTTT GCCGCGGCCC GCGAACTGAA AGAAAAGGTG
TACGGAAACA GGATCGTCCT GTTCGCCCCG CTCTACATCG GGAATAAATG TGTGAACGAC
TGCGCCTACT GCGCCTTCAA GCGGAGCAAT GCCAGCGCAA TCCGGCGCAC GCTGACCGAA
GGCGAGATCC GCGGGCAGGT CGAAGCCCTG GAAAGCAAGG GGCACAAACG GCTGATCCTG
GTGTTCGGCG AACACCAGGC CTATGATGCC GAGTTCATCG CCCAGAGCGT CAGGACCGTC
TACGACACCA GGGGAAAGGG TGAGATCAGG CGGGTGAACA TCAATGCCGC CCCCCTTGAC
CGGGAGGGGT ATGCCATTGT CAAGGCCGCC GGGATCGGCA CCTACCAGAT CTTTCACGAA
ACATACCATC ACGAAACCTA TGCCCGCCTG CATCCGCGGG GCACCCGCAA GGCAAACTAC
CTCTACCGAC TGGACGGACT CAACCGGGCC TTTGAGGCCG GCTGCGACGA TGTGGGGCTG
GGGGTGCTCT TTGGTCTTGT GGATTGGAGA TTCGAGGTGC TGGGCCTGGT CAGCCACGCC
CTTCACCTGC AGCGGCGCTA CAATGTGGGT CCCCACACCA TCAGTTTTCC GCGCCTGCGG
CCTGCCGCGG GTGTTGCGCT GGACAGGGAG CTCTTGGTTG GCGACCAGGA ATTCAAGCGG
ATCATCGCCA TACTCCGTCT GGCTGTGCCC TATACCGGGC TCATCCTGAC CGCCCGGGAA
AATGCCCAGG TACGCCGGGA ATGCATCTCC TTAGGCGTTT CCCAGATCGA CGCCGGCAGC
CGCATCGAGC TGGGCGGCTA TACCGAGGCC GGGGATGCTC AGGTGATGGA GCGGGAACAG
TTCTCACTTG GCGACGTGCG TTCCCTGGAC GAGGTCATGG CCGAACTGAT GGGAGCTGGC
TATATTCCCA GTTTCTGCAC CTCCTGCTAC CGGGTCGGCA GAACCGGTGA ACATTTCATG
GAATTCAGCA TCCCCGGCTT CATAAAGGAA TTCTGCACGC CCAATGCCCT GCTGACGCTG
CAGGAATACT TGCTGGACTA TGCGTCACCG TCCACCAGGG AAATCGGAGA CAGGCTGATA
GCGGAAGAAC TGGCCAAGCT GGTCGATGGC CATGGTAAAC AGGGCGTGGT GCAGAGGCTC
AAAGAAATTA GGGAGGGCGC ACAACGCGAC CACTGTTTCT GA
 
Protein sequence
MSSLPSMKLS AGAADFIDEA YLFGLLGGKR PEAARVRDII AKSLEKQALG VEETAQLLLA 
DTPELVEEIF AAARELKEKV YGNRIVLFAP LYIGNKCVND CAYCAFKRSN ASAIRRTLTE
GEIRGQVEAL ESKGHKRLIL VFGEHQAYDA EFIAQSVRTV YDTRGKGEIR RVNINAAPLD
REGYAIVKAA GIGTYQIFHE TYHHETYARL HPRGTRKANY LYRLDGLNRA FEAGCDDVGL
GVLFGLVDWR FEVLGLVSHA LHLQRRYNVG PHTISFPRLR PAAGVALDRE LLVGDQEFKR
IIAILRLAVP YTGLILTARE NAQVRRECIS LGVSQIDAGS RIELGGYTEA GDAQVMEREQ
FSLGDVRSLD EVMAELMGAG YIPSFCTSCY RVGRTGEHFM EFSIPGFIKE FCTPNALLTL
QEYLLDYASP STREIGDRLI AEELAKLVDG HGKQGVVQRL KEIREGAQRD HCF
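
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the protein sequence could be recomputed from the gene sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 shares its amino acid assignments and differs only in which start codons are permitted, which does not matter here because this gene starts with ATG.

```python
# Hypothetical helpers for illustration; not part of the source record.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# Translation table 11 uses these same assignments for elongation.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Assumes the input contains only A, C, G and T.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, as printed in the record.
first_line = clean_sequence(
    "ATGAGTTCCC TGCCATCCAT GAAACTGAGC GCAGGCGCCG CCGACTTCAT CGACGAGGCG"
)
print(len(first_line))        # 60
print(translate(first_line))  # MSSLPSMKLSAGAADFIDEA
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value to compare against the reported GC content of 60%.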