Gene PC1_0211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPC1_0211 
SymbolthiH 
ID8131119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePectobacterium carotovorum subsp. carotovorum PC1 
KingdomBacteria 
Replicon accessionNC_012917 
Strand
Start bp251815 
End bp252936 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content55% 
IMG OID644863487 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_003015807 
Protein GI253686617 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCG ATTTTCAAAC CGTCTGGGAA CAGCTCGACT GGGATGACCT GACGCTACGC 
ATCAACGGCA AAACCGCACA GGATGTTGAA CGGGCGCTCA CTGCACCACA CCTGACGCAT
GACGATTTTA TGGCGCTCAT TTCACCTGCC GCCAGCGCCT ATCTGGAACC GCTTGCCCAG
CGGGCGCAGC AGCTCACCCG CCAGCGTTTC GGCAATACGG TGAGTTTCTA TGTCCCGCTG
TATTTGTCCA ATCTGTGCTC TAACGACTGT ACCTACTGCG GCTTTTCGAT GAGCAACCAC
ATCAAGCGTA AAACGCTGGA TGAGGCAGAG ATCTTGCGTG AATGCGCCGC TATCAAAGAA
CTCGGATTTG AGCACCTGCT GCTCGTCACG GGTGAACACC AGCGTAAAGT GGGGATGGAC
TATTTTCGGC GCGTTTTTCC ACTTATCCGG CCGCTTTTCA GTTCCCTGAT GATCGAAGTT
CAGCCGTTGT CGCAGGACGA GTACGCCGAA TTAAAAGCAC TGGGGCTGGA TGGTGTGATG
GTCTATCAGG AAACCTATCA TACGGCAACC TACCAACTGC ATCATCTAAA AGGACAAAAG
CAGGATTTCC ACTGGCGGCT TGCCACACCG GATCGGCTTG GCCGTGCCGG GATCGATAAG
ATCGGGCTAG GTGCCTTAAT CGGCCTGTCC AATAGCTGGC GTACCGACTG CTACATGGTG
GCAGAGCACC TGTTGCACTT GCAGCAGCAC TACTGGCAGA GCCGGTATTC TATCTCGTTC
CCTCGCCTGC GCCCCTGTGC GGGCGGCATT GAACCGGCGT CGTTAATGGA TGAAGCACAG
CTGATGCAGG TGATTTGCGC ATTCCGCTTG CTGGCGCCGG ATATTGAATT GTCGCTGTCC
ACACGTGAAT CACCGTTCTT TCGCGATCAT GCGATCCCTA TCGCGATTAA CAACGTCAGC
GCCTTCTCCA AAACCCAACC GGGTGGCTAC GCCGATGACC ATCCTGAACT GGAACAATTT
TCTCCCCACG ATTCACGGCG TCCTGAAGAC GTAGCGCAGG CCATCGTGCG TGCAGGTCTT
CAGCCAGTAT GGAAAGACTG GGACGGCTAT TTGGGCAGAT AA
 
Protein sequence
MSVDFQTVWE QLDWDDLTLR INGKTAQDVE RALTAPHLTH DDFMALISPA ASAYLEPLAQ 
RAQQLTRQRF GNTVSFYVPL YLSNLCSNDC TYCGFSMSNH IKRKTLDEAE ILRECAAIKE
LGFEHLLLVT GEHQRKVGMD YFRRVFPLIR PLFSSLMIEV QPLSQDEYAE LKALGLDGVM
VYQETYHTAT YQLHHLKGQK QDFHWRLATP DRLGRAGIDK IGLGALIGLS NSWRTDCYMV
AEHLLHLQQH YWQSRYSISF PRLRPCAGGI EPASLMDEAQ LMQVICAFRL LAPDIELSLS
TRESPFFRDH AIPIAINNVS AFSKTQPGGY ADDHPELEQF SPHDSRRPED VAQAIVRAGL
QPVWKDWDGY LGR