Gene PC1_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPC1_1033 
Symbol 
ID8131962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePectobacterium carotovorum subsp. carotovorum PC1 
KingdomBacteria 
Replicon accessionNC_012917 
Strand
Start bp1203877 
End bp1205325 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content51% 
IMG OID644864316 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_003016618 
Protein GI253687428 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTA TCATTAAATT GTTCCCGGAA ATCACCATCA AGAGCCAATC TGTGCGATTG 
CGCTTTATCA AGATTCTAAC CGGAAACATT CGCAACGTAT TAAAACACTA TGATGAAACG
CTGGCGGTTG TCCGTCACTG GGATCATATC GAAGTTCGGG CCAAAGACGA AAGCCAGCGT
TCGGCGATTC GTGATGCGTT GACGAGAATT CCGGGTATCC ACCATATTCT GGAAGTTGAA
GATCACGCCT ATACCGATGT GCACAATATC TTTGAACAGG CGCTGGCGCT ATATCGCGAG
CAACTGGAAG GCAAAACGTT TTGTGTACGA GTGAAACGCC GCGGTAAGCA TGAGTTTAGT
TCACAGGACG TCGAACGCTA CGTCGGCGGC GGTCTGAACC AGCACATTGA AACGGCGCGG
GTCAACCTTA CAGCACCGCA GGTTACCGTG CATCTGGAAA TCGAGCAGGA CCGTTTGCTG
CTGATTAAAG GACGTTATGA AGGGATCGGC GGCTTCCCGA TCGGTACGCA GGAAGATGTG
CTGTCGTTGA TTTCCGGTGG TTTTGACTCC GGTGTATCCA GCTACATGCT GATGCGTCGA
GGGTGTCGTG TGCATTATTG TTTCTTCAAT CTTGGTGGTG CGGCACATGA GATTGGTGTG
AAACAGGTAG CGCACTATCT GTGGAACCGT TTTGGTAGCT CGCACCGGGT GCGTTTTATT
GCTATCGACT TCGATCCTGT GGTGGGTGAA ATTCTGGAGA AGGTTGACGA CGGCCAGATG
GGCGTCGTAT TGAAGCGCAT GATGGTGCGT GCGGCCTCCA AAATTGCTGA GCGTTACGGT
GTTCAGGCGC TGGTGACCGG TGAAGCGTTA GGGCAGGTTT CCAGCCAGAC GTTGACCAAT
CTACGTTTGA TCGATAATGC CTCTGATACG CTGATTCTAC GTCCGCTGAT TTCTCATGAT
AAAGAACACA TTATCAAACA GGCTCGCGAA CTGGGAACGG AAGATTTCGC CAAAACGATG
CCGGAATACT GCGGTGTGAT CTCGAAAAGC CCGACGGTGA AGGCGGTAAA AGCGAAGATT
GAGGCTGAAG AAAGTCACTT TGATTTTGCC ATTCTGGAGC GTGTGGTGAG CGAAGCGCGG
AATATTGATA TCCGTCAGAT CGCCGAGCAG ACCAAGCAAG AAGTGGTTGA AATCGAAACG
GTCGCGTCAT TCGCCCCAAC TGACGTACTG CTGGATATCC GCTCGCCGGA TGAACAGGAC
GATAAGCCGC TTGAGCTTGA TCAGATCGAA ATCAAATCCT TACCGTTCTA CAAGTTGGGT
ACGCAGTTTG GCGATTTGGA TCAGAGCAAA ACCTATCTGC TTTACTGCGA GCGTGGCGTA
ATGAGCCGCT TGCAGGCGCT GTACCTGCGC GAGCAAGGCT TTAGCAATGT GAAGGTCTAC
CGACCGTAA
 
Protein sequence
MKFIIKLFPE ITIKSQSVRL RFIKILTGNI RNVLKHYDET LAVVRHWDHI EVRAKDESQR 
SAIRDALTRI PGIHHILEVE DHAYTDVHNI FEQALALYRE QLEGKTFCVR VKRRGKHEFS
SQDVERYVGG GLNQHIETAR VNLTAPQVTV HLEIEQDRLL LIKGRYEGIG GFPIGTQEDV
LSLISGGFDS GVSSYMLMRR GCRVHYCFFN LGGAAHEIGV KQVAHYLWNR FGSSHRVRFI
AIDFDPVVGE ILEKVDDGQM GVVLKRMMVR AASKIAERYG VQALVTGEAL GQVSSQTLTN
LRLIDNASDT LILRPLISHD KEHIIKQARE LGTEDFAKTM PEYCGVISKS PTVKAVKAKI
EAEESHFDFA ILERVVSEAR NIDIRQIAEQ TKQEVVEIET VASFAPTDVL LDIRSPDEQD
DKPLELDQIE IKSLPFYKLG TQFGDLDQSK TYLLYCERGV MSRLQALYLR EQGFSNVKVY
RP