Gene YPK_0347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_0347 
Symbol 
ID6090852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp371987 
End bp374032 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content52% 
IMG OID641595412 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001719110 
Protein GI170022605 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATA ATACAACGTC ATTACCCGCT GAAAATTCGT CACACCCACG TAAAGGCACA 
CCTATTCGTA AAAAGCAGCG CGAAGAGGCC CAACAGTTTA TTAATACCTT ACAAGGCGTT
ACTTTCCCCA ACTCCCAACG TATTTATCTA CAAGGCTCTC GGCCCGATAT TCAAGTGCCG
ATGCGTGAAA TCCAACTCAG CCCGACGCAA ATCGGCGGCA GTAAAAACGA ACCACGCTAT
GAAGATAATG AGGCCATCCC GGTCTATGAC ACCTCCGGTC CCTATGGTGA CCCACAAGCT
AAACTGGATG TTCACAACGG GCTGCCTAAA CTGCGTGCCG CTTGGGTCGC TGATCGCCAA
GATACTGAAG CGCTGGCATC TGTCAGTTCC GGCTTTACCC AACAGCGTTT GGCTGATGAA
GGCCTGGACC ATTTACGTTT TGAGCATCTA CCCCGCCCAC GGAAAGCAGC CACTGGTCAA
TGTGTGACTC AGTTGCATTA CGCCCGACAG GGGAAAATCA CGCCAGAGAT GGAGTTTATC
GCCCTACGGG AAAATATGGG CCGTGAACGT ATTCGGGGTG AAGTCTTGCT TCAACAACAT
CCGGGACAAG CGTTTGGTGC CCATCTGCCG GAAAATATCA CCGCCGAGTT TGTGCGTCAG
GAAGTGGCGG CCGGCCGAGC CATCATCCCC GCCAATATTA ATCACCCAGA ATCTGAACCA
ATGATTATTG GCCGTAATTT TCTGGTCAAA GTGAATGCCA ACATCGGTAA TTCCGCCGTG
ACCTCTTCCA TTGAAGAAGA GGTAGAAAAA CTGGTCTGGT CTACCCGCTG GGGGGCCGAT
ACGGTGATGG ACTTATCTAC GGGCCGCTAT ATTCATGAAA CGCGGGAATG GATCCTACGT
AACAGCCCGG TCCCTATTGG CACGGTACCT ATCTATCAGG CGCTGGAAAA AGTTAATGGC
GTGGCCGAAA ACCTGACCTG GGAAATGTTC CGTGACACCC TGTTAGAGCA GGCAGAGCAA
GGGGTAGACT ATTTTACTCT CCACGCGGGG GTCTTGTTGC GCTATGTGCC GATGACTGCC
AAACGCCTAA CCGGTATCGT CTCTCGCGGC GGTTCAATTA TGGCAAAATG GTGCCTTTCG
CATCATCAGG AAAACTTCCT GTATCAGCAT TTCCGCGAAA TCTGTCAGAT TTGTGCAGCC
TATGACGTTT CATTATCACT GGGCGATGGC CTGCGCCCTG GCTCTATTCA AGATGCCAAT
GATGAGGCCC AATTCGCCGA ACTGCATACC TTGGGTGAAT TGACCAAAAT CGCCTGGGAG
TATGATGTAC AGGTGATGAT CGAAGGCCCA GGGCATGTGC CGATGCAGAT GATCCGCCGC
AATATGACCG AGGAACTGGA ACACTGCCAC GAAGCGCCAT TTTATACCTT AGGCCCACTG
ACCACGGACA TCGCACCGGG CTATGACCAC TTTACCTCAG GGATTGGTGC AGCGATGATC
GGCTGGTTCG GTTGCTCCAT GCTCTGTTAT GTCACCCCCA AAGAGCACCT TGGTCTGCCG
AATAAAGAGG ATGTCAAACA GGGGCTTATT ACTTACAAAA TTGCCGCGCA CGCCGCAGAT
TTGGCTAAAG GCCACCCCGG TGCGCAAATT CGTGATAACG CCATGTCCAA AGCTCGCTTT
GAGTTCCGCT GGGAAGATCA ATTCAATTTG GCGCTTGATC CAGCAACAGC CCGCGCTTAC
CACGATGAAA CCTTGCCGCA AGAGTCCGGG AAAGTCGCTC ATTTTTGTTC CATGTGCGGC
CCGAAATTCT GTTCAATGAA AATCTCACAA GAGGTCCGCG ATTATGCTGC GGCACAAGAA
CAAGCGGCCG CACAAGCACA AGCCGCTACA CCAACCACCG CAGCACAACC AATAGACATC
ACGCAGCCAA TTAATATGCT GCAATCAGGG ATGGAAAAAA TGTCGGCCGA GTTTCGCTCC
CGTGGCAGTG AGCTATACCA CCGCCCGGCG AATCTGAGTG CGGAGGCCAA TAATGAGCCA
ACTTGA
 
Protein sequence
MSNNTTSLPA ENSSHPRKGT PIRKKQREEA QQFINTLQGV TFPNSQRIYL QGSRPDIQVP 
MREIQLSPTQ IGGSKNEPRY EDNEAIPVYD TSGPYGDPQA KLDVHNGLPK LRAAWVADRQ
DTEALASVSS GFTQQRLADE GLDHLRFEHL PRPRKAATGQ CVTQLHYARQ GKITPEMEFI
ALRENMGRER IRGEVLLQQH PGQAFGAHLP ENITAEFVRQ EVAAGRAIIP ANINHPESEP
MIIGRNFLVK VNANIGNSAV TSSIEEEVEK LVWSTRWGAD TVMDLSTGRY IHETREWILR
NSPVPIGTVP IYQALEKVNG VAENLTWEMF RDTLLEQAEQ GVDYFTLHAG VLLRYVPMTA
KRLTGIVSRG GSIMAKWCLS HHQENFLYQH FREICQICAA YDVSLSLGDG LRPGSIQDAN
DEAQFAELHT LGELTKIAWE YDVQVMIEGP GHVPMQMIRR NMTEELEHCH EAPFYTLGPL
TTDIAPGYDH FTSGIGAAMI GWFGCSMLCY VTPKEHLGLP NKEDVKQGLI TYKIAAHAAD
LAKGHPGAQI RDNAMSKARF EFRWEDQFNL ALDPATARAY HDETLPQESG KVAHFCSMCG
PKFCSMKISQ EVRDYAAAQE QAAAQAQAAT PTTAAQPIDI TQPINMLQSG MEKMSAEFRS
RGSELYHRPA NLSAEANNEP T