Gene YpsIP31758_3853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3853 
SymbolthiC 
ID5388544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4339018 
End bp4341063 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content52% 
IMG OID640866878 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001402804 
Protein GI153949362 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATA ATACAACGTC ATTACCCGCT GAAAATTCGT CACACCCACG TAAAGGCACA 
CCTATTCGTA AAAAGCAGCG CGAAGAGGCC CAACAGTTTA TTAATACCTT ACAAGGCGTT
ACTTTCCCCA ACTCCCAACG TATTTATCTA CAAGGCTCTC GGCCCGATAT TCAAGTGCCG
ATGCGTGAAA TCCAACTCAG CCCGACGCAA ATCGGCGGCA GTAAAAACGA ACCACGCTAT
GAAGATAATG AGGCCATCCC GGTCTATGAC ACCTCCGGTC CCTATGGTGA CCCACAAGCT
AAACTGGATG TTCACAACGG GCTGCCTAAA CTGCGTGCCG CTTGGGTCGC TGATCGCCAA
GATACTGAAG CGCTGGCATC TGTCAGTTCC GGCTTTACCC AACAGCGTTT GGCTGATGAA
GGCCTGGACC ATTTACGTTT TGAGCATCTA CCCCGCCCAC GGAAAGCAGC CACTGGTCAA
TGTGTGACTC AGTTGCATTA CGCCCGACAG GGGAAAATCA CGCCAGAGAT GGAGTTTATC
GCCCTACGGG AAAATATGGG CCGTGAACGT ATTCGGGGTG AAGTCTTGCT TCAACAACAT
CCGGGACAAG CGTTTGGTGC CCATCTGCCG GAAAATATCA CCGCCGAGTT TGTGCGTCAG
GAAGTGGCGG CCGGCCGAGC CATCATCCCC GCCAATATTA ATCACCCAGA ATCTGAACCA
ATGATTATTG GCCGTAATTT TCTGGTCAAA GTGAATGCCA ACATCGGTAA TTCCGCCGTG
ACCTCTTCCA TTGAAGAAGA GGTAGAAAAA CTGGTCTGGT CTACCCGTTG GGGTGCCGAT
ACGGTGATGG ACTTATCTAC GGGCCGCTAT ATTCATGAAA CGCGGGAATG GATCCTACGT
AACAGCCCGG TCCCTATTGG CACGGTACCT ATCTATCAGG CGCTGGAAAA AGTTAATGGC
GTGGCCGAAA ATCTGACCTG GGAAATGTTC CGTGACACCC TGTTAGAGCA GGCAGAGCAA
GGGGTAGACT ATTTTACTCT CCACGCGGGG GTCTTGTTGC GCTATGTGCC GATGACTGCC
AAACGCCTAA CCGGTATCGT CTCTCGCGGC GGTTCAATTA TGGCAAAATG GTGCCTTTCG
CATCATCAGG AAAACTTCCT GTATCAGCAT TTCCGCGAAA TCTGTCAGAT TTGTGCAGCC
TATGACGTTT CATTATCACT GGGCGATGGC CTGCGCCCTG GCTCTATTCA AGATGCCAAT
GATGAGGCCC AATTCGCCGA ACTGCATACC TTGGGTGAAT TGACCAAAAT CGCCTGGGAG
TATGATGTAC AGGTGATGAT CGAAGGCCCA GGGCATGTGC CGATGCAGAT GATCCGCCGC
AATATGACCG AGGAACTGGA ACACTGCCAC GAAGCGCCAT TTTATACCTT AGGCCCACTG
ACCACGGACA TCGCACCGGG CTATGACCAC TTTACCTCAG GGATTGGTGC AGCGATGATC
GGCTGGTTCG GTTGTGCCAT GCTCTGTTAT GTCACCCCCA AAGAGCACCT TGGTCTGCCG
AATAAAGAGG ATGTCAAACA GGGGCTTATT ACTTACAAAA TTGCCGCGCA CGCCGCAGAT
TTGGCTAAAG GCCACCCCGG TGCGCAAATT CGTGATAACG CCATGTCCAA AGCTCGCTTT
GAGTTCCGCT GGGAAGATCA ATTCAATTTG GCGCTTGATC CAGCAACAGC CCGCGCTTAC
CACGATGAAA CCCTGCCGCA AGAGTCCGGG AAAGTCGCTC ATTTTTGTTC CATGTGCGGC
CCGAAATTCT GTTCAATGAA AATCTCACAA GAGGTCCGCG ATTATGCTGC GGCACAAGAA
CAAGCGGCCG CACAAGCACA AGCTGCTACA CCAACCACCG CAGCACAACC AATAGACATC
ACGCAGCCAA TTAATATGCT GCAATCAGGG ATGGAAAAAA TGTCAGCCGA GTTTCGCTCC
CGTGGCAGTG AGCTATACCA CCGCCCGGCG AATCTGAGTG CGGAGGCCAA TAATGAGCCA
ACTTGA
 
Protein sequence
MSNNTTSLPA ENSSHPRKGT PIRKKQREEA QQFINTLQGV TFPNSQRIYL QGSRPDIQVP 
MREIQLSPTQ IGGSKNEPRY EDNEAIPVYD TSGPYGDPQA KLDVHNGLPK LRAAWVADRQ
DTEALASVSS GFTQQRLADE GLDHLRFEHL PRPRKAATGQ CVTQLHYARQ GKITPEMEFI
ALRENMGRER IRGEVLLQQH PGQAFGAHLP ENITAEFVRQ EVAAGRAIIP ANINHPESEP
MIIGRNFLVK VNANIGNSAV TSSIEEEVEK LVWSTRWGAD TVMDLSTGRY IHETREWILR
NSPVPIGTVP IYQALEKVNG VAENLTWEMF RDTLLEQAEQ GVDYFTLHAG VLLRYVPMTA
KRLTGIVSRG GSIMAKWCLS HHQENFLYQH FREICQICAA YDVSLSLGDG LRPGSIQDAN
DEAQFAELHT LGELTKIAWE YDVQVMIEGP GHVPMQMIRR NMTEELEHCH EAPFYTLGPL
TTDIAPGYDH FTSGIGAAMI GWFGCAMLCY VTPKEHLGLP NKEDVKQGLI TYKIAAHAAD
LAKGHPGAQI RDNAMSKARF EFRWEDQFNL ALDPATARAY HDETLPQESG KVAHFCSMCG
PKFCSMKISQ EVRDYAAAQE QAAAQAQAAT PTTAAQPIDI TQPINMLQSG MEKMSAEFRS
RGSELYHRPA NLSAEANNEP T