Gene YpAngola_A0455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0455 
SymbolthiC 
ID5798919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp475914 
End bp477959 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content52% 
IMG OID641338461 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001605060 
Protein GI162419774 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00000571171 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTAATA ATACAACGTC ATTACCCGCT GAAAATTCGT CACACCCACG TAAAGGCACA 
CCTATTCGTA AAAAGCAGCG CGAAGAGGCC CAACAGTTTA TTAATACCTT ACAAGACGTT
ACTTTTCCCA ACTCCCAACG TATTTATCTA CAAGGCTCTC GGCCCGATAT TCAAGTGCCG
ATGCGTGAAA TCCAACTCAG CCCGACGCAA ATCGGCGGCA GTAAAAACGA ACCACGCTAT
GAAGATAATG AGGCCATCCC GGTCTATGAC ACCTCCGGTC CCTATGGTGA CCCACAAGCT
AAACTGGATG TTCATAACGG GCTGCCTAAA CTGCGTGCCG CTTGGGTCGC AGATCGCCAA
GATACTGAAG CGCTGGCATC TGTCAGTTCC GGCTTTACCC AACAGCGTTT GGCTGATGAA
GGCCTGGACC ATTTACGTTT TGAGCATCTA CCCCGCCCAC GGAAAGCAGC CACTGGTCAA
TGTGTGACTC AGTTGCATTA CGCCCGACAG GGGAAAATCA CGCCAGAGAT GGAGTTTATC
GCCCTACGGG AAAATATGGG CCGTGAACGT ATTCGGGGTG AAGTCTTGCT TCAACAACAT
CCGGGACAAG CGTTTGGTGC CCATCTGCCG GAAAATATCA CCGCCGAGTT TGTGCGTCAG
GAAGTGGCGG CCGGCCGAGC CATCATCCCC GCCAATATTA ATCACCCAGA ATCTGAACCA
ATGATTATTG GCCGTAATTT TCTGGTCAAA GTGAATGCCA ACATCGGTAA TTCCGCCGTG
ACCTCTTCCA TTGAAGAAGA GGTAGAAAAA CTGGTCTGGT CTACCCGTTG GGGTGCCGAT
ACGGTGATGG ACTTATCTAC GGGCCGCTAT ATTCATGAAA CGCGGGAATG GATCCTACGT
AACAGCCCGG TCCCTATTGG CACGGTACCT ATCTATCAGG CGCTGGAAAA AGTTAATGGC
GTGGCCGAAA ATCTGACCTG GGAAATGTTC CGTGACACCC TGTTAGAGCA GGCAGAGCAA
GGGGTAGACT ATTTTACTCT CCACGCGGGG GTCTTGTTGC GCTATGTGCC GATGACTGCC
AAACGCCTAA CCGGTATCGT CTCTCGCGGC GGTTCAATTA TGGCAAAATG GTGCCTTTCG
CATCATCAGG AAAACTTCCT GTATCAGCAT TTCCGCGAAA TCTGTCAGAT TTGTGCAGCC
TATGACGTTT CATTATCACT GGGCGATGGC CTGCGCCCTG GCTCTATTCA AGATGCCAAT
GATGAGGCCC AATTCGCCGA ACTGCATACC TTGGGTGAAT TGACCAAAAT CGCCTGGGAG
TATGATGTAC AGGTGATGAT CGAAGGCCCA GGGCATGTGC CGATGCAGAT GATCCGCCGC
AATATGACCG AGGAACTGGA ACACTGCCAC GAAGCGCCAT TTTATACCTT AGGCCCACTG
ACCACGGACA TCGCACCGGG CTATGACCAC TTTACCTCAG GGATTGGTGC AGCGATGATC
GGCTGGTTCG GTTGCGCCAT GCTCTGTTAT GTCACCCCCA AAGAGCACCT TGGTCTGCCG
AATAAAGAGG ATGTCAAACA GGGGCTTATT ACTTACAAAA TTGCCGCGCA CGCCGCAGAT
TTGGCTAAAG GCCACCCCGG TGCGCAAATT CGTGATAACG CCATGTCCAA AGCTCGCTTT
GAGTTCCGCT GGGAAGATCA ATTCAATTTG GCGCTTGATC CAGCAACAGC CCGCGCTTAC
CACGATGAAA CCTTGCCGCA AGAGTCCGGG AAAGTCGCTC ATTTTTGTTC CATGTGCGGC
CCGAAATTCT GTTCAATGAA AATCTCACAA GAGGTCCGCG ATTATGCTGC GGCACAAGAA
CAAGCGGCCG CACAAGCACA AGCCGCTACA CCAACCACCG CAGCACAACC AATAGACATC
ACGCAGCCAA TTAATATGCT GCAATCAGGG ATGGAAAAAA TGTCGGCCGA GTTTCGCTCC
CGTGGCAGTG AGCTATACCA CCGCCCGGCG AATCTGAGTG CGGAGGCCAA TAATGAGCCA
ACTTGA
 
Protein sequence
MSNNTTSLPA ENSSHPRKGT PIRKKQREEA QQFINTLQDV TFPNSQRIYL QGSRPDIQVP 
MREIQLSPTQ IGGSKNEPRY EDNEAIPVYD TSGPYGDPQA KLDVHNGLPK LRAAWVADRQ
DTEALASVSS GFTQQRLADE GLDHLRFEHL PRPRKAATGQ CVTQLHYARQ GKITPEMEFI
ALRENMGRER IRGEVLLQQH PGQAFGAHLP ENITAEFVRQ EVAAGRAIIP ANINHPESEP
MIIGRNFLVK VNANIGNSAV TSSIEEEVEK LVWSTRWGAD TVMDLSTGRY IHETREWILR
NSPVPIGTVP IYQALEKVNG VAENLTWEMF RDTLLEQAEQ GVDYFTLHAG VLLRYVPMTA
KRLTGIVSRG GSIMAKWCLS HHQENFLYQH FREICQICAA YDVSLSLGDG LRPGSIQDAN
DEAQFAELHT LGELTKIAWE YDVQVMIEGP GHVPMQMIRR NMTEELEHCH EAPFYTLGPL
TTDIAPGYDH FTSGIGAAMI GWFGCAMLCY VTPKEHLGLP NKEDVKQGLI TYKIAAHAAD
LAKGHPGAQI RDNAMSKARF EFRWEDQFNL ALDPATARAY HDETLPQESG KVAHFCSMCG
PKFCSMKISQ EVRDYAAAQE QAAAQAQAAT PTTAAQPIDI TQPINMLQSG MEKMSAEFRS
RGSELYHRPA NLSAEANNEP T