Gene YpAngola_A0806 details


Gene Information

Locus tag: YpAngola_A0806
Symbol: thrC
ID: 5799268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 824561
End bp: 825841
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 48%
IMG OID: 641338803
Product: threonine synthase
Protein accession: YP_001605381
Protein GI: 162419130
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
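
The start, end, gene length, and protein length values in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based, inclusive.
START_BP = 824561
END_BP = 825841


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Both results agree with the Gene Length and Protein Length fields listed in the record.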


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.196525
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAACTGT ATAACCTTAA AGATCATAAC GAGCAGGTCA GCTTCGCACA GGCGATCAAA 
CAAGGCTTGG GCAAACAGCA GGGGCTATTT TTCCCGTTGG AGTTGCCCGA GTTCGAACTG
ACTGAGATCG ATAAACTGCT GGATCTGGAT TTTGTGACCC GTAGTAGCCG CATTCTGTCG
GCTTTTATTG GTAGTGAAAT TGCACCAGAT GTCTTAACCA AACGTGTGCA GGCAGCTTTT
GAATTCCCAG CCCCTGTTGC ACAAGTTGAA AATGATATCG CCGTTCTGGA GTTGTTCCAC
GGCCCGACCT TGGCGTTTAA AGACTTTGGC GCACGTTTTA TGGCACAGAT GCTGGCCGAA
GTTGCTGGCG ATCAACCCGT CACTATTTTA ACGGCAACAT CGGGTGATAC TGGTGCGGCG
GTGGCTCATG CATTCTACGG GCTGAAAAAT GTTCGAGTGG TTATCCTTTA TCCACGAGGC
AAGATTAGTC CGCTGCAAGA AAAGCTGTTT TGTACTCTGG GCGGTAATAT TCACACGGTG
GCGATTGATG GCGATTTTGA TGCTTGTCAG GCACTGGTTA AGCAAGCTTT TGATGATGAA
GAACTGAAAA CCGCGCTGAG TTTGAATTCC GCTAACTCAA TTAATATCAG CCGGTTACTG
GCACAAATTT GTTACTACTT TGAAGCTGTA GCCCAGTTAC CACAAGAGGC GCGCAACCAA
CTGGTTATCT CTGTACCAAG CGGTAACTTT GGTGATTTGA CTGCCGGTTT GTTGGCAAAA
TCTTTAGGGC TACCGGTGAA ACGCTTTATT GCTGCCACTA ATGCTAACGA CACCGTGCCA
CGCTTTTTGG TTAACGGCCA GTGGCAGCCA AAGCCAACTG TAGCCACGTT ATCGAATGCC
ATGGATGTCA GTCAGCCAAA TAATTGGCCA CGTGTGGAAG AGCTATATCG TCGCAAAATC
TGGCAGTTGA AGGATTTAGG GCATGGTGCG GTTAGCGATA AAACCACAAA AGATACGATG
CGTGAATTAG CTGGTTTGGG GTATATCTCT GAGCCTCACG CGGCGATTGC TTACCGTGTA
TTGCGTGATC AGTTGCAAGA TGGTGAATTT GGCTTGTTTA TCGGAACCGC ACATCCCGCG
AAATTCAAAG AGAGCGTAGA GGCAATTCTG GGTCAGGAAT TACCTTTACC AAAACCACTG
GCCGTGAGAG CGCAATTACC GTTGCTATCC CATGATTTGC CCGCAGACTT TGCACAAATG
CGGGCATTTT TGATGGCATA G
 
Protein sequence
MKLYNLKDHN EQVSFAQAIK QGLGKQQGLF FPLELPEFEL TEIDKLLDLD FVTRSSRILS 
AFIGSEIAPD VLTKRVQAAF EFPAPVAQVE NDIAVLELFH GPTLAFKDFG ARFMAQMLAE
VAGDQPVTIL TATSGDTGAA VAHAFYGLKN VRVVILYPRG KISPLQEKLF CTLGGNIHTV
AIDGDFDACQ ALVKQAFDDE ELKTALSLNS ANSINISRLL AQICYYFEAV AQLPQEARNQ
LVISVPSGNF GDLTAGLLAK SLGLPVKRFI AATNANDTVP RFLVNGQWQP KPTVATLSNA
MDVSQPNNWP RVEELYRRKI WQLKDLGHGA VSDKTTKDTM RELAGLGYIS EPHAAIAYRV
LRDQLQDGEF GLFIGTAHPA KFKESVEAIL GQELPLPKPL AVRAQLPLLS HDLPADFAQM
RAFLMA
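
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough sketch of that relationship, the code below builds the codon table from the compact NCBI amino acid string (identical for tables 1 and 11, which differ only in their permitted start codons) and translates the first and last rows of the gene sequence. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the sketch assumes the sequence reads 5' to 3' in frame, as printed here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of input.

    Simplification: alternative start codons (e.g. GTG, TTG) are NOT
    converted to Met; the gene in this record starts with ATG.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_row = "ATGAAACTGT ATAACCTTAA AGATCATAAC GAGCAGGTCA GCTTCGCACA GGCGATCAAA"
last_row = "CGGGCATTTT TGATGGCATA G"
print(translate(first_row))  # MKLYNLKDHNEQVSFAQAIK
print(translate(last_row))   # RAFLMA
```

The last row can be translated on its own because the 21 preceding rows contain 1260 bases, a multiple of three, so it starts on a codon boundary. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.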