Gene ECH74115_0005 details


Gene Information

Locus tag: ECH74115_0005
Symbol: thrC
ID: 6966720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3751
End bp: 5037
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 53%
IMG OID: 643384089
Product: threonine synthase
Protein accession: YP_002268612
Protein GI: 209396634
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
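
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3751, 5037)
    print(length, protein_length_aa(length))  # prints "1287 428"
```

With the values from this record, 5037 - 3751 + 1 gives 1287 bp, and 1287 / 3 - 1 gives 428 aa, matching the Gene Length and Protein Length fields.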


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.674616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCT ACAATCTTAA AGATCACAAT GAGCAGGTCA GCTTTGCGCA AGCCGTAACC 
CAGGGGTTGG GCAAAAATCA GGGGCTGTTT TTTCCGCACG ACCTGCCGGA ATTCAGCCTG
ACTGAAATTG ATGAGATGCT GAAGCTGGAT TTTGTCACCC GCAGTGCGAA GATCCTCTCG
GCGTTTATTG GTGATGAAAT CCCGCAGGAA ATCCTGGAAG AGCGCGTGCG CGCGGCGTTT
GCCTTCCCGG CTCCGGTCGC CAATGTTGAA AGCGATGTCG GTTGTCTGGA ATTGTTCCAC
GGGCCAACGC TGGCATTTAA AGATTTCGGC GGTCGCTTTA TGGCACAAAT GCTGACCCAT
ATTGCGGGCG ATAAGCCAGT GACCATTCTG ACCGCGACCT CCGGTGATAC CGGAGCGGCA
GTGGCTCATG CTTTCTACGG TTTACCGAAT GTGAAAGTGG TTATCCTTTA TCCACGAGGC
AAAATCAGTC CACTGCAAGA AAAACTGTTC TGTACATTGG GCGGCAATAT CGAAACTGTT
GCCATCGACG GCGATTTCGA TGCCTGTCAG GCGCTGGTGA AGCAGGCGTT TGATGATGAA
GAGCTGAAAG TGGCGCTGGG GTTAAACTCA GCTAACTCGA TTAACATTAG CCGGTTGCTG
GCGCAGATTT GCTACTACTT TGAAGCAGTT GCGCAGCTGC CGCAGGAAGC GCGCAACCAG
CTGGTTGTCT CGGTGCCAAG CGGAAACTTC GGCGATTTGA CGGCGGGTCT GCTGGCGAAG
TCACTCGGTC TGCCGGTGAA ACGTTTTATT GCTGCGACCA ACGTGAACGA TACCGTGCCA
CGTTTCCTGC ATGACGGTCA GTGGTCACCC AAAGCGACTC AGGCGACGTT ATCCAACGCG
ATGGACGTGA GTCAGCCGAA CAACTGGCCG CGTGTGGAAG AGTTGTTCCG CCGCAAAATC
TGGCAACTGA AAGAGCTGGG TTATGCCGCC GTGGATGATG AAACCACGCA ACAGACAATG
CGTGAGTTAA AAGAACTGGG CTACACTTCG GAGCCGCACG CTGCCGTAGC GTATCGTGCG
CTGCGTGACC AGTTGAATCC AGGCGAATAT GGCTTGTTCC TCGGCACCGC GCATCCGGCG
AAATTTAAAG AGAGCGTGGA AGCGATTCTC GGTGAAACGT TGGATCTGCC AAAAGAGCTG
GCAGAACGTG CCGATTTACC CTTGCTTTCG CATAATCTGC CCGCCGATTT TGCTGCGTTG
CGTAAATTGA TGATGAATCA TCAGTAA
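
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to strip the block spacing and compute a GC percentage like the one in the GC content field. The helper names are hypothetical, and the rounding the database applies to its reported value is not known here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = (
        "ATGAAACTCT ACAATCTTAA AGATCACAAT "
        "GAGCAGGTCA GCTTTGCGCA AGCCGTAACC"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1287 bp sequence instead of the first line gives the value to compare against the reported GC content.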
 
Protein sequence
MKLYNLKDHN EQVSFAQAVT QGLGKNQGLF FPHDLPEFSL TEIDEMLKLD FVTRSAKILS 
AFIGDEIPQE ILEERVRAAF AFPAPVANVE SDVGCLELFH GPTLAFKDFG GRFMAQMLTH
IAGDKPVTIL TATSGDTGAA VAHAFYGLPN VKVVILYPRG KISPLQEKLF CTLGGNIETV
AIDGDFDACQ ALVKQAFDDE ELKVALGLNS ANSINISRLL AQICYYFEAV AQLPQEARNQ
LVVSVPSGNF GDLTAGLLAK SLGLPVKRFI AATNVNDTVP RFLHDGQWSP KATQATLSNA
MDVSQPNNWP RVEELFRRKI WQLKELGYAA VDDETTQQTM RELKELGYTS EPHAAVAYRA
LRDQLNPGEY GLFLGTAHPA KFKESVEAIL GETLDLPKEL AERADLPLLS HNLPADFAAL
RKLMMNHQ
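
The protein sequence should be the translation of the gene sequence under translation table 11. A minimal sketch of that check follows. It builds the codon table by hand. Table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which this sketch does not model. The function and constant names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # First 30 bases and last 27 bases of the gene sequence in this record.
    print(translate("ATGAAACTCT ACAATCTTAA AGATCACAAT"))  # prints "MKLYNLKDHN"
    print(translate("CGTAAATTGA TGATGAATCA TCAGTAA"))     # prints "RKLMMNHQ*"
```

Both fragments agree with the start and end of the protein sequence listed above. The trailing `*` is the TAA stop codon, which is why the 1287 bp gene yields 428 residues rather than 429.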