Gene ECH74115_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1031 
SymbolltaE 
ID6966822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1042761 
End bp1043762 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content56% 
IMG OID643385044 
ProductL-threonine aldolase 
Protein accessionYP_002269544 
Protein GI209400875 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATT TACGCAGTGA TACCGTAACC CGACCGAGCC GCGCCATGCT CGAAGCGATG 
ATGGCCGCCC CGGTTGGGGA CGACGTTTAC GGAGACGACC CTACCGTTAA TGCTCTGCAG
GACTATGCAG CAGAGCTTTC CGGTAAAGAA GCCGCCATTT TTCTGCCTAC CGGCACTCAG
GCCAACCTGG TCGCTCTGCT CAGTCACTGC GAACGCGGCG AAGAGTATAT TGTCGGTCAG
GCCGCGCATA ACTATCTGTT TGAAGCCGGT GGCGCGGCGG TGCTGGGCAG TATTCAACCG
CAACCCATCG ACGCGGCTGC CGACGGCACG CTACCGCTGG ATAAAGTGGC GATGAAAATC
AAACCCGACG ATATCCATTT CGCCCGCACC AAATTACTCA GTCTGGAAAA CACCCACAAC
GGCAAAGTGT TGCCGCGTGA ATACCTTAAA GATGCATGGG AATTTACCCG CGAGCGCAAT
CTGGCGCTGC ATGTTGACGG TGCGCGCATC TTTAATGCCG TGGTGGCTTA CGGCAGCGAA
CTGAAAGAGC TCACGCAATA TTGTGATTCG TTCACCATTT GCCTGTCGAA AGGTCTTGGG
ACGCCAGTCG GTTCATTACT CGTCGGTAAT CGTGATTACA TTAAACGTGC CATTCGCTGG
CGGAAAATGA CAGGTGGCGG GATGCGCCAG TCCGGCATTC TGGCTGCCGC CGGGATGTAT
GCGCTGAAAA ATAACGTCGC ACGGTTGCAG GAAGATCACG ACAACGCCGC CTGGATGGCG
GAGCAACTGC GTGAAGCAGG CGCGGATGTG ATGCGTCAGG ACACTAATAT GCTGTTTGTT
CGCGTCGGCG AAGAAAATGC TGCCGCGTTA GGCGAATACA TGAAAGCGAG AAACGTACTG
ATTAACGCCT CGCCGATTGT CCGCCTGGTG ACCCATCTTG ACGTCTCGCG CGAACAACTG
GCAGAAGTCG CCGCCCACTG GCGCGCATTC CTGGCGCGTT AA
 
Protein sequence
MIDLRSDTVT RPSRAMLEAM MAAPVGDDVY GDDPTVNALQ DYAAELSGKE AAIFLPTGTQ 
ANLVALLSHC ERGEEYIVGQ AAHNYLFEAG GAAVLGSIQP QPIDAAADGT LPLDKVAMKI
KPDDIHFART KLLSLENTHN GKVLPREYLK DAWEFTRERN LALHVDGARI FNAVVAYGSE
LKELTQYCDS FTICLSKGLG TPVGSLLVGN RDYIKRAIRW RKMTGGGMRQ SGILAAAGMY
ALKNNVARLQ EDHDNAAWMA EQLREAGADV MRQDTNMLFV RVGEENAAAL GEYMKARNVL
INASPIVRLV THLDVSREQL AEVAAHWRAF LAR