Gene EcSMS35_0899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0899 
SymbolltaE 
ID6145197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp903192 
End bp904193 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content56% 
IMG OID641615787 
ProductL-threonine aldolase 
Protein accessionYP_001742979 
Protein GI170681482 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.859325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATT TACGCAGTGA TACCGTTACC CGACCGAGCC GCGCCATGCT CGAAGCAATG 
ATGGCCGCCC CGGTTGGGGA CGACGTTTAC GGAGACGACC CTACCGTTAA TGCTCTGCAG
GACTATGCGG CAGAGCTTTC CGGTAAAGAA GCCGCCATTT TTCTGCCGAC CGGCACCCAG
GCTAACCTGG TCGCTCTGCT CAGTCACTGC GAACGCGGCG AAGAGTATAT TGTCGGTCAG
GCCGCACACA ACTATCTGTT TGAAGCCGGT GGCGCAGCGG TGCTGGGCAG TATTCAACCG
CAACCCATCG ACGCGGCTGC CGACGGCACG CTACCGCTGG ATAAAGTGGC GATGAAAATC
AAACCCGACG ATATCCATTT CGCCCGCACC AAATTACTCA GTCTGGAAAA CACTCACAAC
GGCAAAGTGC TGCCGCGTGA ATACCTCAAA GAAGCATGGG AATTTACCCG CGAGCGCAAT
CTGGCGCTGC ATGTGGACGG TGCGCGCATC TTTAATGCTG TGGTGGCTTA CGGCTGCGAA
CTGAAAGAGA TCACACAATA TTGTGATTCG TTCACCATTT GCCTGTCGAA AGGTCTTGGG
ACGCCAGTCG GTTCATTACT CGTCGGTAAT CGTGATTACA TTAAACGTGC CATTCGCTGG
CGGAAAATGA CAGGTGGCGG GATGCGCCAG TCCGGCATTC TGGCTGCCGC CGGAATATAT
GCCCTGAAAA ATAACGTTGC GCGCTTGCAG GAAGACCACG ACAACGCTGC CTGGATGGCG
GAGCAGCTGC GTGAAGCAGG CGCGGATGTG ATGCGTCAGG ACACCAATAT GCTGTTTGTT
CGCGTCGGGG AAGAAAATGC TGCCGCGTTA GGCGAATACA TGAAAGCGAG AAACGTGCTG
ATTAACGCCT CGCCGATTGT CCGCCTGGTG ACGCATCTTG ACGTCTCGCG CGCTCAGCTG
GCGGAAGTCG CCGCCCACTG GCGCGCATTC CTGGCGCGTT AA
 
Protein sequence
MIDLRSDTVT RPSRAMLEAM MAAPVGDDVY GDDPTVNALQ DYAAELSGKE AAIFLPTGTQ 
ANLVALLSHC ERGEEYIVGQ AAHNYLFEAG GAAVLGSIQP QPIDAAADGT LPLDKVAMKI
KPDDIHFART KLLSLENTHN GKVLPREYLK EAWEFTRERN LALHVDGARI FNAVVAYGCE
LKEITQYCDS FTICLSKGLG TPVGSLLVGN RDYIKRAIRW RKMTGGGMRQ SGILAAAGIY
ALKNNVARLQ EDHDNAAWMA EQLREAGADV MRQDTNMLFV RVGEENAAAL GEYMKARNVL
INASPIVRLV THLDVSRAQL AEVAAHWRAF LAR