Gene EcSMS35_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0003 
SymbolthrC 
ID6146078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3733 
End bp5019 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content53% 
IMG OID641614904 
Productthreonine synthase 
Protein accessionYP_001742120 
Protein GI170682268 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCT ACAATCTGAA AGATCACAAT GAGCAGGTCA GCTTTGCGCA AGCCGTAACC 
CAGGGGTTGG GCAAAAATCA GGGGCTGTTT TTCCCGCACG ACCTGCCGGA ATTCAGCCTG
ACTGAAATTG ATGAGATGCT GAAGCTGGAT TTTGTCACCC GCAGTGCGAA GATCCTCTCG
GCGTTTATTG GTGATGAAAT CCCGCAGGAA ATCCTGGAAG AGCGCGTACG CGCGGCGTTT
GCCTTCCCGG CTCCGGTCGC CAATGTTGAA AGCGATGTCG GTTGTCTGGA ATTGTTTCAC
GGGCCAACGC TGGCATTTAA AGATTTCGGC GGTCGCTTTA TGGCACAAAT GCTGACCCAT
ATTGCGGGCG ATAAGCCAGT GACCATTCTG ACCGCGACAT CTGGTGATAC TGGAGCGGCA
GTGGCTCATG CTTTCTACGG TTTACCGAAT GTGAAAGTGG TTATCCTCTA TCCACGAGGC
AAAATCAGTC CACTGCAAGA AAAACTGTTC TGTACATTGG GCGGCAATAT CGAAACTGTT
GCCATCGACG GCGATTTCGA TGCCTGTCAG GCGCTGGTGA AGCAGGCGTT TGATGATGAA
GAACTGAAAG TGGCGCTGGG GCTAAACTCT GCTAACTCCA TTAACATCAG CCGTTTGCTG
GCGCAGATTT GCTACTACTT TGAAGCTGTT GCGCAGCTGC CGCAGGAAGC GCGCAACCAG
CTGGTTGTCT CGGTGCCAAG CGGAAACTTC GGCGATTTGA CGGCGGGTCT GCTGGCGAAG
TCACTCGGTC TGCCGGTGAA ACGTTTTATT GCTGCGACCA ACGTGAACGA TACCGTGCCA
CGTTTCCTGC ACGACGGTCA GTGGTCACCC AAAGCGACTC AGGCGACGTT ATCCAATGCG
ATGGATGTTA GCCAGCCAAA CAACTGGCCG CGTGTGGAAG AGTTGTTCCG CCGCAAAATC
TGGCAACTGA AAGAGCTGGG GTATGCAGCC GTGGATGATG AAACCACGCA ACAGACAATG
CGTGAGTTAA AAGAACTGGG CTATACCTCG GAGCCGCACG CTGCCGTAGC TTATCGTGCG
CTGCGTGACC AGTTGCATCC AGGCGAATAT GGCTTGTTCC TCGGCACCGC GCATCCGGCG
AAATTTAAAG AGAGCGTGGA AGCGATTCTC GGTGAAACAT TGGATCTGCC AAAAGAGCTG
GCAGAACGTG CTGATTTACC CTTGCTTTCG CATAACCTGC CCGCCGATTT TGCTGCGTTG
CGTAAATTGA TGATGAATCA TCAGTAA
 
Protein sequence
MKLYNLKDHN EQVSFAQAVT QGLGKNQGLF FPHDLPEFSL TEIDEMLKLD FVTRSAKILS 
AFIGDEIPQE ILEERVRAAF AFPAPVANVE SDVGCLELFH GPTLAFKDFG GRFMAQMLTH
IAGDKPVTIL TATSGDTGAA VAHAFYGLPN VKVVILYPRG KISPLQEKLF CTLGGNIETV
AIDGDFDACQ ALVKQAFDDE ELKVALGLNS ANSINISRLL AQICYYFEAV AQLPQEARNQ
LVVSVPSGNF GDLTAGLLAK SLGLPVKRFI AATNVNDTVP RFLHDGQWSP KATQATLSNA
MDVSQPNNWP RVEELFRRKI WQLKELGYAA VDDETTQQTM RELKELGYTS EPHAAVAYRA
LRDQLHPGEY GLFLGTAHPA KFKESVEAIL GETLDLPKEL AERADLPLLS HNLPADFAAL
RKLMMNHQ