Gene Information |
Locus tag | EcSMS35_0003 |
Symbol | thrC |
ID | 6146078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3733 |
End bp | 5019 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641614904 |
Product | threonine synthase |
Protein accession | YP_001742120 |
Protein GI | 170682268 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0498] Threonine synthase |
TIGRFAM ID | [TIGR00260] threonine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCT ACAATCTGAA AGATCACAAT GAGCAGGTCA GCTTTGCGCA AGCCGTAACC CAGGGGTTGG GCAAAAATCA GGGGCTGTTT TTCCCGCACG ACCTGCCGGA ATTCAGCCTG ACTGAAATTG ATGAGATGCT GAAGCTGGAT TTTGTCACCC GCAGTGCGAA GATCCTCTCG GCGTTTATTG GTGATGAAAT CCCGCAGGAA ATCCTGGAAG AGCGCGTACG CGCGGCGTTT GCCTTCCCGG CTCCGGTCGC CAATGTTGAA AGCGATGTCG GTTGTCTGGA ATTGTTTCAC GGGCCAACGC TGGCATTTAA AGATTTCGGC GGTCGCTTTA TGGCACAAAT GCTGACCCAT ATTGCGGGCG ATAAGCCAGT GACCATTCTG ACCGCGACAT CTGGTGATAC TGGAGCGGCA GTGGCTCATG CTTTCTACGG TTTACCGAAT GTGAAAGTGG TTATCCTCTA TCCACGAGGC AAAATCAGTC CACTGCAAGA AAAACTGTTC TGTACATTGG GCGGCAATAT CGAAACTGTT GCCATCGACG GCGATTTCGA TGCCTGTCAG GCGCTGGTGA AGCAGGCGTT TGATGATGAA GAACTGAAAG TGGCGCTGGG GCTAAACTCT GCTAACTCCA TTAACATCAG CCGTTTGCTG GCGCAGATTT GCTACTACTT TGAAGCTGTT GCGCAGCTGC CGCAGGAAGC GCGCAACCAG CTGGTTGTCT CGGTGCCAAG CGGAAACTTC GGCGATTTGA CGGCGGGTCT GCTGGCGAAG TCACTCGGTC TGCCGGTGAA ACGTTTTATT GCTGCGACCA ACGTGAACGA TACCGTGCCA CGTTTCCTGC ACGACGGTCA GTGGTCACCC AAAGCGACTC AGGCGACGTT ATCCAATGCG ATGGATGTTA GCCAGCCAAA CAACTGGCCG CGTGTGGAAG AGTTGTTCCG CCGCAAAATC TGGCAACTGA AAGAGCTGGG GTATGCAGCC GTGGATGATG AAACCACGCA ACAGACAATG CGTGAGTTAA AAGAACTGGG CTATACCTCG GAGCCGCACG CTGCCGTAGC TTATCGTGCG CTGCGTGACC AGTTGCATCC AGGCGAATAT GGCTTGTTCC TCGGCACCGC GCATCCGGCG AAATTTAAAG AGAGCGTGGA AGCGATTCTC GGTGAAACAT TGGATCTGCC AAAAGAGCTG GCAGAACGTG CTGATTTACC CTTGCTTTCG CATAACCTGC CCGCCGATTT TGCTGCGTTG CGTAAATTGA TGATGAATCA TCAGTAA
|
Protein sequence | MKLYNLKDHN EQVSFAQAVT QGLGKNQGLF FPHDLPEFSL TEIDEMLKLD FVTRSAKILS AFIGDEIPQE ILEERVRAAF AFPAPVANVE SDVGCLELFH GPTLAFKDFG GRFMAQMLTH IAGDKPVTIL TATSGDTGAA VAHAFYGLPN VKVVILYPRG KISPLQEKLF CTLGGNIETV AIDGDFDACQ ALVKQAFDDE ELKVALGLNS ANSINISRLL AQICYYFEAV AQLPQEARNQ LVVSVPSGNF GDLTAGLLAK SLGLPVKRFI AATNVNDTVP RFLHDGQWSP KATQATLSNA MDVSQPNNWP RVEELFRRKI WQLKELGYAA VDDETTQQTM RELKELGYTS EPHAAVAYRA LRDQLHPGEY GLFLGTAHPA KFKESVEAIL GETLDLPKEL AERADLPLLS HNLPADFAAL RKLMMNHQ
|
| |
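The coordinate, length, and GC content fields in this record are related by simple arithmetic. As a minimal sketch, the reported lengths can be cross-checked as shown below. The helper names `gene_length`, `protein_length`, `clean_sequence`, and `gc_percent` are hypothetical and not part of any database API. The sketch assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Values taken from the record above.
print(gene_length(3733, 5019))   # 1287
print(protein_length(1287))      # 428
# First three blocks of the gene sequence, used only as a short demo input.
print(clean_sequence("ATGAAACTCT ACAATCTGAA AGATCACAAT"))
```

Passing the full gene sequence from the record to `gc_percent` should give a value comparable to the reported GC content, although the rounding convention used by the database is not stated here.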