Gene Information |
Locus tag | EcSMS35_4139 |
Symbol | ilvA |
ID | 6146922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4235118 |
End bp | 4236665 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641618961 |
Product | threonine dehydratase |
Protein accession | YP_001746093 |
Protein GI | 170680546 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01124] threonine ammonia-lyase, biosynthetic, long form |
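
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either assumption, but both are consistent with the values shown. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length_bp(4235118, 4236665)
    print(length, protein_length_aa(length))  # prints: 1548 515
```

Under these assumptions the computed values match the Gene Length (1548 bp) and Protein Length (515 aa) fields.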
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0936158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGGCAG ACTCACAACC CCTGTCCGGT GCCCCTGAAG GTGCCGAATA TTTAAGAGCG GTGCTGCGCG CGCCGGTCTA CGAAGCGGCG CAGGTCACGC CGCTACAGAA AATGGAAAAA CTGTCGTCGC GTCTTGATAA CGTGATTCTG GTGAAGCGCG AAGATCGCCA GCCAGTACAC AGCTTTAAGC TGCGCGGCGC ATACGCCATG ATGGCGGGCC TGACGGAAGA ACAAAAAGCG CACGGCGTGA TCACCGCTTC TGCGGGTAAC CACGCGCAGG GCGTCGCGTT TTCTTCCGCA CGGTTAGGCG TGAAGGCGCT GATCGTCATG CCAACCGCCA CCGCCGATAT CAAAGTTGAT GCGGTGCGCG GCTTTGGCGG CGAAGTGCTG CTTCATGGCG CGAACTTTGA CGAAGCGAAA GCGAAAGCAA TCGAACTGTC ACAGCAGCAG GGCTTCACCT GGGTGCCGCC GTTCGACCAT CCGATGGTGA TCGCCGGGCA AGGCACGCTG GCGCTGGAAC TGCTCCAGCA GGACGCCCAT CTCGACCGCG TATTTGTACC GGTCGGCGGC GGCGGTCTGG CGGCGGGCGT CGCGGTACTG ATCAAACAAC TGATGCCGCA AATCAAAGTG ATCGCCGTGG AAGCGGAAGA TTCCGCCTGC CTGAAAGCGG CGCTGAATGC AGGTCATCCG GTTGATCTGC CACGCGTGGG GCTGTTTGCC GAAGGTGTTG CGGTGAAACG CATCGGCGAC GAAACCTTCC GTTTATGCCA GGAGTATCTC GACGACATCA TCACCGTCGA TAGCGATGCC ATCTGTGCGG CGATGAAAGA TCTGTTCGAA GATGTGCGCG CGGTGGCGGA ACCCTCTGGC GCGCTGGCAC TGGCGGGAAT GAAAAAATAT ATCGCCCAGC ACAACATTCG CGGTGAGCGG CTGGCGCATA TTCTTTCCGG TGCTAACGTG AACTTTCACG GTCTGCGCTA CGTCTCGGAA CGCTGCGAAC TGGGCGAACA GCGTGAAGCG TTACTGGCGG TGACCATTCC GGAAGAAAAA GGCAGCTTCC TCAAATTCTG CCAACTGCTT GGCGGGCGTT CGGTCACCGA GTTCAACTAC CGTTTTGCCG ATGCCAAAAA CGCCTGCATC TTTGTCGGCG TGCGCTTAAG CCGTGGCCTC GAAGAGCGCA AAGAAATTTT GCAGATGCTC AACGACGGTG GCTACAGCGT GGTTGATCTC TCCGACGACG AAATGGCGAA GCTGCATGTG CGCTATATGG TTGGCGGGCG TCCATCGCAT CCGTTGCAGG AACGCCTCTA CAGTTTCGAA TTCCCGGAAT CACCGGGCGC GCTGCTGCGC TTCCTCAACA CGCTGGGTAC GCACTGGAAC ATTTCTTTGT TCCATTATCG CAGCCACGGT ACCGACTATG GGCGCGTACT GGCGGCGTTC GAGCTTGGCG ATCATGAACC GGATTTTGAA ACCCGGCTGA ATGAACTGGG TTACGATTGC CACGACGAAA CCAATAACCC GGCGTTCAGG TTCTTTTTGG CGGGTTAG
Protein sequence | MMADSQPLSG APEGAEYLRA VLRAPVYEAA QVTPLQKMEK LSSRLDNVIL VKREDRQPVH SFKLRGAYAM MAGLTEEQKA HGVITASAGN HAQGVAFSSA RLGVKALIVM PTATADIKVD AVRGFGGEVL LHGANFDEAK AKAIELSQQQ GFTWVPPFDH PMVIAGQGTL ALELLQQDAH LDRVFVPVGG GGLAAGVAVL IKQLMPQIKV IAVEAEDSAC LKAALNAGHP VDLPRVGLFA EGVAVKRIGD ETFRLCQEYL DDIITVDSDA ICAAMKDLFE DVRAVAEPSG ALALAGMKKY IAQHNIRGER LAHILSGANV NFHGLRYVSE RCELGEQREA LLAVTIPEEK GSFLKFCQLL GGRSVTEFNY RFADAKNACI FVGVRLSRGL EERKEILQML NDGGYSVVDL SDDEMAKLHV RYMVGGRPSH PLQERLYSFE FPESPGALLR FLNTLGTHWN ISLFHYRSHG TDYGRVLAAF ELGDHEPDFE TRLNELGYDC HDETNNPAFR FFLAG
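
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the method used to produce this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in its permitted start codons). It also assumes the sequence contains only A, C, G and T. The names `translate` and `gc_percent` are hypothetical helpers.

```python
# Codon table in NCBI order: each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises ValueError if a base other than A, C, G or T is encountered.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate("ATGATGGCAG ACTCACAACC CCTGTCCGGT"))  # prints: MMADSQPLSG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence above, and `gc_percent` can be compared against the GC content field after rounding.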