Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_4230 |
Symbol | |
ID | 6067843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4673109 |
End bp | 4674656 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641603661 |
Product | threonine dehydratase |
Protein accession | YP_001727153 |
Protein GI | 170022199 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01124] threonine ammonia-lyase, biosynthetic, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCTG ACTCGCAACC CCTGTCCGGC ACCCCGGAAG GTGCCGAATA TTTAAGAGCG GTGCTACGCG CGCCGGTCTA TGAAGCGGCG CAGGTTACGC CGCTACAGAA AATGGAAAAA CTGTCGTCGC GTCTTGATAA CGTGATTCTG GTGAAGCGCG AAGATCGCCA GCCGGTGCAC AGCTTTAAGC TGCGCGGTGC ATACGCCATG ATGGCGGGCC TGACGGAAGA ACAAAAAGCG CACGGCGTGA TCACCGCTTC TGCGGGTAAC CACGCGCAGG GCGTCGCGTT TTCTTCCGCA CGGTTAGGCG TGAAGGCACT GATCGTCATG CCAACCGCCA CCGCCGATAT CAAAGTTGAT GCGGTGCGCG GCTTCGGCGG CGAAGTGCTG CTCCACGGTG CGAACTTTGA TGAAGCGAAA GCCAAAGCGA TCGAACTGTC ACAGCAGCAG GGGTTCACCT GGGTGCCGCC GTTCGACCAT CCGATGGTGA TTGCCGGGCA AGGCACGCTG GCGCTGGAAC TGCTCCAGCA GGACGCCCAT CTCGACCGCG TATTTGTGCC AGTCGGCGGC GGCGGTCTGG CTGCTGGCGT GGCGGTGCTG ATCAAACAAC TGATGCCGCA AATCAAAGTG ATCGCCGTAG AAGCGGAAGA CTCCGCCTGC CTGAAAGCAG CGCTGGATGC GGGTCATCCG GTTGATCTGC CGCGCGTAGG GCTATTTGCT GAAGGCGTAG CGGTAAAACG CATCGGTGAC GAAACCTTCC GTTTATGCCA GGAGTATCTC GACGACATCA TCACCGTCGA TAGCGATGCG ATCTGTGCGG CGATGAAGGA TTTATTCGAA GATGTGCGCG CGGTGGCGGA ACCCTCTGGC GCGCTGGCGC TGGCGGGAAT GAAAAAATAT ATCGCCCTGC ACAACATTCG CGGCGAACGG CTGGCGCATA TTCTTTCCGG TGCCAACGTG AACTTCCACG GCCTGCGCTA CGTCTCAGAA CGCTGCGAAC TGGGCGAACA GCGTGAAGCG TTGTTGGCGG TGACCATTCC GGAAGAAAAA GGCAGCTTCC TCAAATTCTG CCAACTGCTT GGCGGGCGTT CGGTCACCGA GTTCAACTAC CGTTTTGCCG ATGCCAAAAA CGCCTGCATC TTTGTCGGTG TGCGCCTGAG CCGCGGCCTC GAAGAGCGCA AAGAAATTTT GCAGATGCTC AACGACGGCG GCTACAGCGT GGTTGATCTC TCCGACGACG AAATGGCGAA GCTACACGTG CGCTATATGG TCGGCGGACG TCCATCGCAT CCGTTGCAGG AACGCCTCTA CAGCTTCGAA TTCCCGGAAT CACCGGGCGC GCTGCTGCGC TTCCTCAACA CGCTGGGTAC GTACTGGAAC ATTTCTTTGT TCCACTATCG CAGCCATGGC ACCGACTACG GGCGCGTACT GGCGGCGTTC GAACTTGGCG ACCATGAACC GGATTTCGAA ACCCGGCTGA ATGAGCTGGG CTACGATTGC CACGACGAAA CCAATAACCC GGCGTTCAGG TTCTTTTTGG CGGGTTAG
|
Protein sequence | MMADSQPLSG TPEGAEYLRA VLRAPVYEAA QVTPLQKMEK LSSRLDNVIL VKREDRQPVH SFKLRGAYAM MAGLTEEQKA HGVITASAGN HAQGVAFSSA RLGVKALIVM PTATADIKVD AVRGFGGEVL LHGANFDEAK AKAIELSQQQ GFTWVPPFDH PMVIAGQGTL ALELLQQDAH LDRVFVPVGG GGLAAGVAVL IKQLMPQIKV IAVEAEDSAC LKAALDAGHP VDLPRVGLFA EGVAVKRIGD ETFRLCQEYL DDIITVDSDA ICAAMKDLFE DVRAVAEPSG ALALAGMKKY IALHNIRGER LAHILSGANV NFHGLRYVSE RCELGEQREA LLAVTIPEEK GSFLKFCQLL GGRSVTEFNY RFADAKNACI FVGVRLSRGL EERKEILQML NDGGYSVVDL SDDEMAKLHV RYMVGGRPSH PLQERLYSFE FPESPGALLR FLNTLGTYWN ISLFHYRSHG TDYGRVLAAF ELGDHEPDFE TRLNELGYDC HDETNNPAFR FFLAG
|
| |