Gene EcolC_4230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4230 
Symbol 
ID6067843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4673109 
End bp4674656 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content59% 
IMG OID641603661 
Productthreonine dehydratase 
Protein accessionYP_001727153 
Protein GI170022199 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01124] threonine ammonia-lyase, biosynthetic, long form 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCTG ACTCGCAACC CCTGTCCGGC ACCCCGGAAG GTGCCGAATA TTTAAGAGCG 
GTGCTACGCG CGCCGGTCTA TGAAGCGGCG CAGGTTACGC CGCTACAGAA AATGGAAAAA
CTGTCGTCGC GTCTTGATAA CGTGATTCTG GTGAAGCGCG AAGATCGCCA GCCGGTGCAC
AGCTTTAAGC TGCGCGGTGC ATACGCCATG ATGGCGGGCC TGACGGAAGA ACAAAAAGCG
CACGGCGTGA TCACCGCTTC TGCGGGTAAC CACGCGCAGG GCGTCGCGTT TTCTTCCGCA
CGGTTAGGCG TGAAGGCACT GATCGTCATG CCAACCGCCA CCGCCGATAT CAAAGTTGAT
GCGGTGCGCG GCTTCGGCGG CGAAGTGCTG CTCCACGGTG CGAACTTTGA TGAAGCGAAA
GCCAAAGCGA TCGAACTGTC ACAGCAGCAG GGGTTCACCT GGGTGCCGCC GTTCGACCAT
CCGATGGTGA TTGCCGGGCA AGGCACGCTG GCGCTGGAAC TGCTCCAGCA GGACGCCCAT
CTCGACCGCG TATTTGTGCC AGTCGGCGGC GGCGGTCTGG CTGCTGGCGT GGCGGTGCTG
ATCAAACAAC TGATGCCGCA AATCAAAGTG ATCGCCGTAG AAGCGGAAGA CTCCGCCTGC
CTGAAAGCAG CGCTGGATGC GGGTCATCCG GTTGATCTGC CGCGCGTAGG GCTATTTGCT
GAAGGCGTAG CGGTAAAACG CATCGGTGAC GAAACCTTCC GTTTATGCCA GGAGTATCTC
GACGACATCA TCACCGTCGA TAGCGATGCG ATCTGTGCGG CGATGAAGGA TTTATTCGAA
GATGTGCGCG CGGTGGCGGA ACCCTCTGGC GCGCTGGCGC TGGCGGGAAT GAAAAAATAT
ATCGCCCTGC ACAACATTCG CGGCGAACGG CTGGCGCATA TTCTTTCCGG TGCCAACGTG
AACTTCCACG GCCTGCGCTA CGTCTCAGAA CGCTGCGAAC TGGGCGAACA GCGTGAAGCG
TTGTTGGCGG TGACCATTCC GGAAGAAAAA GGCAGCTTCC TCAAATTCTG CCAACTGCTT
GGCGGGCGTT CGGTCACCGA GTTCAACTAC CGTTTTGCCG ATGCCAAAAA CGCCTGCATC
TTTGTCGGTG TGCGCCTGAG CCGCGGCCTC GAAGAGCGCA AAGAAATTTT GCAGATGCTC
AACGACGGCG GCTACAGCGT GGTTGATCTC TCCGACGACG AAATGGCGAA GCTACACGTG
CGCTATATGG TCGGCGGACG TCCATCGCAT CCGTTGCAGG AACGCCTCTA CAGCTTCGAA
TTCCCGGAAT CACCGGGCGC GCTGCTGCGC TTCCTCAACA CGCTGGGTAC GTACTGGAAC
ATTTCTTTGT TCCACTATCG CAGCCATGGC ACCGACTACG GGCGCGTACT GGCGGCGTTC
GAACTTGGCG ACCATGAACC GGATTTCGAA ACCCGGCTGA ATGAGCTGGG CTACGATTGC
CACGACGAAA CCAATAACCC GGCGTTCAGG TTCTTTTTGG CGGGTTAG
 
Protein sequence
MMADSQPLSG TPEGAEYLRA VLRAPVYEAA QVTPLQKMEK LSSRLDNVIL VKREDRQPVH 
SFKLRGAYAM MAGLTEEQKA HGVITASAGN HAQGVAFSSA RLGVKALIVM PTATADIKVD
AVRGFGGEVL LHGANFDEAK AKAIELSQQQ GFTWVPPFDH PMVIAGQGTL ALELLQQDAH
LDRVFVPVGG GGLAAGVAVL IKQLMPQIKV IAVEAEDSAC LKAALDAGHP VDLPRVGLFA
EGVAVKRIGD ETFRLCQEYL DDIITVDSDA ICAAMKDLFE DVRAVAEPSG ALALAGMKKY
IALHNIRGER LAHILSGANV NFHGLRYVSE RCELGEQREA LLAVTIPEEK GSFLKFCQLL
GGRSVTEFNY RFADAKNACI FVGVRLSRGL EERKEILQML NDGGYSVVDL SDDEMAKLHV
RYMVGGRPSH PLQERLYSFE FPESPGALLR FLNTLGTYWN ISLFHYRSHG TDYGRVLAAF
ELGDHEPDFE TRLNELGYDC HDETNNPAFR FFLAG