Gene EcDH1_4204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4204 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4557595 
End bp4559139 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content59% 
IMG OID 
Productthreonine dehydratase, biosynthetic 
Protein accessionACX41804 
Protein GI260451382 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.806906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACT CGCAACCCCT GTCCGGTGCT CCGGAAGGTG CCGAATATTT AAGAGCAGTG 
CTGCGCGCGC CGGTTTACGA GGCGGCGCAG GTTACGCCGC TACAAAAAAT GGAAAAACTG
TCGTCGCGTC TTGATAACGT CATTCTGGTG AAGCGCGAAG ATCGCCAGCC AGTGCACAGC
TTTAAGCTGC GCGGCGCATA CGCCATGATG GCGGGCCTGA CGGAAGAACA GAAAGCGCAC
GGCGTGATCA CTGCTTCTGC GGGTAACCAC GCGCAGGGCG TCGCGTTTTC TTCTGCGCGG
TTAGGCGTGA AGGCCCTGAT CGTTATGCCA ACCGCCACCG CCGACATCAA AGTCGACGCG
GTGCGCGGCT TCGGCGGCGA AGTGCTGCTC CACGGCGCGA ACTTTGATGA AGCGAAAGCC
AAAGCGATCG AACTGTCACA GCAGCAGGGG TTCACCTGGG TGCCGCCGTT CGACCATCCG
ATGGTGATTG CCGGGCAAGG CACGCTGGCG CTGGAACTGC TCCAGCAGGA CGCCCATCTC
GACCGCGTAT TTGTGCCAGT CGGCGGCGGC GGTCTGGCTG CTGGCGTGGC GGTGCTGATC
AAACAACTGA TGCCGCAAAT CAAAGTGATC GCCGTAGAAG CGGAAGACTC CGCCTGCCTG
AAAGCAGCGC TGAATGCGGG TCATCCGGTT GATCTGCCGC GCGTAGGGCT ATTTGCTGAA
GGCGTAGCGG TAAAACGCAT CGGTGACGAA ACCTTCCGTT TATGCCAGGA GTATCTCGAC
GACATCATCA CCGTCGATAG CGATGCGATC TGTGCGGCGA TGAAGGATTT ATTCGAAGAT
GTGCGCGCGG TGGCGGAACC CTCTGGCGCG CTGGCGCTGG CGGGAATGAA AAAATATATC
GCCCTGCACA ACATTCGCGG CGAACGGCTG GCGCATATTC TTTCCGGTGC CAACGTGAAC
TTCCACGGCC TGCGCTACGT CTCAGAACGC TGCGAACTGG GCGAACAGCG TGAAGCGTTG
TTGGCGGTGA CCATTCCGGA AGAAAAAGGC AGCTTCCTCA AATTCTGCCA ACTGCTTGGC
GGGCGTTCGG TCACCGAGTT CAACTACCGT TTTGCCGATG CCAAAAACGC CTGCATCTTT
GTCGGTGTGC GCCTGAGCCG CGGCCTCGAA GAGCGCAAAG AAATTTTGCA GATGCTCAAC
GACGGCGGCT ACAGCGTGGT TGATCTCTCC GACGACGAAA TGGCGAAGCT ACACGTGCGC
TATATGGTCG GCGGACGTCC ATCGCATCCG TTGCAGGAAC GCCTCTACAG CTTCGAATTC
CCGGAATCAC CGGGCGCGCT GCTGCGCTTC CTCAACACGC TGGGTACGTA CTGGAACATT
TCTTTGTTCC ACTATCGCAG CCATGGCACC GACTACGGGC GCGTACTGGC GGCGTTCGAA
CTTGGCGACC ATGAACCGGA TTTCGAAACC CGGCTGAATG AGCTGGGCTA CGATTGCCAC
GACGAAACCA ATAACCCGGC GTTCAGGTTC TTTTTGGCGG GTTAG
 
Protein sequence
MADSQPLSGA PEGAEYLRAV LRAPVYEAAQ VTPLQKMEKL SSRLDNVILV KREDRQPVHS 
FKLRGAYAMM AGLTEEQKAH GVITASAGNH AQGVAFSSAR LGVKALIVMP TATADIKVDA
VRGFGGEVLL HGANFDEAKA KAIELSQQQG FTWVPPFDHP MVIAGQGTLA LELLQQDAHL
DRVFVPVGGG GLAAGVAVLI KQLMPQIKVI AVEAEDSACL KAALNAGHPV DLPRVGLFAE
GVAVKRIGDE TFRLCQEYLD DIITVDSDAI CAAMKDLFED VRAVAEPSGA LALAGMKKYI
ALHNIRGERL AHILSGANVN FHGLRYVSER CELGEQREAL LAVTIPEEKG SFLKFCQLLG
GRSVTEFNYR FADAKNACIF VGVRLSRGLE ERKEILQMLN DGGYSVVDLS DDEMAKLHVR
YMVGGRPSHP LQERLYSFEF PESPGALLRF LNTLGTYWNI SLFHYRSHGT DYGRVLAAFE
LGDHEPDFET RLNELGYDCH DETNNPAFRF FLAG