Gene EcDH1_3592 details


Gene Information

Locus tag: EcDH1_3592
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3866357
End bp: 3867643
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: threonine synthase
Protein accession: ACX41205
Protein GI: 260450783
COG category:
COG ID:
TIGRFAM ID:
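
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length_bp(3866357, 3867643)
print(length, expected_protein_length_aa(length))  # prints "1287 428"
```

Both results match the Gene Length (1287 bp) and Protein Length (428 aa) fields of this record.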


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTCT ACAATCTGAA AGATCACAAC GAGCAGGTCA GCTTTGCGCA AGCCGTAACC 
CAGGGGTTGG GCAAAAATCA GGGGCTGTTT TTTCCGCACG ACCTGCCGGA ATTCAGCCTG
ACTGAAATTG ATGAGATGCT GAAGCTGGAT TTTGTCACCC GCAGTGCGAA GATCCTCTCG
GCGTTTATTG GTGATGAAAT CCCACAGGAA ATCCTGGAAG AGCGCGTGCG CGCGGCGTTT
GCCTTCCCGG CTCCGGTCGC CAATGTTGAA AGCGATGTCG GTTGTCTGGA ATTGTTCCAC
GGGCCAACGC TGGCATTTAA AGATTTCGGC GGTCGCTTTA TGGCACAAAT GCTGACCCAT
ATTGCGGGTG ATAAGCCAGT GACCATTCTG ACCGCGACCT CCGGTGATAC CGGAGCGGCA
GTGGCTCATG CTTTCTACGG TTTACCGAAT GTGAAAGTGG TTATCCTCTA TCCACGAGGC
AAAATCAGTC CACTGCAAGA AAAACTGTTC TGTACATTGG GCGGCAATAT CGAAACTGTT
GCCATCGACG GCGATTTCGA TGCCTGTCAG GCGCTGGTGA AGCAGGCGTT TGATGATGAA
GAACTGAAAG TGGCGCTAGG GTTAAACTCG GCTAACTCGA TTAACATCAG CCGTTTGCTG
GCGCAGATTT GCTACTACTT TGAAGCTGTT GCGCAGCTGC CGCAGGAGAC GCGCAACCAG
CTGGTTGTCT CGGTGCCAAG CGGAAACTTC GGCGATTTGA CGGCGGGTCT GCTGGCGAAG
TCACTCGGTC TGCCGGTGAA ACGTTTTATT GCTGCGACCA ACGTGAACGA TACCGTGCCA
CGTTTCCTGC ACGACGGTCA GTGGTCACCC AAAGCGACTC AGGCGACGTT ATCCAACGCG
ATGGACGTGA GTCAGCCGAA CAACTGGCCG CGTGTGGAAG AGTTGTTCCG CCGCAAAATC
TGGCAACTGA AAGAGCTGGG TTATGCAGCC GTGGATGATG AAACCACGCA ACAGACAATG
CGTGAGTTAA AAGAACTGGG CTACACTTCG GAGCCGCACG CTGCCGTAGC TTATCGTGCG
CTGCGTGATC AGTTGAATCC AGGCGAATAT GGCTTGTTCC TCGGCACCGC GCATCCGGCG
AAATTTAAAG AGAGCGTGGA AGCGATTCTC GGTGAAACGT TGGATCTGCC AAAAGAGCTG
GCAGAACGTG CTGATTTACC CTTGCTTTCA CATAATCTGC CCGCCGATTT TGCTGCGTTG
CGTAAATTGA TGATGAATCA TCAGTAA
 
Protein sequence
MKLYNLKDHN EQVSFAQAVT QGLGKNQGLF FPHDLPEFSL TEIDEMLKLD FVTRSAKILS 
AFIGDEIPQE ILEERVRAAF AFPAPVANVE SDVGCLELFH GPTLAFKDFG GRFMAQMLTH
IAGDKPVTIL TATSGDTGAA VAHAFYGLPN VKVVILYPRG KISPLQEKLF CTLGGNIETV
AIDGDFDACQ ALVKQAFDDE ELKVALGLNS ANSINISRLL AQICYYFEAV AQLPQETRNQ
LVVSVPSGNF GDLTAGLLAK SLGLPVKRFI AATNVNDTVP RFLHDGQWSP KATQATLSNA
MDVSQPNNWP RVEELFRRKI WQLKELGYAA VDDETTQQTM RELKELGYTS EPHAAVAYRA
LRDQLNPGEY GLFLGTAHPA KFKESVEAIL GETLDLPKEL AERADLPLLS HNLPADFAAL
RKLMMNHQ
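
The two sequence listings can be checked against each other, and against the GC content field, with a small script. The following is a minimal sketch, not the method used to produce this record. It assumes the sequences are pasted as plain text with the space-separated 10-character blocks shown above. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed in this record.
first_blocks = "ATGAAACTCT ACAATCTGAA AGATCACAAC"
print(translate(first_blocks))  # prints "MKLYNLKDHN"
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.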