Gene ECH74115_3693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3693 
SymboldapE 
ID6967106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3411848 
End bp3412975 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content53% 
IMG OID643387487 
Productsuccinyl-diaminopimelate desuccinylase 
Protein accessionYP_002271940 
Protein GI209396481 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01246] succinyl-diaminopimelate desuccinylase, proteobacterial clade 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTGCC CGGTTATTGA GCTGACACAA CAGCTTATTC GCCGCCCTTC CCTGAGTCCT 
GATGACGCAG GATGCCAGGC TTTGTTGATT GAACGTTTGC AGGCGATCGG TTTTACCGTT
GAACGCATGG ACTTTGCCGA TACGCAGAAT TTTTGGGCAT GGCGTGGGCA AGGTGAAACA
TTGGCCTTTG CCGGGCATAC CGACGTGGTG CCGCCTGGCG ACGCCGATCG TTGGATCAAT
CCGCCGTTTG AACCAACCAT TCGTGACGGC ATGTTATTCG GGCGCGGTGC GGCAGATATG
AAAGGCTCGC TGGCGGCGAT GGTGGTAGCT GCAGAACGTT TTGTCGCACA ACATCCCAAC
CATACAGGGC GACTGGCATT TCTGATCACC TCTGATGAAG AAGCCAGTGC CCACAATGGT
ACGGTAAAAG CCGTCGAAGC GTTAATGGCA CGTAATGAGC GTCTCGATTA CTGCCTGGTC
GGCGAACCGT CGAGTATCGA AGTGGTAGGT GATGTGGTGA AAAATGGTCG TCGTGGATCG
TTAACCTGCA ACCTAACCAT TCATGGCGTT CAGGGGCATG TTGCCTACCC ACATCTGGCT
GACAATCCGG TACATCGCGC AGCACCTTTC CTTAATGAAT TAGTGGCTAT TGAGTGGGAT
CAGGGCAATG AATTCTTCCC GGCGACCAGT ATGCAGATTG CCAATATTCA GGCGGGAACG
GGCAGTAACA ACGTTATTCC GGGTGAACTG TTTGTGCAGT TTAACTTCCG CTTCAGCACC
GAACTGACTG ATGAGATGAT CAAAGCGCAG GTGCTTGCCC TGCTTGAAAA ACATCAACTG
CGCTATACGG TGGATTGGTG GCTTTCCGGG CAGCCATTTT TGACCGCGCG CGGTAAACTG
GTGGATGCGG TCGTTAACGC GGTTGAGCAC TATAATGAAA TTAAACCGCA GCTACTGACC
ACAGGCGGAA CGTCCGACGG GCGCTTTATT GCCCGCATGG GGGCGCAGGT GGTGGAACTC
GGGCCGGTCA ATGCCACTAT TCATAAAATT AATGAATGTG TGAACGCTGC CGACCTGCAG
CTACTTGCCC GTATGTATCA ACGTATCATG GAACAGCTCG TCGCGTGA
 
Protein sequence
MSCPVIELTQ QLIRRPSLSP DDAGCQALLI ERLQAIGFTV ERMDFADTQN FWAWRGQGET 
LAFAGHTDVV PPGDADRWIN PPFEPTIRDG MLFGRGAADM KGSLAAMVVA AERFVAQHPN
HTGRLAFLIT SDEEASAHNG TVKAVEALMA RNERLDYCLV GEPSSIEVVG DVVKNGRRGS
LTCNLTIHGV QGHVAYPHLA DNPVHRAAPF LNELVAIEWD QGNEFFPATS MQIANIQAGT
GSNNVIPGEL FVQFNFRFST ELTDEMIKAQ VLALLEKHQL RYTVDWWLSG QPFLTARGKL
VDAVVNAVEH YNEIKPQLLT TGGTSDGRFI ARMGAQVVEL GPVNATIHKI NECVNAADLQ
LLARMYQRIM EQLVA