Gene EcHS_A2602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2602 
SymboldapE 
ID5590991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2613514 
End bp2614641 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content53% 
IMG OID640921723 
Productsuccinyl-diaminopimelate desuccinylase 
Protein accessionYP_001459250 
Protein GI157161932 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01246] succinyl-diaminopimelate desuccinylase, proteobacterial clade 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTGCC CGGTTATTGA GCTGACACAA CAGCTTATTC GCCGCCCTTC CCTGAGTCCT 
GATGACGCAG GATGTCAGGC TTTGTTGATT GAACGTTTGC AGGCGATCGG TTTTACCGTT
GAACGCATGG ATTTTGCCGA TACGCAGAAT TTTTGGGCAT GGCGTGGGCA AGGTGAAACG
TTAGCCTTTG CCGGGCATAC CGACGTGGTG CCGCCTGGCG ACGCCGATCG TTGGATCAAT
CCGCCGTTTG AACCCACCAT TCGTGATGGC ATGTTATTCG GGCGCGGTGC GGCAGATATG
AAAGGCTCGC TGGCGGCGAT GGTGGTAGCG GCAGAACGCT TTGTCGCACA ACATCCCAAC
CATACGGGGC GACTGGCATT TCTGATCACC TCTGATGAAG AAGCCAGTGC CCACAACGGT
ACGGTAAAAG TCGTCGAAGC GTTGATGGCA CGTAATGAGC GTCTCGATTA CTGCCTGGTC
GGCGAACCGT CGAGTATCGA AGTGGTTGGT GATGTGGTGA AAAATGGTCG TCGCGGATCG
TTAACCTGCA ACCTTACCAT TCATGGCGTT CAGGGGCATG TGGCCTACCC GCATCTGGCT
GACAATCCAG TACATCGCGC AGCACCTTTC CTTAATGAAT TAGTGGCTAT TGAGTGGGAT
CAGGGCAATG AATTTTTCCC GGCGACCAGT ATGCAGATTG CTAATATTCA GGCGGGAACG
GGCAGTAACA ATGTTATTCC GGGTGAACTG TTTGTGCAGT TTAACTTCCG CTTCAGCACC
GAACTGACTG ATGAGATGAT CAAAGCGCAG GTGCTTGCCC TGCTTGAAAA ACATCAACTG
CGCTATACGG TGGATTGGTG GCTTTCCGGG CAGCCATTTT TGACCGCGCG CGGTAAACTG
GTGGATGCGG TCGTTAACGC GGTTGAGCAC TATAATGAAA TTAAACCGCA GCTACTGACC
ACAGGCGGAA CGTCCGACGG GCGCTTTATT GCTCGCATGG GGGCGCAGGT GGTGGAACTC
GGGCCGGTCA ATGCCACTAT TCATAAAATT AATGAATGTG TGAACGCTGC CGACCTGCAG
CTACTTGCCC GTATGTATCA ACGTATCATG GAACAGCTCG TCGCCTGA
 
Protein sequence
MSCPVIELTQ QLIRRPSLSP DDAGCQALLI ERLQAIGFTV ERMDFADTQN FWAWRGQGET 
LAFAGHTDVV PPGDADRWIN PPFEPTIRDG MLFGRGAADM KGSLAAMVVA AERFVAQHPN
HTGRLAFLIT SDEEASAHNG TVKVVEALMA RNERLDYCLV GEPSSIEVVG DVVKNGRRGS
LTCNLTIHGV QGHVAYPHLA DNPVHRAAPF LNELVAIEWD QGNEFFPATS MQIANIQAGT
GSNNVIPGEL FVQFNFRFST ELTDEMIKAQ VLALLEKHQL RYTVDWWLSG QPFLTARGKL
VDAVVNAVEH YNEIKPQLLT TGGTSDGRFI ARMGAQVVEL GPVNATIHKI NECVNAADLQ
LLARMYQRIM EQLVA