Gene ECH74115_4107 details

Gene Information

Locus tag: ECH74115_4107
Symbol: lysA
ID: 6969884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand
Start bp: 3804435
End bp: 3805697
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 53%
IMG OID: 643387862
Product: diaminopimelate decarboxylase
Protein accession: YP_002272302
Protein GI: 209399305
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID: [TIGR01048] diaminopimelate decarboxylase
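
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3804435
END_BP = 3805697


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1263
print(protein_length(length_bp))  # 420
```

Both results agree with the reported Gene Length (1263 bp) and Protein Length (420 aa).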


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.475873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.000027959
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCACATT CACTGTTCAG CACCGATACC GATCTCACCG CCGAAAATCT GCTGCGTTTG 
CCCGCTGAAT TTGGCTGCCC GGTGTGGGTC TACGATGCGC AAATTATTCG TCGGCAGATT
GCAGCGCTGA AACAGTTTGA TGTGGTGCGC TTTGCACAGA AAGCCTGTTC CAATATTCAT
ATTTTGCGCT TAATGCGTGA GCAGGGCGTG AAAGTGGATT CCGTCTCGTT AGGCGAAATA
GAGCGTGCGT TGGCGGCGGG TTACAATCCG CAAACGCACC CCGATGATAT TGTTTTTACG
GCAGATGTTA TCGATCAGGC GACGCTTGAA CGCGTCAGTG AATTGCAAAT TCCGGTGAAT
GCGGGTTCTG TTGATATGTT CGACCAACTG GGCCAGGTTT CGCCAGGGCA TCGGGTATGG
CTGCGCGTTA ATCCGGGGTT TGGTCACGGA CATAGCCAAA AAACCAATAC CGGTGGCGAA
AACAGCAAGC ACGGTATCTG GTACACCGAT CTGCCCGCCG CACTGGACGT GATACAACGT
CATCATCTGC AGCTGGTCGG CATTCACATG CACATTGGTT CTGGCGTTGA TTATGCCCAT
CTGGAACAGG TGTGTGGTGC TATGGTGCGT CAGGTCATCG AATTCGGTCA GGATTTACAG
GCTATTTCTG CGGGCGGTGG GCTTTCTATT CCTTATCAAC AGGGTGAAGA GGCGGTTGAT
ACCGAACATT ATTATGGTCT GTGGAATGCC GCGCGTGAGC AAATCGCCCG CCATTTGGGC
CACCCTGTGA AACTGGAAAT TGAACCGGGT CGCTTCCTGG TAGCGCAGTC TGGCGTGTTA
ATTACTCAGG TGCGGAGCGT CAAACAAATG GGTAGCCGCC ACTTTGTGCT GGTTGATGCC
GGGTTTAACG ATCTGATGCG CCCGGCAATG TACGGTAGTT ACCACCATAT CAGTGCCCTG
GCAGCTGATG GTCGTTCTCT GGAACACGCA CCAACGGTGG AAACCGTCGT CGCCGGACCG
TTATGTGAAT CGGGCGATGT CTTTACCCAG CAGGAAGGGG GAAATGTTGA ACCCCGCGCC
TTGCCGGAAG TGAAGGCAGG TGATTATCTG GTACTGCATG ATACAGGGGC ATATGGCGCA
TCAATGTCAT CCAACTACAA TAGCCGTCCG CTGTTACCAG AAGTTCTGTT TGATAATGGT
CAGGCGCGGT TGATTCGCCG TCGCCAGACC ATCGAAGAAT TACTGGCGCT GGAATTGCTT
TAA
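
The sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean` and `gc_fraction` are hypothetical. Only the first line of the gene sequence is used as input here, so the printed value describes that 60-base line and not the whole gene, for which the record reports 53%.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence in this record.
first_line = "ATGCCACATT CACTGTTCAG CACCGATACC GATCTCACCG CCGAAAATCT GCTGCGTTTG"
seq = clean(first_line)
print(len(seq))                          # 60
print(round(gc_fraction(seq) * 100, 1))  # 51.7
```

Passing the full gene sequence through the same two functions would give the gene-level GC content.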
 
Protein sequence
MPHSLFSTDT DLTAENLLRL PAEFGCPVWV YDAQIIRRQI AALKQFDVVR FAQKACSNIH 
ILRLMREQGV KVDSVSLGEI ERALAAGYNP QTHPDDIVFT ADVIDQATLE RVSELQIPVN
AGSVDMFDQL GQVSPGHRVW LRVNPGFGHG HSQKTNTGGE NSKHGIWYTD LPAALDVIQR
HHLQLVGIHM HIGSGVDYAH LEQVCGAMVR QVIEFGQDLQ AISAGGGLSI PYQQGEEAVD
TEHYYGLWNA AREQIARHLG HPVKLEIEPG RFLVAQSGVL ITQVRSVKQM GSRHFVLVDA
GFNDLMRPAM YGSYHHISAL AADGRSLEHA PTVETVVAGP LCESGDVFTQ QEGGNVEPRA
LPEVKAGDYL VLHDTGAYGA SMSSNYNSRP LLPEVLFDNG QARLIRRRQT IEELLALELL
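
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on an external library; it assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons). The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position), paired with their
# amino acids. Assumption: table 11 shares these assignments with the
# standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq_text: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCCACATT CACTGTTCAG CACCGATACC"))  # MPHSLFSTDT
```

The output matches the first ten residues of the protein sequence listed above; the same function applied to the full 1263 bp gene sequence should yield the 420-residue protein.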