Gene ECH74115_3616 details


Gene Information

Locus tag: ECH74115_3616
Symbol:
ID: 6967379
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3336760
End bp: 3337797
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 58%
IMG OID: 643387411
Product: exoaminopeptidase
Protein accession: YP_002271870
Protein GI: 209395774
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
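
The length fields in this record follow from its coordinates. Assuming the start and end positions are 1-based and inclusive (an assumption about the record's convention, though it is consistent with the reported values), the gene length is End - Start + 1, and the protein length is the number of codons minus the stop codon. A minimal Python sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3336760, 3337797)
print(length_bp)                  # 1038
print(protein_length(length_bp))  # 345
```

Both results agree with the Gene Length and Protein Length fields above.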


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.874424
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTAT CGCTATTGAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG 
GAAGTGCGGC AGATCCTGCT GGAAGAAGCG GATCGCCTGC AAAAAGAAGT GCGATTTGAT
GGGCTGGGAT CGGTGCTGAT CCGCCTCAAT GAATCGACAG GTCCGAAGGT GATGATCTGT
GCGCATATGG ACGAAGTGGG ATTTATGGTG CGCAGCATCT CCCGCGAAGG GGCGATTGAT
GTGCTGCCGG TTGGCAATGT ACGCATGGCT GCCCGCCAGC TGCAGCCGGT GCGCATCACC
ACCCGTGAAG AGTGCAAAAT TCCAGGCCTG CTTGACGGCG ACCGGCAGGG GAATGATGTC
AGCGCCATGC GCGTGGATAT TGGCGCGCGC TCGTATGACG AAGTGATGCA GGCGGGAATT
CGTCCAGGCG ATCGCGTCAC GTTCGATACC ACTTTTCAGG TTCTCCCCCA CCAGCGGGTG
ATGGGGAAAG CCTTTGATGA CCGCCTCGGT TGCTACCTGC TGGTGACGTT ACTGCGCGAA
CTACACAGCG CTGAACTGCC TGCGGAAGTG TGGCTGGTGG CCAGTTCCAG CGAAGAGGTG
GGGTTACGCG GCGGGCAAAC TGCCACCCGC GCGGTGTCGC CGGACGTCGC CATTGTCCTT
GATACCGCCT GCTGGGCGAA AAACTTTAAT TATGGCGCGG CTAACCATCG CCAGATTGGT
AACGGCCCGA TGCTGGTGTT AAGCGACAAG TCACTGATTG CGCCGCCAAA ACTCACCGCC
TGGATCGAAA CCGTGGCGGC AGAAATTGGC GTGCCGTTAC AGGCGGATAT GTTCAGTAAC
GGCGGCACGG ACGGTGGAGC GGTGCACTTA ACCGGTACTG GCGTACCCAC AGTGGTGATG
GGGCCTGCCA CCCGCCACGG ACATTGCGCC GCGTCGATTG CCGATTGCCG TGACATTTTG
CAGATGGAGC AACTTTTATC TGCCCTTATT CAACGTCTTA CGCGTGAGAC GGTTGTTCAA
CTGACGGATT TCAGATGA
 
Protein sequence
MDLSLLKALS EADAIASSEQ EVRQILLEEA DRLQKEVRFD GLGSVLIRLN ESTGPKVMIC 
AHMDEVGFMV RSISREGAID VLPVGNVRMA ARQLQPVRIT TREECKIPGL LDGDRQGNDV
SAMRVDIGAR SYDEVMQAGI RPGDRVTFDT TFQVLPHQRV MGKAFDDRLG CYLLVTLLRE
LHSAELPAEV WLVASSSEEV GLRGGQTATR AVSPDVAIVL DTACWAKNFN YGAANHRQIG
NGPMLVLSDK SLIAPPKLTA WIETVAAEIG VPLQADMFSN GGTDGGAVHL TGTGVPTVVM
GPATRHGHCA ASIADCRDIL QMEQLLSALI QRLTRETVVQ LTDFR
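
The gene and protein sequences above can be cross-checked against each other. The sketch below, with hypothetical helper names, strips the 10-character block spacing and translates codons using the amino-acid assignments of translation table 11, which are the same as the standard code (table 11 differs only in the start codons it permits). For brevity it uses only the first and last lines of the gene sequence, not the full 1038 bp.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks used in the blocked sequence layout."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    dna = strip_blocks(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first_line = "ATGGATTTAT CGCTATTGAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG"
last_line = "CTGACGGATT TCAGATGA"

print(translate(first_line))  # MDLSLLKALSEADAIASSEQ
print(translate(last_line))   # LTDFR*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final five residues followed by the TGA stop codon.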