Gene ECH74115_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2154 
Symboldcp 
ID6968120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2065831 
End bp2067876 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content50% 
IMG OID643386049 
Productdipeptidyl carboxypeptidase II 
Protein accessionYP_002270538 
Protein GI209396978 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000542867 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAA TTAATCCTTT CCTTGTGCAA AGCACACTGC CGTATCTGGC TCCTCATTTT 
GATCAAATTG CCAATCATCA CTATCGCCCG GCATTCGATG AGGGAATACA GCAAAAGCGG
GCAGAAATTG CTGCCATCGC GCTTAACCCG CAAACACCTG ATTTCAACAA TACTATTCTG
GCACTGGAAC AAAGCGGAGA ATTACTTACC CGCGTTACCA GCGTCTTTTT TGCGATGACT
GCGGCGCATA CCAATGATGA ATTACAGCGT CTTGACGAGC AGTTTTCCGC TGAACTGGCG
GAACTGGCTA ATGATATCTA TCTGAACGGT GAATTATTCG CGCGGGTAGA TGCTGTCTGG
CAACGCCGTG AATCTCTGGG GCTTGATAGT GAATCCATCC GCCTGGTGGA GGTGATTCAT
CAACGTTTTG TCCTAGCCGG AGCCAAACTT GCACAAGCTG ATAAAGCAAA ATTAAAAGTA
CTGAATACAG AAGCTGCGAC ACTGACCAGT CAGTTTAACC AGCGGTTACT GGCAGCAAAT
AAATCCGGCG GTCTGGTTGT GAACGATATC GCGCAGCTGG CAGGAATGAG TGAGCAAGAG
ATTGCGCTGG CGGCAGAGGC GGCTCGCGAG AAAGGTCTGG ATAACAAATG GCTGATTCCG
CTGCTGAATA CCACCCAACA ACCGGCGCTT GCCGAAATGC GCGATCGTGC GACGCGTGAA
AAACTGTTTA TTGCGGGCTG GACGCGAGCG GAAAAAAATG ATGCCAATGA TACCCGCGCT
ATCATTCAAC GTCTGGTGGA GATCCGTGCA CAACAGGCTA CATTACTCGG TTTTCCTCAT
TATGCCGCAT GGAAAATCGC CGATCAGATG GCAAAAACAC CTGAAGCAGC ACTCAACTTT
ATGCGGGAAA TTGTTCCAGC GGCGCGTCAA CGTGCGAGCG ATGAATTAGC CTCCATACAG
GCGGTTATTG ATAAGCAGCA GGGCGGGTTT AGCGCGCAGC CGTGGGACTG GGCATTTTAT
GCCGAACAGG TCCGGCGAGA GAAATTTGAT CTTGATGAGG CGCAGCTCAA GCCATATTTT
GAATTAAACA CGGTGTTGAA TGAAGGTGTA TTCTGGACCG CGAATCAGCT CTTCGGTATT
AAGTTTGTCG AACGTTTTGA TATTCCTGTC TACCATCCTG ACGTTCGGGT GTGGGAAATT
TTTGATCATA ATGGCGTGGG GCTGGCGTTA TTTTACGGTG ATTTCTTCGC CCGTGATTCA
AAAAGCGGCG GTGCATGGAT GGGCAATTTT GTTGAGCAAT CAACGCTTAA TGAAACGCAT
CCGGTAATTT ATAACGTCTG CAATTATCAG AAACCCGCTG CCGGTGAGCC TGCGTTGTTA
CTCTGGGATG ATGTCATAAC CTTATTCCAT GAATTTGGTC ATACGCTGCA CGGCCTTTTT
GCCCGCCAGC GTTATGCCAC GCTCTCCGGC ACCAACACGC CGCGTGATTT TGTCGAATTT
CCGTCGCAAA TCAACGAACA CTGGGCAACG CATCCGCAGG TATTCGCTCG CTACGCCCGG
CATTATCAGA GCGGGGCAGC AATGCCTGAC GAACTGCAAC AGAAAATGCG TAATGCCAGC
CTGTTCAACA AAGGGTATGA GATGAGCGAA CTGCTTAGCG CCGCACTTCT CGATATGCGC
TGGCATTGCC TGGAAGAAAA CGAAGCAATG CAGGATGTCG ATGATTTTGA ATTGCGGGCG
CTGGTGGCAG AAAATATGGA TCTTCCTGCT ATACCGCCAC GCTATCGCAG CAGTTATTTT
GCCCATATTT TTGGTGGCGG ATATGCCGCA GGTTATTACG CTTATCTGTG GACGCAAATG
TTGGCCGATG ATGGTTACCA GTGGTTTGTT GAGCAGGGCG GATTAACGCG TGAAAATGGG
CTGCGTTTTC GCGAGGCGAT CCTTTCCAGA GGTAACAGCG AAGATCTGGA ACGCCTGTAT
CGACAATGGC GCGGTAAGGC TCCTCAGATT ATGCCGATGC TGCAACATCG TGGCTTGAAC
ATATAA
 
Protein sequence
MTTINPFLVQ STLPYLAPHF DQIANHHYRP AFDEGIQQKR AEIAAIALNP QTPDFNNTIL 
ALEQSGELLT RVTSVFFAMT AAHTNDELQR LDEQFSAELA ELANDIYLNG ELFARVDAVW
QRRESLGLDS ESIRLVEVIH QRFVLAGAKL AQADKAKLKV LNTEAATLTS QFNQRLLAAN
KSGGLVVNDI AQLAGMSEQE IALAAEAARE KGLDNKWLIP LLNTTQQPAL AEMRDRATRE
KLFIAGWTRA EKNDANDTRA IIQRLVEIRA QQATLLGFPH YAAWKIADQM AKTPEAALNF
MREIVPAARQ RASDELASIQ AVIDKQQGGF SAQPWDWAFY AEQVRREKFD LDEAQLKPYF
ELNTVLNEGV FWTANQLFGI KFVERFDIPV YHPDVRVWEI FDHNGVGLAL FYGDFFARDS
KSGGAWMGNF VEQSTLNETH PVIYNVCNYQ KPAAGEPALL LWDDVITLFH EFGHTLHGLF
ARQRYATLSG TNTPRDFVEF PSQINEHWAT HPQVFARYAR HYQSGAAMPD ELQQKMRNAS
LFNKGYEMSE LLSAALLDMR WHCLEENEAM QDVDDFELRA LVAENMDLPA IPPRYRSSYF
AHIFGGGYAA GYYAYLWTQM LADDGYQWFV EQGGLTRENG LRFREAILSR GNSEDLERLY
RQWRGKAPQI MPMLQHRGLN I