Gene ECH74115_2579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2579 
SymbolprtB 
ID6969210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2434904 
End bp2436964 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content50% 
IMG OID643386445 
Productprotease 2 
Protein accessionYP_002270927 
Protein GI209398588 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.639161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACCAA AAGCCGCCCG CATTCCCCAC GCCATGACGC TTCATGGCGA TACGCGCATC 
GATAATTACT ACTGGCTGCG GGACGATACG CGTTCTCAGC CGGAAGTCCT GGACTACCTG
CAACAAGAAA ATAGTTACGG TCATCGGGTG ATGGCCTCAC AACAAGCCTT GCAGGATCGC
ATCTTAAAGG AAATCATCGA CCGCATTCCG CAACGAGAAG TTTCTGCGCC CTACATCAAA
AATGGCTACC GCTACCGGCA TATTTATGAA CCAGGCTGTG AATATGCTAT CTACCAGCGT
CAATCGGCGT TCAGTGAAGA GTGGGACGAG TGGGAAATAT TGCTCGATGC CAACAAGCGC
GCGGCTCATA GTGAGTTTTA TTCGATGGGC GGAATGGCGA TTACGCCCGA TAACACCATT
ATGGCGCTGG CAGAAGATTT TCTTTCCCGA CGCCAGTACG GCATTCGTTT TCGTAATCTG
GAAACAGGTA ACTGGTACCC GGAACTGCTG GATAACGTTG AACCCAGCTT TGTCTGGGCA
AATGACTCCT GGACTTTCTA CTATGTTCGC AAGCATCCGG TGACGCTGCT GCCTTATCAG
GTCTGGCGTC ACGCCATCGG TACGCCAGCA TCGCAAGATA AACTGATTTA CGAAGAAAAA
GACGATACCT ATTACGTCAG CCTGCATAAA ACGACCTCAA AGCACTATGT AGTCATTCAT
TTGGCCAGCG CCACCACCAG TGAAGTTCGC CTGCTGGACG CGGAAATGGC AGATGCCGAG
CCGTTTGTTT TTCTGCCGCG CCGCAAAGAT CACGAATACA GCCTTGATCA CTACCAGCAT
CGTTTTTATC TGCGTTCCAA CCGCCACGGC AAAAACTTTG GCTTATACCG TACCCGTATG
CGTGATGAGC AACAGTGGGA AGAGTTAATT CCGCCACGCG AAAACATCAT GCTGGAAGGG
TTTACGCTGT TTACCGACTG GCTGGTGGTT GAAGAGCGTC AGCGCGGGTT AACCAGTTTG
CGCCAAATTA ACCGCAAGAC CCGGGAAGTC ATTGGTATTG CCTTTGATGA TCCAGCCTAT
GTGACCTGGA TTGCCTACAA TCCAGAACCT GAAACCGCGC GATTGCGTTA TGGTTATTCT
TCCATGACCA CACCAGACAC TTTGTTTGAA CTGGATATGG ATACCGGTGA GCGTCGTGTA
TTAAAACAAA CAGAAGTTCC TGGTTTTGAT GCGGCGAATT ACCGCAGTGA ACACCTGTGG
ATAGTCGCCC GTGATGGTGT CGAAGTTCCG GTTTCGTTGG TCTACCATCG CAATCATTTT
CGCAAAGGAC ACAACCCGCT GCTGGTGTAT GGCTACGGTT CTTACGGCGC AAGTATTGAT
GCCGATTTCA GTTTTAGCCG CTTGAGTTTG TTAGATCGTG GCTTTGTCTA CGCCATTGTC
CATGTTCGCG GCGGCGGTGA GCTGGGGCAA CAATGGTACG AAGACGGTAA ATTTCTGAAG
AAGAAAAATA CGTTTAATGA TTATCTTGAT GCCTGCGATG CATTGTTAAA ACTGGGCTAT
GGCTCTCCTT CGCTTTGTTA TGCGATGGGC GGGAGTGCGG GGGGCATGTT GATGGGCGTT
GCAATTAATC AACGCCCGGA ATTATTCCAC GGCGTTATCG CCCAGGTACC GTTTGTTGAT
GTTGTAACAA CGATGCTTGA TGAATCAATT CCTCTTACCA CTGGTGAGTT TGAAGAGTGG
GGGAATCCGC AGGATCTGCA ATATTACGAG TATATGAAAA GCTACAGCCC GTATGACAAC
GTCACCGCAC AGGCTTATCC ACATTTACTG GTAACGACCG GTTTACACGA TTCTCAGGTG
CAATATTGGG AACCGGCAAA ATGGGTCGCT AAGTTGCGCG AGCTGAAAAC CGATAACCAT
CTTTTATTGC TCTGTACCGA CATGGACTCA GGCCATGGCG GTAAATCTGG TCGCTTTAAA
TCGTACGAAG GCGTAGCGAT GGAATATGCT TTTCTGGTCG CGCTGGCGCA GGGAACATTA
CCCGCTAGGC CTGCGGATTA G
 
Protein sequence
MLPKAARIPH AMTLHGDTRI DNYYWLRDDT RSQPEVLDYL QQENSYGHRV MASQQALQDR 
ILKEIIDRIP QREVSAPYIK NGYRYRHIYE PGCEYAIYQR QSAFSEEWDE WEILLDANKR
AAHSEFYSMG GMAITPDNTI MALAEDFLSR RQYGIRFRNL ETGNWYPELL DNVEPSFVWA
NDSWTFYYVR KHPVTLLPYQ VWRHAIGTPA SQDKLIYEEK DDTYYVSLHK TTSKHYVVIH
LASATTSEVR LLDAEMADAE PFVFLPRRKD HEYSLDHYQH RFYLRSNRHG KNFGLYRTRM
RDEQQWEELI PPRENIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY
VTWIAYNPEP ETARLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFD AANYRSEHLW
IVARDGVEVP VSLVYHRNHF RKGHNPLLVY GYGSYGASID ADFSFSRLSL LDRGFVYAIV
HVRGGGELGQ QWYEDGKFLK KKNTFNDYLD ACDALLKLGY GSPSLCYAMG GSAGGMLMGV
AINQRPELFH GVIAQVPFVD VVTTMLDESI PLTTGEFEEW GNPQDLQYYE YMKSYSPYDN
VTAQAYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDNH LLLLCTDMDS GHGGKSGRFK
SYEGVAMEYA FLVALAQGTL PARPAD