Gene ECH74115_5652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5652 
SymboldipZ 
ID6967013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5292996 
End bp5294693 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content54% 
IMG OID643389286 
Productthiol:disulfide interchange protein precursor 
Protein accessionYP_002273682 
Protein GI209396839 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000390549 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.76636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA 
TTATTCGACG CGCCGGGACG TTCACAATTT GTCCCCGTGG ATCAAGCCTT TGCTTTTGAT
TTTCAGCAAA ACCAACATGA CCTTAATCTG ACCTGGCAGA TCAAAGACGG TTACTACCTC
TACCGTAAAC AGATCCGCAT TACGCCGGAA CACGCGAAAA TTGCCGACGT GCAGCTGCCG
CAAGGCGTCT GGCATGAAGA TGAGTTTTAC GGCAAAAGCG AGATTTACCG CGATCGGCTG
ACGCTTCCCG TAACCATCAA CCAGGCGAGT GCGGGAGCGA CGTTAACTGT CACCTACCAG
GGCTGTGCTG ATGCCGGTTT CTGTTATCCG CCAGAAACCA AAACCGTTCC GTTAAGCGAA
GTGGTCGCCA ACAACGCAGC GTCACAGCCT GTGTCTGTTC CGCAGCAAGA GCAGCCCACC
GCGCAATTGC CCTTTTCCGC GCTCTGGGCG TTGTTGATCG GTATTGGTAT CGCCTTTACG
CCATGCGTGC TGCCAATGTA CCCACTGATT TCTGGCATCG TGCTGGGTGG TAAACAGCGG
CTCTCCACTG CCAGAGCATT GTTGCTGACC TTTATTTATG TGCAGGGGAT GGCACTGACT
TACACGGCGC TGGGTCTGGT AGTTGCCGCC GCAGGGTTAC AGTTCCAGGC GGCGCTACAG
CACCCATACG TGCTCATTGG CCTCACCATC GTCTTTACCT TGCTGGCGAT GTCAATGTTT
GGCTTGCTTA CTCTACAACT CCCCTCCTCG CTGCAAACGC GCCTCACGCT GATGAGCAAT
CGCCAACAGG GCGGCTCACC CGGCGGAGTG TTTATTATGG GGACGATTGC CGGACTGATC
TGTTCACCAT GCACCACCGC ACCGCTTAGC GCGATTCTGC TGTATATCGC CCAAAGCGGG
AACATGTGGC TGGGCGGCGG CACGCTTTAT CTCTATGCGT TGGGCATGGG CCTGCCGCTG
ATGCTAATTA CCGTCTTTGG TAACCGCTTG CTGCCGAAAA GCGGCCCGTG GATGGAACAA
GTCAAAACCG CGTTTGGTTT TGTGATCCTC GCACTGCCGG TCTTCCTGCT GGAGCGAGTG
ATTGGTGATG TATGGGGATT ACGCTTGTGG TCGGCGCTGG GTGTCGCATT CTTTGGCTGG
GCCTTTATCA CCAGCCTACA GGCCAAACGC GGCTGGATGC GCGTGGTGCA AATAATCCTG
CTTGCAGCGG CATTGGTTAG CGTGCGCCCA CTTCAGGATT GGGCATTTGG TGCGACGCAT
ACCGCGCAAA CTCAGACGCA TCTCAACTTT ACACAAATCA AAACGGTAGA TGAGTTAAAT
CAGGCGCTCG TTGAAGCCAA AGGCAAACCG GTGATGTTAG ATCTTTATGC CGACTGGTGC
GTCGCCTGTA AAGAGTTTGA GAAATACACC TTCAGCGACC CGCAGGTGCA AAAAGCGTTA
GCAGACACGG TATTACTTCA GGCCAACGTC ACTGCCAACG ACGCACAAGA TGTGGCGCTG
TTAAAGCATC TTAATGTCCT TGGCCTACCG ACAATTCTCT TTTTTGACGG ACAAGGCCAG
GAGCATCCAC AAGCACGCGT CACGGGCTTT ATGGATGCTG AAACCTTCAG CGCACAATTG
CGCGATCGCC AACCGTGA
 
Protein sequence
MAQRIFTLIL LLCSTSVFAG LFDAPGRSQF VPVDQAFAFD FQQNQHDLNL TWQIKDGYYL 
YRKQIRITPE HAKIADVQLP QGVWHEDEFY GKSEIYRDRL TLPVTINQAS AGATLTVTYQ
GCADAGFCYP PETKTVPLSE VVANNAASQP VSVPQQEQPT AQLPFSALWA LLIGIGIAFT
PCVLPMYPLI SGIVLGGKQR LSTARALLLT FIYVQGMALT YTALGLVVAA AGLQFQAALQ
HPYVLIGLTI VFTLLAMSMF GLLTLQLPSS LQTRLTLMSN RQQGGSPGGV FIMGTIAGLI
CSPCTTAPLS AILLYIAQSG NMWLGGGTLY LYALGMGLPL MLITVFGNRL LPKSGPWMEQ
VKTAFGFVIL ALPVFLLERV IGDVWGLRLW SALGVAFFGW AFITSLQAKR GWMRVVQIIL
LAAALVSVRP LQDWAFGATH TAQTQTHLNF TQIKTVDELN QALVEAKGKP VMLDLYADWC
VACKEFEKYT FSDPQVQKAL ADTVLLQANV TANDAQDVAL LKHLNVLGLP TILFFDGQGQ
EHPQARVTGF MDAETFSAQL RDRQP