Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5652 |
Symbol | dipZ |
ID | 6967013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5292996 |
End bp | 5294693 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643389286 |
Product | thiol:disulfide interchange protein precursor |
Protein accession | YP_002273682 |
Protein GI | 209396839 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000390549 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.76636 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA TTATTCGACG CGCCGGGACG TTCACAATTT GTCCCCGTGG ATCAAGCCTT TGCTTTTGAT TTTCAGCAAA ACCAACATGA CCTTAATCTG ACCTGGCAGA TCAAAGACGG TTACTACCTC TACCGTAAAC AGATCCGCAT TACGCCGGAA CACGCGAAAA TTGCCGACGT GCAGCTGCCG CAAGGCGTCT GGCATGAAGA TGAGTTTTAC GGCAAAAGCG AGATTTACCG CGATCGGCTG ACGCTTCCCG TAACCATCAA CCAGGCGAGT GCGGGAGCGA CGTTAACTGT CACCTACCAG GGCTGTGCTG ATGCCGGTTT CTGTTATCCG CCAGAAACCA AAACCGTTCC GTTAAGCGAA GTGGTCGCCA ACAACGCAGC GTCACAGCCT GTGTCTGTTC CGCAGCAAGA GCAGCCCACC GCGCAATTGC CCTTTTCCGC GCTCTGGGCG TTGTTGATCG GTATTGGTAT CGCCTTTACG CCATGCGTGC TGCCAATGTA CCCACTGATT TCTGGCATCG TGCTGGGTGG TAAACAGCGG CTCTCCACTG CCAGAGCATT GTTGCTGACC TTTATTTATG TGCAGGGGAT GGCACTGACT TACACGGCGC TGGGTCTGGT AGTTGCCGCC GCAGGGTTAC AGTTCCAGGC GGCGCTACAG CACCCATACG TGCTCATTGG CCTCACCATC GTCTTTACCT TGCTGGCGAT GTCAATGTTT GGCTTGCTTA CTCTACAACT CCCCTCCTCG CTGCAAACGC GCCTCACGCT GATGAGCAAT CGCCAACAGG GCGGCTCACC CGGCGGAGTG TTTATTATGG GGACGATTGC CGGACTGATC TGTTCACCAT GCACCACCGC ACCGCTTAGC GCGATTCTGC TGTATATCGC CCAAAGCGGG AACATGTGGC TGGGCGGCGG CACGCTTTAT CTCTATGCGT TGGGCATGGG CCTGCCGCTG ATGCTAATTA CCGTCTTTGG TAACCGCTTG CTGCCGAAAA GCGGCCCGTG GATGGAACAA GTCAAAACCG CGTTTGGTTT TGTGATCCTC GCACTGCCGG TCTTCCTGCT GGAGCGAGTG ATTGGTGATG TATGGGGATT ACGCTTGTGG TCGGCGCTGG GTGTCGCATT CTTTGGCTGG GCCTTTATCA CCAGCCTACA GGCCAAACGC GGCTGGATGC GCGTGGTGCA AATAATCCTG CTTGCAGCGG CATTGGTTAG CGTGCGCCCA CTTCAGGATT GGGCATTTGG TGCGACGCAT ACCGCGCAAA CTCAGACGCA TCTCAACTTT ACACAAATCA AAACGGTAGA TGAGTTAAAT CAGGCGCTCG TTGAAGCCAA AGGCAAACCG GTGATGTTAG ATCTTTATGC CGACTGGTGC GTCGCCTGTA AAGAGTTTGA GAAATACACC TTCAGCGACC CGCAGGTGCA AAAAGCGTTA GCAGACACGG TATTACTTCA GGCCAACGTC ACTGCCAACG ACGCACAAGA TGTGGCGCTG TTAAAGCATC TTAATGTCCT TGGCCTACCG ACAATTCTCT TTTTTGACGG ACAAGGCCAG GAGCATCCAC AAGCACGCGT CACGGGCTTT ATGGATGCTG AAACCTTCAG CGCACAATTG CGCGATCGCC AACCGTGA
|
Protein sequence | MAQRIFTLIL LLCSTSVFAG LFDAPGRSQF VPVDQAFAFD FQQNQHDLNL TWQIKDGYYL YRKQIRITPE HAKIADVQLP QGVWHEDEFY GKSEIYRDRL TLPVTINQAS AGATLTVTYQ GCADAGFCYP PETKTVPLSE VVANNAASQP VSVPQQEQPT AQLPFSALWA LLIGIGIAFT PCVLPMYPLI SGIVLGGKQR LSTARALLLT FIYVQGMALT YTALGLVVAA AGLQFQAALQ HPYVLIGLTI VFTLLAMSMF GLLTLQLPSS LQTRLTLMSN RQQGGSPGGV FIMGTIAGLI CSPCTTAPLS AILLYIAQSG NMWLGGGTLY LYALGMGLPL MLITVFGNRL LPKSGPWMEQ VKTAFGFVIL ALPVFLLERV IGDVWGLRLW SALGVAFFGW AFITSLQAKR GWMRVVQIIL LAAALVSVRP LQDWAFGATH TAQTQTHLNF TQIKTVDELN QALVEAKGKP VMLDLYADWC VACKEFEKYT FSDPQVQKAL ADTVLLQANV TANDAQDVAL LKHLNVLGLP TILFFDGQGQ EHPQARVTGF MDAETFSAQL RDRQP
|
| |