Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3876 |
Symbol | dipZ |
ID | 6066324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4232032 |
End bp | 4233729 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641603291 |
Product | thiol:disulfide interchange protein precursor |
Protein accession | YP_001726807 |
Protein GI | 170021853 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA TTATTCGACG CGCCGGGACG TTCACAATTT GTCCCCGCGG ATCAAGCCTT TGCTTTTGAT TTTCAGCAAA ACCAACATGA CCTTAATCTG ACCTGGCAGA TCAAAGACGG TTACTACCTC TACCGTAAAC AGATCCGCAT TACGCCGGAA CACGCGAAAA TTGCCGACGT GCAGCTGCCG CAAGGCGTCT GGCATGAAGA TGAGTTTTAC GGCAAAAGCG AGATTTACCG CGATCGGCTG ACGCTTCCCG TCACCATCAA CCAGGCGAGT GCGGGAGCGA CGTTAACTGT CACCTACCAG GGCTGTGCTG ATGCCGGTTT CTGTTATCCG CCAGAAACCA AAACCGTTCC GTTAAGCGAA GTGGTCGCCA ACAACGCAGC GCCACAGCCT GTGTCTGTTC CGCAGCAAGA GCAGCCCACC GCGCAATTGC CCTTTTCCGC GCTCTGGGCG TTGTTGATCG GTATTGGTAT CGCCTTTACG CCATGCGTGC TGCCAATGTA CCCACTGATT TCTGGCATCG TGCTGGGTGG TAAACAGCGG CTCTCCACTG CCAGAGCATT GTTGCTGACC TTTATTTATG TGCAGGGGAT GGCGCTGACC TACACGGCGC TGGGTCTGGT GGTTGCCGCC GCAGGGTTAC AGTTCCAGGC GGCGCTACAG CACCCATACG TGCTCATTGG CCTCGCCATC GTCTTTACCT TGCTGGCGAT GTCAATGTTT GGCTTGTTTA CCCTGCAACT CCCCTCTTCG CTGCAAACAC GTCTCACGTT GATGAGCAAT CGCCAACAGG GCGGCTCACC TGGCGGTGTG TTTGTTATGG GGGCGATTGC CGGACTGATC TGTTCACCAT GCACCACCGC ACCGCTTAGC GCGATTCTGC TGTATATCGC CCAAAGCGGG AACATGTGGC TGGGCGGCGG CACGCTTTAT CTCTATGCGT TGGGCATGGG CCTGCCGCTG ATGCTAATTA CCGTCTTTGG TAACCGCTTG CTGCCGAAAA GCGGCCCGTG GATGGAACAA GTCAAAACCG CGTTTGGTTT TGTGATCCTC GCACTGCCGG TCTTCCTGCT GGAGCGAGTG ATTGGTGATG TATGGGGATT ACGCTTGTGG TCGGCGCTGG GTGTCGCATT CTTTGGCTGG GCCTTTATCA CCAGCCTACA GGCTAAACGC GGCTGGATGC GTATTGTGCA AATTATTCTG CTGGCAGCGG CATTGGTTAG CGTGCGCCCA CTTCAGGATT GGGCATTTGG TGCGACGCAT ACCGCGCAAA CTCAGACGCA TCTCAACTTT ACACAAATCA AAACGGTAGA TGAGTTAAAT CAGGCGCTCG TTGAAGCCAA AGGCAAACCG GTGATGTTAG ATCTTTATGC CGACTGGTGC GTCGCCTGTA AAGAGTTTGA GAAATACACC TTCAGCGACC CGCAGGTGCA AAAAGCGTTA GCAGACACGG TCTTACTTCA GGCCAACGTC ACGGCCAACG ACGCACAAGA TGTGGCGCTG TTAAAGCATC TTAATGTCCT TGGCCTACCG ACAATTCTCT TTTTTGACGG ACAAGGCCAG GAGCATCCAC AAGCACGCGT CACGGGCTTT ATGGATGCTG AAACCTTCAG CGCACATTTG CGCGATCGCC AACCGTGA
|
Protein sequence | MAQRIFTLIL LLCSTSVFAG LFDAPGRSQF VPADQAFAFD FQQNQHDLNL TWQIKDGYYL YRKQIRITPE HAKIADVQLP QGVWHEDEFY GKSEIYRDRL TLPVTINQAS AGATLTVTYQ GCADAGFCYP PETKTVPLSE VVANNAAPQP VSVPQQEQPT AQLPFSALWA LLIGIGIAFT PCVLPMYPLI SGIVLGGKQR LSTARALLLT FIYVQGMALT YTALGLVVAA AGLQFQAALQ HPYVLIGLAI VFTLLAMSMF GLFTLQLPSS LQTRLTLMSN RQQGGSPGGV FVMGAIAGLI CSPCTTAPLS AILLYIAQSG NMWLGGGTLY LYALGMGLPL MLITVFGNRL LPKSGPWMEQ VKTAFGFVIL ALPVFLLERV IGDVWGLRLW SALGVAFFGW AFITSLQAKR GWMRIVQIIL LAAALVSVRP LQDWAFGATH TAQTQTHLNF TQIKTVDELN QALVEAKGKP VMLDLYADWC VACKEFEKYT FSDPQVQKAL ADTVLLQANV TANDAQDVAL LKHLNVLGLP TILFFDGQGQ EHPQARVTGF MDAETFSAHL RDRQP
|
| |