Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4605 |
Symbol | dipZ |
ID | 6143776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4708312 |
End bp | 4710009 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619421 |
Product | thiol:disulfide interchange protein precursor |
Protein accession | YP_001746532 |
Protein GI | 170683820 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0011627 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.838171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA TTATTCGACG CGCCGGGACG TTCACAATTT GTCCCCGCGG ATCAAGCCTT TGCTTTTGAT TTTCAGCAAA AGCAACATGA CCTTAATCTG ACCTGGCAGA TCAAAGACGG TTACTACCTC TACCGTAAAC AGATCCGCAT TACGCCGGAA CACGCGAAAA TTGCCGACGT GCAGCTGCCA CAAGGCGTCT GGCATGAAGA TGAGTTTTAC GGCAAAAGCG AGATTTACCG CGATCGGCTG ACGCTTCCCG TAACCATCAA CCAGGCGAGT GCGGGAGCGA CGTTAACTGT CACCTACCAG GGCTGTGCTG ATGCCGGTTT CTGTTATCCG CCAGAAACCA AAACCGTTCC GTTAAGCGAA GTGGTCGCCA ACAACGCAGC GTCACAGCCT GTGTCTGTTC CGCAGCAAGA GCAGCACACC GCGCAATTGC CCTTTTCCGC GCTCTGGGCG TTGTTGATCG GTATTGGTAT CGCCTTTACG CCATGCGTGC TGCCAATGTA CCCTCTGATT TCTGGTATCG TGCTGGGCGG TAAACAGCGG CTCTCCACTG CCAGAGCATT GTTGCTGACC TTTATTTATG TGCAGGGGAT GGCACTGACT TACACGGCGC TGGGTCTGGT AGTTGCCGCC GCAGGGTTAC AGTTCCAGGC GGCGCTACAG CACCCATACG TGCTCATTGG CCTCGCTATC GTCTTTACCT TGCTGGCGAT GTCAATGTTT GGCTTGTTTA CCCTGCAACT CCCCTCTTCG CTGCAAACGC GCCTCACGCT GATGAGCAAT CGCCAACAGG GCGGCTCACC TGGCGGTGTG TTTGTTATGG GAGCGATTGC CGGACTGATC TGTTCACCAT GCACCACCGC ACCGCTTAGC GCGATTCTGT TGTATATCGC CCAAAGCGGG AACATGTGGC TGGGGGGCGG CACGCTTTAT CTCTATGCGT TGGGCATGGG CCTGCCGCTG ATGCTAATAA CCGTCTTTGG TAACCGCCTG CTGCCGAAAA GCGGCCCGTG GATGGAACAA GTCAAAACCG CGTTTGGTTT TGTGATCCTC GCACTGCCGG TCTTCCTGCT GGAGCGAGTG ATTGGTGATA TATGGGGATT ACGCTTGTGG TCGGCGCTTG GTGTCGCATT CTTTGGCTGG GCCTTTATCA CCAGCCTACA GGCCAAACGC GGCTGGATGC GCGTGGTGCA AATAATCCTG CTGGCAGCGG CATTGGTTAG CGTGCGCCCA CTTCAGGATT GGGCATTTGG TGCGACGCAT ACCGTGCAAA CTCAGACGCA TCTCAACTTT ACACAAATCA AAACGGTAGA TGAGTTAAAT CAGGCGCTCG TTGAAGCCAA AGGCAAACCA GTGATGTTAG ATCTCTATGC CGACTGGTGC GTCGCCTGTA AAGAGTTTGA GAAATACACC TTCAGCGACC CGCAGGTGCA AAAAGCGTTA GCAGACACGG TCTTACTTCA GGCTAACGTC ACTGCCAACG ACGCCCAAGA TGTGGCGCTG TTAAAGCATC TTAATGTCCT TGGCCTACCG ACAATTCTCT TTTTTGACGG ACAAGGCCAG GAGCATCCAC AAGAACGCGT CACGGGCTTT ATGGATGCTG AAACCTTCAG CGCACATTTG CGCGATCGCC AACCGTGA
|
Protein sequence | MAQRIFTLIL LLCSTSVFAG LFDAPGRSQF VPADQAFAFD FQQKQHDLNL TWQIKDGYYL YRKQIRITPE HAKIADVQLP QGVWHEDEFY GKSEIYRDRL TLPVTINQAS AGATLTVTYQ GCADAGFCYP PETKTVPLSE VVANNAASQP VSVPQQEQHT AQLPFSALWA LLIGIGIAFT PCVLPMYPLI SGIVLGGKQR LSTARALLLT FIYVQGMALT YTALGLVVAA AGLQFQAALQ HPYVLIGLAI VFTLLAMSMF GLFTLQLPSS LQTRLTLMSN RQQGGSPGGV FVMGAIAGLI CSPCTTAPLS AILLYIAQSG NMWLGGGTLY LYALGMGLPL MLITVFGNRL LPKSGPWMEQ VKTAFGFVIL ALPVFLLERV IGDIWGLRLW SALGVAFFGW AFITSLQAKR GWMRVVQIIL LAAALVSVRP LQDWAFGATH TVQTQTHLNF TQIKTVDELN QALVEAKGKP VMLDLYADWC VACKEFEKYT FSDPQVQKAL ADTVLLQANV TANDAQDVAL LKHLNVLGLP TILFFDGQGQ EHPQERVTGF MDAETFSAHL RDRQP
|
| |