Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4719 |
Symbol | dipZ |
ID | 6872426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4579895 |
End bp | 4581598 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642787615 |
Product | thiol:disulfide interchange protein precursor |
Protein accession | YP_002218212 |
Protein GI | 198245746 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0129572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTGCTGTGCA GCACATCCGC CTTCGCCGGA TTATTCGACG CGCCGGGTCG CTCGCAGTTC GTCCCTGCCG ACCGGGCGTT TGTTTTTGAT TTTCAGCAAA ATCAACACGA TCTTACCCTC TCCTGGCAGG TAAAAGAGGG TTACTACCTT TACCGTAAAC AAATCAGTAT CACGCCGACG AAAGCGGATA TCGCAGCAGT CCAACTCCCG GCAGGCGTCT GGCATGAAGA TGAGTTCTAC GGGAAAAGCG AAATCTACCG TAAGCGGCTA AACGTTCCGG TAACGGTTAA CCAGGCGGCG GCTGGCGCGA CATTAACGAT AACGTACCAG GGGTGCGCCG ATGCGGGATT CTGTTATCCG CCGGAAACGA AAACGGTGCC GCTAAGTGAA GTCGCCGCGG CAATAGACGC CACGCCGACG CCTGCTGTCA CCCAGACGAG TGAGACGTCA AAACCCGCCG CCCAGCTACC TTTTTCCGCG CTCTGGGCAC TGCTGATCGG CATAGGCATC GCCTTTACGC CCTGCGTTTT ACCCATGTAT CCGCTGATTT CCGGTATTGT CCTGGGGGGA AGGCAACGTT TGTCGACGGG GCGCGCACTG CTGCTGGCCT TTATCTATGT ACAAGGAATG GCGCTGACCT ATACCGCACT GGGTCTGGTC GTCGCCGCAG CGGGATTACA GTTTCAGGCG GCGCTGCAGC ACCCTTATGT GTTAATCGGC CTTGCCATCG TGTTCACCCT GCTTGCGTTG TCGATGTTTG GCCTGTTCAC ACTACAGCTC CCCTCCTCAC TGCAAACCCG GCTGACATTA ATGAGTAATC GTCAGCAAGG CGGCTCGCCC GGCGGTGTTT TTGTTATGGG GGCGATTGCC GGATTAATTT GCTCGCCCTG CACTACAGCG CCGTTAAGCG CGATTCTGCT TTATATCGCT CAGAGCGGCA ACATGTGGCT GGGCGGCGGC ACGCTGTATT TGTATGCATT AGGTATGGGT CTGCCGCTAA TGCTGGTCAC CGTGTTTGGC AACCGCCTGT TACCGAAAAG CGGCCCGTGG ATGGCGCATG TTAAAACCGC TTTTGGTTTC GTGATCCTCG CGTTACCGGT CTTTTTACTG GAGCGGATTA TTGGCGAGGC TTGGGGTTTA CGTCTGTGGT CGCTGCTGGG CGTCGCTTTC TTTGGTTGGG CCTTTATCAC CAGCCTTCAG GCCAGACGAG CGTGGATGCG TATCGTACAG ATTATCCTGC TGGCGGCAGC GCTCATTAGC GTGCGGCCTC TACAGGATTG GGCGTTCGGA TCGCCATCCG CGCAAGCTCC AGCGCACCTC AATTTCACGG CTATTTCTAC CGTGGACGAA CTCAATCAGG CGCTGGCGCA GGCCAAAGGC AAACCCGTTA TGCTGGATTT CTACGCCGAC TGGTGCGTGG CCTGTAAAGA GTTTGAAAAG TATACCTTCA GCGATCCGCG GGTCCAGCAG GCGCTCGGCG ACACGGTGCT CTTGCAGGCT AACGTCACCG CTAACAATGC GCAGGATGTC GCGCTGTTAA AGCATCTGCA AGTCCTCGGG CTGCCCACCA TTCTGTTCTT TAATGCCCAA GGCCAGGAAC AGCCGCAATC GCGAGTCACC GGCTTTATGG ACGCCGCCAC CTTTAGCGCG CATTTGCACG ATCGCCAACC GTGA
|
Protein sequence | MAQRIFTLIL LLCSTSAFAG LFDAPGRSQF VPADRAFVFD FQQNQHDLTL SWQVKEGYYL YRKQISITPT KADIAAVQLP AGVWHEDEFY GKSEIYRKRL NVPVTVNQAA AGATLTITYQ GCADAGFCYP PETKTVPLSE VAAAIDATPT PAVTQTSETS KPAAQLPFSA LWALLIGIGI AFTPCVLPMY PLISGIVLGG RQRLSTGRAL LLAFIYVQGM ALTYTALGLV VAAAGLQFQA ALQHPYVLIG LAIVFTLLAL SMFGLFTLQL PSSLQTRLTL MSNRQQGGSP GGVFVMGAIA GLICSPCTTA PLSAILLYIA QSGNMWLGGG TLYLYALGMG LPLMLVTVFG NRLLPKSGPW MAHVKTAFGF VILALPVFLL ERIIGEAWGL RLWSLLGVAF FGWAFITSLQ ARRAWMRIVQ IILLAAALIS VRPLQDWAFG SPSAQAPAHL NFTAISTVDE LNQALAQAKG KPVMLDFYAD WCVACKEFEK YTFSDPRVQQ ALGDTVLLQA NVTANNAQDV ALLKHLQVLG LPTILFFNAQ GQEQPQSRVT GFMDAATFSA HLHDRQP
|
| |