Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_3406 |
Symbol | dipZ |
ID | 4253972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 4066050 |
End bp | 4067891 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 638120044 |
Product | thiol:disulfide interchange protein precursor |
Protein accession | YP_735529 |
Protein GI | 113971736 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0237355 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TACTTAGCCT TATTTTCACC GCGTTACTGT TCCTTACGTC TCTCACCCTA AGTGGCAACG CCTTCGCCAA TAGCTTTGGT TTTCTAAAAA GCGAACCCGA GTTAATGCCC GTCGATCAAG CCTTTGCCTT CGATTTTAAG CAAGAAGGTA ATAAGGTCAC CCTCAATTGG GTGATTGCCG ATGGCTATTA TATGTATCGG GATAAGCTCA AATTTAGCGT CAATGGCGCC GAACTTGGCA CCATCGACCT GCCAAAGGGT AAGCCCCACA ACGACGAATA TTTCGGTGAG CAAGAGGTTT ACTACACCTA TATCGATATT CCCGTTGGCC TGAAACAAGC CGATGATAAC GGCACCTTAA GCGTGACCTT TATGGGCTGC GCCGAGGGTA AACTCTGTTA TCCACCGACC AAACGTGATG TGACCCTAAA AGCTGTCGCG GCCAATGATG GGAACATTGC AACGGGTGCA GACAGCAACG AAACGACTGA ACCTACCGCC ACTGCTGATG CTAAGCAAAC AGGTAGCGCC CCAAGCCAAC CTATTACTCA GCAGGACAGC CTAAGCCAGA TGCTGTCTAA CGACAGTTTA CTCTGGACCT TAGTGATCTT CTTCGGCCTA GGGATTGGTC TGGCGCTCAC GCCTTGCGTC TTCCCTATGT ATCCGATCCT ATCGGGCATT ATTGTCGGCC AAGGGCAAAA ACTATCAACC GCCAAAGCCT TCACCCTGTC GATGGCCTAT GTGCAAGGTA TGGCGATCAC CTACTCGATT TTAGGTCTAG TGGTTGCCTC TGCAGGGATG AAATACCAAG CGGCGCTACA GCATCCCGCT GTGCTGATCT TTTTAGCAAT CCTGTTCTTT GTGCTCAGTT TATCCATGTT TGGTTTGTAC GATCTCAAAC TGCCTTCTAG CTGGCAGGAG AAGATGAACT CGATTTCAAA CAATCAAAAG GGCGGCAATT TAGTTGGCGT GTTTTTGATG GGGGTGATCT CAGGATTAGT GGCCTCACCT TGTACTACGG CCCCGCTCTC AGGCGCCTTA GTCTATGTGG CGCAAACTGG CGATTTATTG CAAGGTTTCC TCGCGCTCTA CGTGCTGAGT ATGGGTATGG GTGTACCGCT ACTCATCATT GGTACCTCGG GTGGTAAGTT ACTGCCACGC GCTGGCGCTT GGATGAATAT CATTAAAACC GTGTTCGGTT TCCTCCTGAT TGCGGTATCG ATTGTGATGC TTGGCCGTAT TTGGACGGGC GTGGTTTCCG ATCTGCTCTG GTCGCTGTGG GGCATTAGCT TTACCGGTTA CCTGATGCAC CAAAACAAAC TCAGCGCCTT TAACTGGAAA CAAACCGTGC GCTCAGTGCT GCTGACACTC ACGCTGTTGG CCAGCTTCTC CTACGGTTTC CAAGCGGTCA TGGGCCACTT TGGCTTTACC CATGCGCCAA TGGGCGGCGT GGCGACAACT GAGGAGGAAC ACGGCTTTAA ACGGATTAAA TCCATTGAAG ATTTAGACCG TGAAATCGCC GCAGCGACTG CTGCGGGCAA GCCCGTGATG CTGGACCTCT ACGCCGATTG GTGTGTGGCC TGTAAAGAGT TTGAAGCCAT CACCTTTAAG GACGCCGAAG TCTTAGCGCG GATGAATAAA ATAGTGTTAT TGCAGGCCGA CGTGACTAAG AGTGATGCCA TTGATGTGGC ACTGCTGGAG AAATACAATG TCCTCGGTTT ACCCACGCTG CTGATGTTTA ATGAGCAAGG CGAGCAAAGG GAAGATTTAA GAGTGACTGG CTTTATGGGA CCGAAAGAAT TTGCCGCCCA TTTAGATCAC TTAGTGAAAT AA
|
Protein sequence | MKKLLSLIFT ALLFLTSLTL SGNAFANSFG FLKSEPELMP VDQAFAFDFK QEGNKVTLNW VIADGYYMYR DKLKFSVNGA ELGTIDLPKG KPHNDEYFGE QEVYYTYIDI PVGLKQADDN GTLSVTFMGC AEGKLCYPPT KRDVTLKAVA ANDGNIATGA DSNETTEPTA TADAKQTGSA PSQPITQQDS LSQMLSNDSL LWTLVIFFGL GIGLALTPCV FPMYPILSGI IVGQGQKLST AKAFTLSMAY VQGMAITYSI LGLVVASAGM KYQAALQHPA VLIFLAILFF VLSLSMFGLY DLKLPSSWQE KMNSISNNQK GGNLVGVFLM GVISGLVASP CTTAPLSGAL VYVAQTGDLL QGFLALYVLS MGMGVPLLII GTSGGKLLPR AGAWMNIIKT VFGFLLIAVS IVMLGRIWTG VVSDLLWSLW GISFTGYLMH QNKLSAFNWK QTVRSVLLTL TLLASFSYGF QAVMGHFGFT HAPMGGVATT EEEHGFKRIK SIEDLDREIA AATAAGKPVM LDLYADWCVA CKEFEAITFK DAEVLARMNK IVLLQADVTK SDAIDVALLE KYNVLGLPTL LMFNEQGEQR EDLRVTGFMG PKEFAAHLDH LVK
|
| |