Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_06980 |
Symbol | dipZ |
ID | 7759651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 662323 |
End bp | 664230 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643803619 |
Product | thiol:disulfide interchange protein precursor |
Protein accession | YP_002797923 |
Protein GI | 226942850 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGGAA TACCGGGACA ACGCGGCACC GCGCCCGCCT GCGACCAAGG TGGCGCTTGC CAGGCCGCCT GCGGCCTGCA GCGCCTCGCC GCCTCCGGTA TCATCCGTCC CCTTTGTATT TCTGCACGTT TCTGCGAGAA CGCCATGCGC CGCCTGCTGC TGCCCCTGTT CCTGCTGCTG CTCGCCCTGC CCGCCGCCGC CGGCCTGTTC GACAACCGGC CCGGCGCCGC GCTGGGCGGC CTGGACAACC GCGGCGACTT CCTGCCGGTC CGCGAGGCCT TCCGCCTGAG CCTGGTGGAC GCCACGCCGC AGGCGGTGAA ACTGCGCTTC GTCGCCGCCG AGGGCTACTA CCTGTACCGC CAGCGCTTGC AGTTCCGCAG CGAGACACTG GGGGTCGCGC CGGGCGAGCC GCGCCTGCCG GACGGCACGC GCAAGACCGA CGAGTATTTC GGGGAGACGG AGGTCTACTA CGGCGTGCTC GACGTCGAGT TGCCCGTGGC CAACCCGGAC GGGCGCCCCT TCTACCTGCA GGTCGGCTAC CAGGGCTGCG CCGACAAAGG GTTGTGCTAT CCGCCGGAAA CCGAACGGCT GCAGATCGGC GGCACTCCCG CGGCCGGGGC CGCTTCGCCC GCCGACGCGC CGGGCTGGGG CTGGCGGGAA CTGGCCCTGT TCTTCCTCGC CGGCCTCGGC CTGACCTTCA CTCCCTGCGT GCTGCCGATG CTGCCGATCC TCTCCGGCGT GGTGCTGCGC GGGGAGATCG GCGGCGCACG CGGCCTCGCC CTGTCGCTGG CCTACGTACT GCCGATGGCC GCCTGTTTCG CCGTGCTCGG CGCGCTGACG GGACTGTTCG GCGCAGAGTT GAACCTGCAG GCGCGCCTGC AATCGCCCTG GGTGCTGGCG CCCTTCGCCG CCTTCTTCGG CCTCTTCGCC CTGGCCATGT TCGGGGTCTT CGAGCTGCGC CTGCCGCCGG CCCTGGCCGC GCCGCTGGAG CGCCTGGCCG GCAATACGCG CGGCGGCTCG CTGTGGGGCG CCGCCGTCCT CGGCGTGCTG TCCAGCCTGC TGGTTTCGCC TTGCGTTTCC GCCCCGCTGG CCGGCGCGCT GCTGTACATC AGTTCGAGCG GCGATGCCCT GGGCGGCGGC CTGAAGCTCT TCGCCCTGGG ACTCGGCATG GGCGCGCCGC TGGTACTGTT CGCCGCCGGT GGAGGCGCCC TGCTGCCCAG GAGCGGTCCC TGGATGGTCG TGGTGCGCAA TGCCTTCGGC GTGCTGCTGC TGGCGGTCGC CGTGTGGCTG CTCGAACGGG TGCTGCCCGG CCCGCTGGCG CTGGCCCTGT GGGGCTCGCT GGCCGGCGGG GTGGCGCTGT TTCTCGGCGC CCTGGAGTTC ACCGCGAAGA GCCACCGGCA GAAGCTCGGC CAGTTGGCCG GGCTGGCCCT GCTGGTCTAT GCCCTGGCCA GTTGGACCGG CGCGCTGCGC GGCGAGTCCG ATCCGCTGCG TCCACTGGGC GGCGCGTCCC TCTCCGCCGC TCCCGCGGCC CGGACCGCCG GCGCCTGGCA GACCCTCGAC ACGCCGGAAG CACTGGACGC CGCCCTGGCC GAAGCGAAGA ATGCCGCCCA GCCGCTGCTG CTCGACTGGT ATGCCGATTG GTGCATCAGT TGCAAAGTGA TCGAGCGCGA GGTGTTCGCC GACCCACGGG TCGCCGCGCA GCTCGTCGGC TACCGGCTGA TCCGCTTCGA CATCACCGCC GGCACTCCGG CACAGCGCGC GCTGCTGGAT CGCTACCGGC TGTTCGGCCC GCCGGCGATT CAGTTCTTCG CCGCCGACGG CACGGAGCAC GAAAGGCTGC GGGTGGTCGG CGAGATCGAC GCCGCCGCTT TTGCGCAACG CCTGCGGGAA AGTCAATCCC AACGCTAA
|
Protein sequence | MGGIPGQRGT APACDQGGAC QAACGLQRLA ASGIIRPLCI SARFCENAMR RLLLPLFLLL LALPAAAGLF DNRPGAALGG LDNRGDFLPV REAFRLSLVD ATPQAVKLRF VAAEGYYLYR QRLQFRSETL GVAPGEPRLP DGTRKTDEYF GETEVYYGVL DVELPVANPD GRPFYLQVGY QGCADKGLCY PPETERLQIG GTPAAGAASP ADAPGWGWRE LALFFLAGLG LTFTPCVLPM LPILSGVVLR GEIGGARGLA LSLAYVLPMA ACFAVLGALT GLFGAELNLQ ARLQSPWVLA PFAAFFGLFA LAMFGVFELR LPPALAAPLE RLAGNTRGGS LWGAAVLGVL SSLLVSPCVS APLAGALLYI SSSGDALGGG LKLFALGLGM GAPLVLFAAG GGALLPRSGP WMVVVRNAFG VLLLAVAVWL LERVLPGPLA LALWGSLAGG VALFLGALEF TAKSHRQKLG QLAGLALLVY ALASWTGALR GESDPLRPLG GASLSAAPAA RTAGAWQTLD TPEALDAALA EAKNAAQPLL LDWYADWCIS CKVIEREVFA DPRVAAQLVG YRLIRFDITA GTPAQRALLD RYRLFGPPAI QFFAADGTEH ERLRVVGEID AAAFAQRLRE SQSQR
|
| |