Gene Information |
Locus tag | Cag_1683 |
Symbol | |
ID | 3746373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 2183772 |
End bp | 2185646 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637774221 |
Product | thiol:disulfide interchange protein DsbD |
Protein accession | YP_379978 |
Protein GI | 78189640 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
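
The coordinate and length fields above agree with each other if the start and end positions are treated as inclusive: 2185646 - 2183772 + 1 = 1875 bp, and 1875 / 3 = 625 codons, one of which is the stop codon, leaving 624 aa. A minimal Python sketch of that consistency check follows. The function names are illustrative assumptions and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record for Cag_1683
length = gene_length_bp(2183772, 2185646)
residues = protein_length_aa(length)
print(length, residues)
```

With the values in this record the two results match the listed Gene Length and Protein Length.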
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.424941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAA CCATGATTGG GCGCGTGGTG GCGCTTTTTT TTATGCTAAT AGTGGCGCTT GTACAAACCC TCCCTCTTTC AGCCGCTGAA TTATTGGGTC CCGACGAGGC GTTTCGATTG CAAGCTGAGT TACAGGATAA GCGTGCTCTG CGCCTTCAGT GGACAATTGC TAACCATTAT AAGCTCTATC GGGAATATGT ACGTGTAAGC GTTACTGAGG GTAAAGCTGA GCTGCAACCA CTGACGTTGC CCAAAGGCAT TATGACAACC GATCCCGTGA GTGGCGAGAA AATTGAAATT TATCACGATC AACTGACGGT ATCGCTTCCT ATGCTTAACG CTGATGCGCC ATTCACCCTC AATGTTTCCT ATCAAGGATG TGCTGAAGAT GGACTTTGCT TTCCCCCTAT CACCAAACGC TTTAGAGTGA ATCCCGAACA AGTTGGCAAA CTAACACCGT TAGCTGATGC CTCTCTTGAT GGTGGGGCAA ATCCATTTGC TGCCATGCAA CAAGAGGCTA CAAGTGAATC TGCAACCTCC GCTAAAGCTT CCACCCCACA GCCAAAAAAT CAACCTGAAA ACGATTTCTC TCTTGCAACC TCAGCGTTGG CAAGTGGCAG CTTGTGGCAA ATTGTGCCGC TCTTTTTTCT TTTTGGGCTG CTTCTCTCCT TTACGCCCTG CATTTTGCCG ATGGTGCCTA TTATCTCCTC CATTATTGTA GGTGAAGGCA ACAGCAGTCG TAGCCGTAGC TTTTTATTAG CCGTAGCCTA CTGCTTAGGC ATGGCGTTAG TTTATACCTC GCTTGGCGTG GCGGCTGGCT TAGCAGGCGA AGGCTTTGCG GGTTTTCTGC AAAAACCGTG GGTACTCATA CTCTTTTCGC TCTTGCTCTT TGTTTTTGCC CTTTCGATGT TTGATCTCTA TCAACTGCAA ATTCCAACGG CACTGCAAAA TCGCCTATGC AAAGCATCGG GTAATTTGAA ACGGGGACGT TTTGTTGGTG TCTTTTTTAT GGGTGCACTC TCAGCGCTGT TGGTAGGTCC TTGCGTTGCG GGTCCACTTG CTGGCACGCT GCTCTACATT AGCCAAAGCA GGGATGTGCT GCTTGGCGGT TTTGCGCTTT TTGCTATGGC AACGGGCATG AGTGTGCCAC TCTTGTTGGT TGGTGTTTCA GCAGGAAGTT TGCTGCCAAA AGCTGGTACA TGGATGGTGG GAGTAAAATA TCTTTTTGGC GTGCTTTTGA TTGGTGTGGC AATTTGGATG GTAACCCCTG TGTTGCCAAT GGCGCTGCAA ATGGTGTTGT GGGCAGCGTT AATGCTACTC TCTGCGCTTT TTTTGGGGTT GTTGGATGCT GCTCCCGAAA AAGCAACGGT TGGGATGCGC TTTAAAAAAA CAGCGGCATT ACTCCTTCTC TGTGGCGCAT TAGTTGAAGT GGTTGGAGCG GCTTCAGGGG GAAGTAATCC CTTGCAGCCT CTTGCCCATT TACGCCCATC AGCAGGAAGC AGTGATGCGC CAGCAAACAA CCAACTCCAC TTTACCACCG TGCGTTCACT TGCCGAGTTG GAGACAATCT TGCAATCCAC CAACAAACCC GTCATGCTCG ATTTTTATGC CGATTGGTGC GTGTCGTGCA AAGAGATGGA TGCCTTTGTG TTTGAAAAGC CCGAAGTGCA GCAAGCCCTA AGTTCCATGC AACTATTGCG CGTTGATGTT ACCGCCAACA ATGCTGATGA TCGCGCCTTG TTGAAGCGCT TTAACCTTTT TGGACCGCCC GGTATTATTT TTTTTAATGC AGAGGGAAAA 
GAAATTGCAG GCAGTCATAT TGTAGGTGCT CTTGATGCCG AAGCTTTTCT TCAGCATCTA CAAACTCTAC CATAA
|
Protein sequence | MKQTMIGRVV ALFFMLIVAL VQTLPLSAAE LLGPDEAFRL QAELQDKRAL RLQWTIANHY KLYREYVRVS VTEGKAELQP LTLPKGIMTT DPVSGEKIEI YHDQLTVSLP MLNADAPFTL NVSYQGCAED GLCFPPITKR FRVNPEQVGK LTPLADASLD GGANPFAAMQ QEATSESATS AKASTPQPKN QPENDFSLAT SALASGSLWQ IVPLFFLFGL LLSFTPCILP MVPIISSIIV GEGNSSRSRS FLLAVAYCLG MALVYTSLGV AAGLAGEGFA GFLQKPWVLI LFSLLLFVFA LSMFDLYQLQ IPTALQNRLC KASGNLKRGR FVGVFFMGAL SALLVGPCVA GPLAGTLLYI SQSRDVLLGG FALFAMATGM SVPLLLVGVS AGSLLPKAGT WMVGVKYLFG VLLIGVAIWM VTPVLPMALQ MVLWAALMLL SALFLGLLDA APEKATVGMR FKKTAALLLL CGALVEVVGA ASGGSNPLQP LAHLRPSAGS SDAPANNQLH FTTVRSLAEL ETILQSTNKP VMLDFYADWC VSCKEMDAFV FEKPEVQQAL SSMQLLRVDV TANNADDRAL LKRFNLFGPP GIIFFNAEGK EIAGSHIVGA LDAEAFLQHL QTLP
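
The protein sequence is the conceptual translation of the gene sequence, and the GC content field is the fraction of G and C bases in it. The sketch below shows one way to reproduce both from the space-separated 10-base blocks used in this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above
print(translate("ATGAAGCAAA CCATGATTGG GCGCGTGGTG"))  # MKQTMIGRVV
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields.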