Gene Cag_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1683 
Symbol 
ID3746373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2183772 
End bp2185646 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content48% 
IMG OID637774221 
Productthiol:disulfide interchange protein DsbD 
Protein accessionYP_379978 
Protein GI78189640 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.424941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAA CCATGATTGG GCGCGTGGTG GCGCTTTTTT TTATGCTAAT AGTGGCGCTT 
GTACAAACCC TCCCTCTTTC AGCCGCTGAA TTATTGGGTC CCGACGAGGC GTTTCGATTG
CAAGCTGAGT TACAGGATAA GCGTGCTCTG CGCCTTCAGT GGACAATTGC TAACCATTAT
AAGCTCTATC GGGAATATGT ACGTGTAAGC GTTACTGAGG GTAAAGCTGA GCTGCAACCA
CTGACGTTGC CCAAAGGCAT TATGACAACC GATCCCGTGA GTGGCGAGAA AATTGAAATT
TATCACGATC AACTGACGGT ATCGCTTCCT ATGCTTAACG CTGATGCGCC ATTCACCCTC
AATGTTTCCT ATCAAGGATG TGCTGAAGAT GGACTTTGCT TTCCCCCTAT CACCAAACGC
TTTAGAGTGA ATCCCGAACA AGTTGGCAAA CTAACACCGT TAGCTGATGC CTCTCTTGAT
GGTGGGGCAA ATCCATTTGC TGCCATGCAA CAAGAGGCTA CAAGTGAATC TGCAACCTCC
GCTAAAGCTT CCACCCCACA GCCAAAAAAT CAACCTGAAA ACGATTTCTC TCTTGCAACC
TCAGCGTTGG CAAGTGGCAG CTTGTGGCAA ATTGTGCCGC TCTTTTTTCT TTTTGGGCTG
CTTCTCTCCT TTACGCCCTG CATTTTGCCG ATGGTGCCTA TTATCTCCTC CATTATTGTA
GGTGAAGGCA ACAGCAGTCG TAGCCGTAGC TTTTTATTAG CCGTAGCCTA CTGCTTAGGC
ATGGCGTTAG TTTATACCTC GCTTGGCGTG GCGGCTGGCT TAGCAGGCGA AGGCTTTGCG
GGTTTTCTGC AAAAACCGTG GGTACTCATA CTCTTTTCGC TCTTGCTCTT TGTTTTTGCC
CTTTCGATGT TTGATCTCTA TCAACTGCAA ATTCCAACGG CACTGCAAAA TCGCCTATGC
AAAGCATCGG GTAATTTGAA ACGGGGACGT TTTGTTGGTG TCTTTTTTAT GGGTGCACTC
TCAGCGCTGT TGGTAGGTCC TTGCGTTGCG GGTCCACTTG CTGGCACGCT GCTCTACATT
AGCCAAAGCA GGGATGTGCT GCTTGGCGGT TTTGCGCTTT TTGCTATGGC AACGGGCATG
AGTGTGCCAC TCTTGTTGGT TGGTGTTTCA GCAGGAAGTT TGCTGCCAAA AGCTGGTACA
TGGATGGTGG GAGTAAAATA TCTTTTTGGC GTGCTTTTGA TTGGTGTGGC AATTTGGATG
GTAACCCCTG TGTTGCCAAT GGCGCTGCAA ATGGTGTTGT GGGCAGCGTT AATGCTACTC
TCTGCGCTTT TTTTGGGGTT GTTGGATGCT GCTCCCGAAA AAGCAACGGT TGGGATGCGC
TTTAAAAAAA CAGCGGCATT ACTCCTTCTC TGTGGCGCAT TAGTTGAAGT GGTTGGAGCG
GCTTCAGGGG GAAGTAATCC CTTGCAGCCT CTTGCCCATT TACGCCCATC AGCAGGAAGC
AGTGATGCGC CAGCAAACAA CCAACTCCAC TTTACCACCG TGCGTTCACT TGCCGAGTTG
GAGACAATCT TGCAATCCAC CAACAAACCC GTCATGCTCG ATTTTTATGC CGATTGGTGC
GTGTCGTGCA AAGAGATGGA TGCCTTTGTG TTTGAAAAGC CCGAAGTGCA GCAAGCCCTA
AGTTCCATGC AACTATTGCG CGTTGATGTT ACCGCCAACA ATGCTGATGA TCGCGCCTTG
TTGAAGCGCT TTAACCTTTT TGGACCGCCC GGTATTATTT TTTTTAATGC AGAGGGAAAA
GAAATTGCAG GCAGTCATAT TGTAGGTGCT CTTGATGCCG AAGCTTTTCT TCAGCATCTA
CAAACTCTAC CATAA
 
Protein sequence
MKQTMIGRVV ALFFMLIVAL VQTLPLSAAE LLGPDEAFRL QAELQDKRAL RLQWTIANHY 
KLYREYVRVS VTEGKAELQP LTLPKGIMTT DPVSGEKIEI YHDQLTVSLP MLNADAPFTL
NVSYQGCAED GLCFPPITKR FRVNPEQVGK LTPLADASLD GGANPFAAMQ QEATSESATS
AKASTPQPKN QPENDFSLAT SALASGSLWQ IVPLFFLFGL LLSFTPCILP MVPIISSIIV
GEGNSSRSRS FLLAVAYCLG MALVYTSLGV AAGLAGEGFA GFLQKPWVLI LFSLLLFVFA
LSMFDLYQLQ IPTALQNRLC KASGNLKRGR FVGVFFMGAL SALLVGPCVA GPLAGTLLYI
SQSRDVLLGG FALFAMATGM SVPLLLVGVS AGSLLPKAGT WMVGVKYLFG VLLIGVAIWM
VTPVLPMALQ MVLWAALMLL SALFLGLLDA APEKATVGMR FKKTAALLLL CGALVEVVGA
ASGGSNPLQP LAHLRPSAGS SDAPANNQLH FTTVRSLAEL ETILQSTNKP VMLDFYADWC
VSCKEMDAFV FEKPEVQQAL SSMQLLRVDV TANNADDRAL LKRFNLFGPP GIIFFNAEGK
EIAGSHIVGA LDAEAFLQHL QTLP