Gene Caul_4691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4691 
Symbol 
ID5902153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5072205 
End bp5074421 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content71% 
IMG OID641565210 
Productcytochrome c biogenesis protein transmembrane region 
Protein accessionYP_001686309 
Protein GI167648646 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein
[COG4233] Uncharacterized protein predicted to be involved in C-type cytochrome biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.937324 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTTCGA CCCGTATCCG CCGCGTCCGG AACCCGTTCA TGTCGCTCCG CAAGCTATTG 
TCCGCCCTTC CGGCGCTCGT TGTCGCTCTG CTGCTGGGCG CGCCTGCCTT TACGGGCATG
GCGTGGGCCG AGCCGGTCCA CACCGGCCAT ATCGACGTCG AACTGATTTC CCAGGAGGCC
GGGGCCGCGC CCGGCTCGAC GGTGTTCGTG GCCTTGCGCC AGAAGATCCA GCCCGGCTGG
CATACCTATT GGCGCAATCC AGGCGACGCG GGCGACGCGA CGCGTATCGC CTGGACCTTG
CCGCCGGGCT GGGCCGCCGG CGACATCGTC TGGCCGACCC CCGAGAAGAG CCGCGTCGGG
CCGCTGCTCG ACTTTGCCTA TACCGGTGAG GTGTTGTTGC CCGTGCCGAT CAGCGTGCCG
GCCAACGCCC AGGTCGGCTC GGTCGTTACG CTCAAGGCCG CCGCCGCCTT CCTGGTCTGC
GAGCAGGTCT GCGTGCCCGA GGACGCCGTC GTCACCCTGA CCCTGCCGGT CGTGGCCGGC
ATGCCCCAGG CCGATCCCAA GTGGGGCGCC AAGGTCGCCG ACACCCTGGC CAAGGCCCCC
AAGCCAGCGG GTCTGAAGGC GGTGTTCAAC CTGCAAGGCT CGGTGCTGAA GCTGGCCGTG
ACCGGCGCGC CGCTGAAGGG CGCCGACGTC GCGGGAGCGT TCTTCTATCC CTATTCCGGC
AAGGTCATCG AACATCCGCC CGAGCAGGCG ATCGAGCGCG GCCCCGAGGG CCTGACCCTG
TCCCTGACCC CCGGCTATGA CTTCACCCAA GCCGAGGCCA AGCCGACCGA GCTGGCCGGG
GTCCTGGCCC TGAATGGCGC GGCTTACGAG ATCACCGCCA CGCCGGGCGC GATCCCCGCC
GGGGCCGGCG GGCTGGGCGC CCCGGCGGCG GCCAAGACCC TCTCGCAACC GGTCGCCAGC
CTGGGCCTGC CGCTGGCCGT GGTCTTCGCC TTCATCGGCG GCTTGATCCT CAATTTGATG
CCCTGCGTTT TCCCGATCCT GTCGATGAAG GCCGCCAGCC TGACCGCCCA CGCCCATGAC
GCCGGCAAGA CCCGCGTGCA GGGCCTGGCC TTCCTGGCCG GCGTGGTCGT CACCTTCCTT
GTCTTGGCGG GCCTGCTGAT CGCCGTGCGG GCTGGCGGGG CGGCGGTGGG CTGGGGCTTC
CAGTTGCAGT CGCCGGCCGT GGTCGCGGCC CTGGCTCTGT TGATGCTGCT GGTCGCCTTG
AACATGTCGG GCGTGTTCGA GGTCGGCGCC TCGGTGCAGG GCGTCGCCTC GGGCGCGGGC
GGCGGAGGCG GGCTGGGCGG TTCGTTCCTG ACCGGCGCCC TGGCCGTGGT CGTCGCAGCC
CCCTGCACCG CGCCGTTCAT GGCCGGCGCC CTGGGCTACG CCCTGACCCA GCCGCCCCTG
GCCTCGCTAC TGGTCTTCCT GGGCTTGGCC CTCGGCTTCG CCGCGCCGTT CGTGCTGCTG
GCCTTCATCC CCGGCCTGCT GGCTCGCCTG CCGCGTCCCG GTCCATGGAT GGACGTGCTG
AAGAAGGGCC TGGCCTTCCC GATGTACGCC ACCGCCGCCT GGCTGGCCTG GGTCTACGGC
CAGCAGACCG GCTCGATCCC CCTGGGCGCC CTGCTGGCCG CCAGCGTGCT GGTCGCCTTC
GCCGCCTGGC TCTATGGCCT CGGTCAAGCG CGGTCGATCA TGGGCAAGAG CGTGGGCGTG
CCGTTCGTGC TGGCCGGCCT GGCGGGGCTG GCCGCCATCG CCCTGGTGGT GGTCGGCGTC
CGCGCTGTTC CCGCGACATC GGCCCCGTCC GTCATGGCGT CCGCTGAAGC CCCTGCCGGT
CCCGGTCTGG CCGCCGAGCC CTGGAGCCCC GAAAGGGTCA AGGCGCTGCA AGCCGAGGGC
AAGGTGGTGA TGGTCGATTT CACCGCCGAC TGGTGCGTGA CCTGCAAGGT CAACGAAGGA
ACCGCCCTCA AGGGCCAGCG CCTGGTCGAC GCCTTCCAGG CCTCGGACGC GGTGCTGCTG
CGCGCCGACT GGACCAAGCG CGACGCCACC ATCGCCGCCG CCCTGTCCGA GCACGGCCGC
GCCGGCGTGC CGTTGTACCT GGTCTATCCG AAGGGCGGCG GCGAGCCGGT CATCCTGCCC
CAACTGCTGA CCGAGGGCTT GGTGATCGAA GCGATCGAGA AGGCCGCGAA GGGGTAG
 
Protein sequence
MSSTRIRRVR NPFMSLRKLL SALPALVVAL LLGAPAFTGM AWAEPVHTGH IDVELISQEA 
GAAPGSTVFV ALRQKIQPGW HTYWRNPGDA GDATRIAWTL PPGWAAGDIV WPTPEKSRVG
PLLDFAYTGE VLLPVPISVP ANAQVGSVVT LKAAAAFLVC EQVCVPEDAV VTLTLPVVAG
MPQADPKWGA KVADTLAKAP KPAGLKAVFN LQGSVLKLAV TGAPLKGADV AGAFFYPYSG
KVIEHPPEQA IERGPEGLTL SLTPGYDFTQ AEAKPTELAG VLALNGAAYE ITATPGAIPA
GAGGLGAPAA AKTLSQPVAS LGLPLAVVFA FIGGLILNLM PCVFPILSMK AASLTAHAHD
AGKTRVQGLA FLAGVVVTFL VLAGLLIAVR AGGAAVGWGF QLQSPAVVAA LALLMLLVAL
NMSGVFEVGA SVQGVASGAG GGGGLGGSFL TGALAVVVAA PCTAPFMAGA LGYALTQPPL
ASLLVFLGLA LGFAAPFVLL AFIPGLLARL PRPGPWMDVL KKGLAFPMYA TAAWLAWVYG
QQTGSIPLGA LLAASVLVAF AAWLYGLGQA RSIMGKSVGV PFVLAGLAGL AAIALVVVGV
RAVPATSAPS VMASAEAPAG PGLAAEPWSP ERVKALQAEG KVVMVDFTAD WCVTCKVNEG
TALKGQRLVD AFQASDAVLL RADWTKRDAT IAAALSEHGR AGVPLYLVYP KGGGEPVILP
QLLTEGLVIE AIEKAAKG