Gene Caul_3893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3893 
Symbol 
ID5901355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4213426 
End bp4214550 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content74% 
IMG OID641564414 
Productcytochrome c-type biogenesis protein CcmI 
Protein accessionYP_001685516 
Protein GI167647853 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR03142] cytochrome c-type biogenesis protein CcmI 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0160389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.633294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGTT TCTGGATCGC CGCGGCGGGA TTGTCGGCTT TTGTCGCAGC CTTGATGCTG 
CGCGCGACCG CGCGCGCGGC TCTGGCCTCG GGCGCGGGCG GCGACGACGC CAGCCTGGCG
GTGCACCGCC GACAGCTTTC CGAGATCGAC GATCTGGCCG AGCGCGGCCT GCTGGCCGAG
GGCGAGCTCA AGGCCGCGCG GGCCGAAGCC GGGCGCCGGC TGATCGCCGC CGCCGACCAT
CTGCAAGCTT GGCCGGCCGC CAATCCCAAG GCTCGCCCTC TGGTGTTGGC CTTGGCCGCC
GCCGCTCCGA TGATCGCCCT GGTCATCTAC ATGCTGGTCG GCGCGCCGGG CGTGGCGGAC
CAGCCGTTCC TCAAGCGCGT CGCGGCCTGG CGCGAAGCCG ATCCCGCTCA GCTCGATCCG
CGGAAGATCG CCGCCGTGCT CGAACAGATC GCGATCGCGC GGCCTGCCGA TCCCGAACCG
CTCAAGCACT TGGCCCTGGC GCGGATGGCC GGCGGCGACC CAACTGGCGC GACCCAGGCC
CTGCGCCGGG CCGTGACCCT GGACCCGGCC CGCGTCGACC TGTGGATCGA CCTGGGCCAG
GCCTTGGTGG CCGAGGGCGA CGGCGAGGTT GGCGCCGACG CCCGGCGCGC CTTTTCCGAA
GCCCTGAAGC GCGACCCCGG CAATGTGGTC GCCCGCTATC ACCTGGCGCG GGGCAGGATC
GCCGACGGCG ACGTTTCCGG CGGCCTCGCC GACTGGCGCG CCCTGCTGGC CGACCTGCCG
GCCGGGGATC CGCGCCGCCA GGGCTTCAGC CAGGAGATCG CCCAGGTCCA GGCCAATGGC
GGCCTGCCGG CCTCCACCGC GCCCACGGGC CAGCCGGGTT CGACCACCGG GGGCGACGTC
CAGGGCATGA TCCAGGGCAT GGTCGCGGGC CTGGCCGCCC GGCTGGAAAC CGCGCCAGAC
GATCCCGACG GCTGGGTCAA GCTGGTGCGC GCCTATGCCG TGCTGGGCGC GGCCGCCAAG
CGTGACGCCG CCCTCGCCAA GGCGACCCAG CGCTACCAGG ACCAGCCCAA GGTGCTGGCC
GCCCTGCGCC AGGCCGCCCA AACCCCCAAA GCCCAGACGC CATGA
 
Protein sequence
MIGFWIAAAG LSAFVAALML RATARAALAS GAGGDDASLA VHRRQLSEID DLAERGLLAE 
GELKAARAEA GRRLIAAADH LQAWPAANPK ARPLVLALAA AAPMIALVIY MLVGAPGVAD
QPFLKRVAAW READPAQLDP RKIAAVLEQI AIARPADPEP LKHLALARMA GGDPTGATQA
LRRAVTLDPA RVDLWIDLGQ ALVAEGDGEV GADARRAFSE ALKRDPGNVV ARYHLARGRI
ADGDVSGGLA DWRALLADLP AGDPRRQGFS QEIAQVQANG GLPASTAPTG QPGSTTGGDV
QGMIQGMVAG LAARLETAPD DPDGWVKLVR AYAVLGAAAK RDAALAKATQ RYQDQPKVLA
ALRQAAQTPK AQTP