Gene Caul_4724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4724 
Symbol 
ID5902186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5110719 
End bp5112110 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content68% 
IMG OID641565243 
Productcarboxyl-terminal protease 
Protein accessionYP_001686342 
Protein GI167648679 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.458117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0374379 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGT ACCTGCTCAT CGGTGTTTCG GCCTTCGTCC TCGGCGCGGG GACCATGGCC 
TACGTCAGCC CCATCGCCCA GGCGAACAGT TCGACCAAGG GCCAGACCTA CAAGATGCTG
GAGCTGTTCG GCGACGTCCT GCAGACGGTC GACAACCAGT ACGTCTCCGA GGTCGACAAC
AAGAAGCTGA TCGAGGCGGC CCTCGACGGC ATGCTGACCA GCCTCGATCC GCATTCCGGC
TACCTGTCGC CCGACAGCTT CGAGGACATG CAGGACACCA CGCGCGGCGA ATATGGCGGC
CTGGGCATCG AGGTCACCAG CGAGGACGGC GTGGTCAAGG TGATCTCGCC GATCGACGGC
ACGCCCGCCA TGCGGGCCGG CATCCAGGCC GGCGACTACA TCACTTCGGT CAACGGCCAG
TCGGTGCTGG GCCTGACCGT CAACGAAGCG GTCAAGCAGA TGCGCGGGCC GGCCGGCGAG
GCGGTCACCC TGACCATCGC CCGCGACAAG ACCGATCCGT TCGACGTCAA GCTGACGCGC
GAGGTGATCA AGCCCAAGGC CGCCATCGCC AAGATGGAAG GCGACTACGG CTATGTCCGC
CTGCCCGGCT TCAACGAGAA GGCCACCGAC GCCCTGACCG CGGCGATCAA CGAGCTGAAG
ACCAAGAACC CCCACATGAA GGGGCTGATC TTCGACCTGC GCAACAATCC CGGCGGCCTG
CTCGACCAAG CCGTGGGCGT CTCGGACGTG TTCCTCGATG GCGGCGAGGT GGTCAGCCAG
CGCGGCCGCG ACCCGCGCGA CATCCAACGC TACAACGCCA AGCCTGGCGA CCTGCTGAAC
GGCCTGCCGG TGGTGGTGCT GATCAACCAG GGCTCGGCCT CGGCCGCCGA AATCGTCGCC
GGCGCCCTGC AGGACCGCCA TCGCGCCGAA CTGGTCGGCA TCACCAGCTT CGGCAAGGGC
TCGGTGCAGA CCGTGATCCC GCTGCGCGGC GGGGCCGACG GGGCCCTGAA GCTGACGACG
GCGCGCTACT ACACGCCGTC GGGCCGCTCG ATCCAGAAGA CCGGCATCGC GCCCGACCTG
GAAGTGGCCC AGACCAAGGA CCAGGCTCAG GACATCGCCA ACCGCGTTTG GTTCAGCGAG
GCCAGCTTCA AGAACGCGCT GAACGCCGAC GAGGGCAAGA CCCGCCAAGG GGTCCACACA
CCGGCCGAGG CCCCGCCCCC CGGCTTCGAC GACAAGAAGG GCGACTTCCA GCTGGACCGC
GCCATCGCCG TGCTGAAGGC CGGCTCGGTC CAGGCCGTGC CGAAACTGCC CAAGCCCCAG
GCCAAGATCG CCGAAGTCAC CGCGAAAGCC GCGGCGGCGG CCGGCAAGGG TCCGCCGGCG
GTGGAGAAGT AG
 
Protein sequence
MRKYLLIGVS AFVLGAGTMA YVSPIAQANS STKGQTYKML ELFGDVLQTV DNQYVSEVDN 
KKLIEAALDG MLTSLDPHSG YLSPDSFEDM QDTTRGEYGG LGIEVTSEDG VVKVISPIDG
TPAMRAGIQA GDYITSVNGQ SVLGLTVNEA VKQMRGPAGE AVTLTIARDK TDPFDVKLTR
EVIKPKAAIA KMEGDYGYVR LPGFNEKATD ALTAAINELK TKNPHMKGLI FDLRNNPGGL
LDQAVGVSDV FLDGGEVVSQ RGRDPRDIQR YNAKPGDLLN GLPVVVLINQ GSASAAEIVA
GALQDRHRAE LVGITSFGKG SVQTVIPLRG GADGALKLTT ARYYTPSGRS IQKTGIAPDL
EVAQTKDQAQ DIANRVWFSE ASFKNALNAD EGKTRQGVHT PAEAPPPGFD DKKGDFQLDR
AIAVLKAGSV QAVPKLPKPQ AKIAEVTAKA AAAAGKGPPA VEK