Gene Caul_4790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4790 
Symbol 
ID5902252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5177119 
End bp5179542 
Gene Length2424 bp 
Protein Length807 aa 
Translation table11 
GC content69% 
IMG OID641565310 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_001686408 
Protein GI167648745 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.994259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGAAGG TCCTAAACTG CATTGGTTCG CAACACGACG TACGTCTCGT CGTCGTCGCC 
GGCCTGATCT GTTTCGCCGC CTGCTTCACG GCGTTTCGCC TTTACTCGCG CATGCGCGGG
GCCAAGGGCG TGGTGCGAGG CGCATGGCTG TTGCTGACCG GCCTGGTGTC CGGATCGGGC
GTCTGGGCCA CCCATTTCGT CGCCATGGTC GCCTACGATC CCGGCCTCAA GACGGGCTAC
AGCCCCACGG GCACCCTGCT GTCGTTGATG ATCTCGGTGA TGTTCATGGC CGGCGGCTTC
GCCGTGGCCT CCGCCCAGCG CTCGCGCACC AACGACTTCG CGGGCGGCCT GATCCTCGGC
ATGGGCGTCG CCGCCATGCA CTACACCGGC ATGTCGGCCT TCGTGACCCA GGGCTTCGTC
CAGTGGGAGC AGGCCACGAT CGCGGCCTCC GTTCTGGCCG GCGTCATCGG CGCGACGGCG
GCCCTGCAGT TGGCGGGCCG GGCGCGCAGC CTGCTCAAGC AGATCGGCGG CGGCGTGCTG
CTGACGCTCG GCGTCTGCAG CCTGCATTTC ATCGGCATGG GCGCGATCAC CATCGTTCCC
GATCCGGCGA TCAACGTTCC GGACCAGATG TTGTCGGGCG CTATCCTGAC CCTGGCCGTG
ACCTCGATCA CCGGCATGAT CATTCTCGGC GGCCTCGGCG CCGTAGCTAT CGAATCCTCG
ACCAGCCGTT CGGCGCTCGA CCGTATCCGC CGCCTGGCCA ACGCCGCCTA TGAAGGCATC
GTCGTGATCC AGGACGGCCT GATCAACGAC GCCAACGCCG CCTTCTGCGA GCTGGCCGGC
GCCGAACTGG ACGCCTTGGT GGGCGCGCCG CTCTCGAACC TGCTGACCTT CGACGGCGAG
GCGCCCTCCC GCGAAGGCGC GCGCCGCGAA GGCGCGCTGC AACCGGTCGA TGGCGGACGA
CAGATCCCGA TCGAGGCGTT CTCGCGCTTG ATGGACGACG GCGCCCGCCA GGAGACCTCG
GGCCTCACGG TCCTGGCCGT CCGCGACCTG CGCGAGCGGC GTTCGGCCGA GGAGAAGATC
CGCTATCTGG CCGAGCATGA CGGCTTGACC GGCCTGCCCA ACCGCAACTC GCTGCAGACT
CGCCTGGCCG CGGCCCTGGA CCGCGTCGAG GCCTCGGGCG AAAGCCTGTC GCTGATCTGC
ATCGACCTGG ACCACTTCAA GGAAGCCAAC GACCTGCACG GCCACCTGGC GGGTGACGCC
CTGCTGGTCG AGACGGCCCG TCGCCTGCAG GATTCGGTGA CCGCCCCGTC CTTCGCCGCC
CGTCTGGGCG GCGACGAATT CATCGTCGTG CAGGTCGCCG CCGGCGATCA GCCCGCCGCC
GCCGCCGAGC TGGCCGGGCA TCTGCTCGAA GCCCTGGCGG CTCCCGCCGT CTATGAGGGC
CAGGACCTGG TCATGGGCGC CAGCCTGGGC GTGTCGCTGT TTCCTGACGA CGGCCGCACG
GCCGAGGCCC TGCTGGCCAA CGCCGACATG GCGCTGTACC GCGCCAAGGA AAGTGGCCGC
GGGGCCTATC GCTTCTTCAA GCGCGAGATG GACGAGTCCA TCCGCGAACG TCGCACCATG
GCCCGCGAAC TGCGCCAGGC GATCATCGAC GAGGAGTTGA TTGTCTACTA CCAGCCGCTG
GCCACGGCGT CGGACGGCGT TGTCTGCGGC TTCGAGGCCC TGGTGCGCTG GAACCATCCG
GTGCGCGGCA TGATCCCGCC GCTGGAGTTC ATCCCCGTCG CCGAGGAGAA CGGCCTGATC
GGCCAGCTCG GCGAATGGGT GCTGCGTCGC GCCTGCGCCG ACGCCGTCAC CTGGGAGCGT
CCGCTGCGCA TCGCGGTGAA CCTGTCGCCG CTGCAACTGA ACCAGCCCGA CCTGCCTAAG
CTGGTCCATG AGGTGCTGGT CCAGACCGGC CTGTCGCCCA AGCGGCTCGA GCTGGAGATC
ACCGAGAGCG CCCTGTTCAA GGACTATCAG CGGGCGTTGG ACAACCTGCG CCGCCTGAAG
GCGCTGGGCG TGCGGATCGC CATGGATGAC TTCGGCACCG GCTTCTCGTC GCTGTCCACC
CTGCAGTCGT TCCCGTTCGA CAAGATCAAG ATCGACAAGA GCTTCGTCGA GAACATCCAT
CGCCACGACC GCGCGACAGT GATCGTCCGC GCCGTGCTGG GCCTGGGCCG CAGCCTGGAG
ATCCCGTGCG TCGCCGAGGG CGTCGAGACC CAGGAGCAGA TCGACTTCCT GCGCGGCGAG
GACTGCGCCG AACTTCAGGG CTATGCGATC GGCCGCCCGG CGCCTGTCGA CACCCTGTCG
GCCTGGACCC TGGCCAGCGT CAGCGCGACG GCCAAGCCGG CGGATCCCGT CGTCAAGAAG
TCGCGTCGCC GCAAGGCCGC CTAG
 
Protein sequence
MLKVLNCIGS QHDVRLVVVA GLICFAACFT AFRLYSRMRG AKGVVRGAWL LLTGLVSGSG 
VWATHFVAMV AYDPGLKTGY SPTGTLLSLM ISVMFMAGGF AVASAQRSRT NDFAGGLILG
MGVAAMHYTG MSAFVTQGFV QWEQATIAAS VLAGVIGATA ALQLAGRARS LLKQIGGGVL
LTLGVCSLHF IGMGAITIVP DPAINVPDQM LSGAILTLAV TSITGMIILG GLGAVAIESS
TSRSALDRIR RLANAAYEGI VVIQDGLIND ANAAFCELAG AELDALVGAP LSNLLTFDGE
APSREGARRE GALQPVDGGR QIPIEAFSRL MDDGARQETS GLTVLAVRDL RERRSAEEKI
RYLAEHDGLT GLPNRNSLQT RLAAALDRVE ASGESLSLIC IDLDHFKEAN DLHGHLAGDA
LLVETARRLQ DSVTAPSFAA RLGGDEFIVV QVAAGDQPAA AAELAGHLLE ALAAPAVYEG
QDLVMGASLG VSLFPDDGRT AEALLANADM ALYRAKESGR GAYRFFKREM DESIRERRTM
ARELRQAIID EELIVYYQPL ATASDGVVCG FEALVRWNHP VRGMIPPLEF IPVAEENGLI
GQLGEWVLRR ACADAVTWER PLRIAVNLSP LQLNQPDLPK LVHEVLVQTG LSPKRLELEI
TESALFKDYQ RALDNLRRLK ALGVRIAMDD FGTGFSSLST LQSFPFDKIK IDKSFVENIH
RHDRATVIVR AVLGLGRSLE IPCVAEGVET QEQIDFLRGE DCAELQGYAI GRPAPVDTLS
AWTLASVSAT AKPADPVVKK SRRRKAA