Gene Caul_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0046 
Symbol 
ID5897758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp56597 
End bp59302 
Gene Length2706 bp 
Protein Length901 aa 
Translation table11 
GC content72% 
IMG OID641560529 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_001681682 
Protein GI167644019 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.135172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.385882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCGC CCCCCGTCGA CAGCGCCGTC ATGCCCGCCC GGCTCAGCGA CGCTGAAAGC 
GCGCGGTGGT TCTTCGACCA TTCGCACGAC CTGTTCGCCG TGGTGTCGCG GGGCCGGTTC
GCCACCGTCA ACCCGGCCTG GACCCGGGTG ACGGGTTGGA GTCGCCAGGA TCTGGTCGGC
CAGCCCGCCC TGCGCTTCGT GCATCCGGAG TCGCGCCAGG ACTTCATCGA GACCGCCAGC
CTGATCCGCG CCGCCGGCGA GGCGTTCAAC CGCCTCAAGG TGGTCTGCAA GAACGGCGGC
TGGGTGTGGT TAGAGAGCCA GTCGCGCCTG GGTCCCAACG GCGAGATGGT CGGCGCCTTG
CGCGACGTCA CCGCCGACAT GGCCCGCCGC GCCGAGCTGG CCGTCGCCCG CCAGACCCGC
GAGCTGCTGG CCTCGTCGGC GGGCATCGGC GTCTGGAGCT ACGAGCCGCA CGGCCAGCAC
ATCGAGTGGT CGCGCGACAT CCTGACCGTC ACCGGCCTGG AGCCCGAGGA TATCGCCCAG
CCCGACCAGT TCTACGATCG CCTTGACCCG GCCGAGCGCG AGGGTCTGCG GGCGACGTTC
GACGCCGCCG TGCGCACCGG CGAGGGCGCG ACCATCGAGC ACCGGCTGCG CGGAGCCGGC
GACCAATGGT TCACCTACCG CGCCACCTTC CGCACCGAGC CGCGCGGCGA CGGCCTGTTC
GCCTTGAAGG GCATCTCCCA GAACGTCACC GACGTGGCCC GGACCCGCGA CGCCGCCGTC
TGGGGCGAAC GCCAGGCCCG CCGGCTGGTC GAGGAGGCGC CGTTCGCCGT GGCGCTCTAT
GACCGCGACC TGCGTCTGCG GGTGGTCAGT CCGCGCTTCC TGGAGATCTT CCAGGCCACA
GAGGAAGGCG TGATCGGCCG CAGCCTGCAC GACCTGACCT CGGGAACCCG CCGGCGCTTC
GTCAACGCCG TCGAACGAGC CTTGACCGGC GAGACGGTGG TGCGCCGCGA GGACCAGCTG
CGTGACGCCC TGGGCCGCAG CCACACCCTG CGCTGGGAGG CCCGGCCCTG GCGCGACGCC
GACGGCGAGA TCGGCGGCGT CATCACCTAC ATGGACGATG TCACCGCCCT CTCGGACGCC
CGCCGCGAGG CGCGGGTCAA CGCCCGGCGG TTGCGGGTGG CGCTCGGCGC GGCCCAGGCG
GGGGTCTACG AGATCGACCA CGTGGACAAG GCCTTCTGGG GCTCGCCCGA GTTCCATCGG
ATCCTGGGCC GCCGGATCAG CTATGAGGAC GTGCGCAGCG CCGTCTGGCC GATGATCCAC
CCCGACGACG TGGCCTCGGT CTACGACGCC TCGGACGCCT GGCTCGGCAA GCGTGAGGAT
GGGGGCGGGC GCTCGTTCGA CGTGCGGGTG GTCACCGGCC GCGGCGAGAC GCGCTGGATC
CGCATCTTCC ACGAGGTGCG CCAGGACGCC GGCGGCCGGA TCCGCAAGGC GTTCGGCCTG
ATCCTCGACA TCGACGACAA GAAGCGCGCC GAACTGGCCC TGGTCGAGGC CGAGCGCGCC
GCCCAAGCCG CCAACGAGGC CAAGGCCCAG TTCCTGGCCA ATATGAGCCA CGAGATCCGC
ACCCCGATGA ATGGGGTGCT GGGGGTGATG CACCTGCTCA AGCGGGAGAA GCTGCCGGGC
GACGCCGACC ACCTGCTGGG CGAGGCCCTG GCTTGCGGCC GGATGCTGTC GACCCTGCTG
GACGACGTGA TCGACTTCTC GCGCATCGAG GCCGGGCGGC TCGACGTCAC CCGCGAGGCG
GTCGATCCCA GCGAACTGGC GATGAGCGTG GCGCGGCTGC TGCGCGCCCA GGCCGAGCAC
AAGGGTCTGG AACTGCGGGT CGAGGCCCCG GATCTGGGCC TGATCCTCAC CGACCCCAGT
CGTCTGCGCC AGGCGCTGTT CAACCTGGTG GGCAACGCCG TGAAGTTCAC GCTGGCGGGC
TCGGTGACCC TGAAGGTGCG CCTGCTGGAC GGGGGCGGCC CCCAGGACGT TCCCAAGATC
GTCTTCGAGG TGATCGACAC CGGGGTCGGC ATCGCCGCCG AGGCTCAGGC GCGACTGTTC
GAGCGCTTCC AGCAGGCCGA CGCCAGCACC ACCCGCCGCT TCGGCGGGTC GGGCCTGGGC
CTGGCCATCA CCCGGCGGCT GGCCGAGATG ATGGGCGGCG AGGTCGGCTT CACCTCTCGC
GAGGACGCCG GCTCGACCTT CCTGCTGACC ATCGCCGCCC CGCCGGCCCA AGCCCGCGCG
GTCGAGTCCG AGATTGCGGA TGGGCTGCTG CAGGGCCTGA AGATCCTGGT GGTCGAGGAC
AACGCCACCA ACCGCCTGGT GGCGCGCCGC ATCCTCGAGC AGCTGGGCGC CAAGGTCGAG
ACGGCCGAGG ACGGGCTGGA CGGCGTGACC ACGGCGGCGC GTGGCTTCGA CCTGATCCTG
ATGGACGTCC AGATGCCCGG CATCGACGGG CTGGAGGCCG CCCGCCGCAT CCGCGACCTG
CCGGGTCCCG CCGCCCGCAC CCCGATCATC GCCCTGACCG CCAACGTCCT GTCGCACCAG
CGGGCCGCCT ATCTGGCCGC CGGCATGGAC GGCGTCGCCG CCAAGCCGAT CGTTCCGGCC
GCCCTGATCG GCGAGATCCT CCGGCTGGCG GGCGCGGAGA GCGGGGAAGC GGCGGCGGTG
GCCTAG
 
Protein sequence
MSPPPVDSAV MPARLSDAES ARWFFDHSHD LFAVVSRGRF ATVNPAWTRV TGWSRQDLVG 
QPALRFVHPE SRQDFIETAS LIRAAGEAFN RLKVVCKNGG WVWLESQSRL GPNGEMVGAL
RDVTADMARR AELAVARQTR ELLASSAGIG VWSYEPHGQH IEWSRDILTV TGLEPEDIAQ
PDQFYDRLDP AEREGLRATF DAAVRTGEGA TIEHRLRGAG DQWFTYRATF RTEPRGDGLF
ALKGISQNVT DVARTRDAAV WGERQARRLV EEAPFAVALY DRDLRLRVVS PRFLEIFQAT
EEGVIGRSLH DLTSGTRRRF VNAVERALTG ETVVRREDQL RDALGRSHTL RWEARPWRDA
DGEIGGVITY MDDVTALSDA RREARVNARR LRVALGAAQA GVYEIDHVDK AFWGSPEFHR
ILGRRISYED VRSAVWPMIH PDDVASVYDA SDAWLGKRED GGGRSFDVRV VTGRGETRWI
RIFHEVRQDA GGRIRKAFGL ILDIDDKKRA ELALVEAERA AQAANEAKAQ FLANMSHEIR
TPMNGVLGVM HLLKREKLPG DADHLLGEAL ACGRMLSTLL DDVIDFSRIE AGRLDVTREA
VDPSELAMSV ARLLRAQAEH KGLELRVEAP DLGLILTDPS RLRQALFNLV GNAVKFTLAG
SVTLKVRLLD GGGPQDVPKI VFEVIDTGVG IAAEAQARLF ERFQQADAST TRRFGGSGLG
LAITRRLAEM MGGEVGFTSR EDAGSTFLLT IAAPPAQARA VESEIADGLL QGLKILVVED
NATNRLVARR ILEQLGAKVE TAEDGLDGVT TAARGFDLIL MDVQMPGIDG LEAARRIRDL
PGPAARTPII ALTANVLSHQ RAAYLAAGMD GVAAKPIVPA ALIGEILRLA GAESGEAAAV
A