Gene Caul_0622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0622 
Symbol 
ID5898077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp687161 
End bp690202 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content69% 
IMG OID641561104 
Producthistidine kinase 
Protein accessionYP_001682253 
Protein GI167644590 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.378276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACGG CGGCCAGACT GCTCTTGACG CTCGCCGTGC TGCTCGGACC GTTCGTGGGC 
GATCGGACCC TGGCGCTGGA CCCGGCGCGG GGGATCACCC AGTTCAAGCA CACCAGCTGG
ACGGCGGAGG ACGGGGCGCC GACCAACATC TGGGACATCG CCCAGTCGCC CGACGGCTAT
CTGTGGCTCG GCGCGGCGCG CGGCCTCTAT CGCTTCGACG GCGTGACGTT CGAGCCGATC
CCGCTCGCCG ACAACGACTA TTCCCATTCC CAGCACATCA ATGCCCTGCT GGTGGCCCGA
TCCGGCGAAC TCTGGGTCGG CTACAGGCGG GGCGGCGTCG CCGTCTTTCG CGATGGGAAG
TTGCGCGAGC TGCCCGGAAC CGATCGCGTC GGCGTGCTGA CGATGGCCCA GGACCAGGAC
GGCGTCGTCT GGGTAGGGGT CAGCGACGGC CAGCAGAACC TGGCCCGGGT CGTCAACGGC
AAGCTGCAGA GCATCGACTC GTCGTGGAAC CTGCCCCCCG GCTTCGTGAT GGACCTGCAT
GTGAGCCGTG ACGGGACGCT TTGGGTGGCG ATCGACGGCA GCCTGTCCTT CCTGCGTCGC
GGGGCCAAGC GCTTCGAGCG CACGGATATC GTCCTCGGCA AGGGCGCCGG CCTGACCGAG
GATGCGGGCG GCCGGCTGTG GGTGGCCGAT TCGCTCGGCG CCCGCCCGCT GCCCAACGTC
GCCCAGGGCG AGAACCCCAC GCCCGGATCG ACGCTCTACG AAGTCTCGGA CTCGGCCCGC
TACGCCCGCA TCCTGTTCGA CAGGGACGGC AGCCTCTGGG GCACGACGTT CGCCACTGGA
ATCTTCCGCG TGGCCAAGCC GGCGGCCGTC TCCGGGTCCG AACGTACCTC TCCCGTCAGG
GCCGAGACCT ACGAGGCCAA GGACGGCCTG AGCTCGAACA AGGCCGTGCC GATCCTGCAG
GATCGCGAAG GCAATGTCTG GATCGGGACT TCCGCGGGCC TCGACCGCCT GAGAACGGCC
AATGTCGTGG TCGAACCCGG CGTCGCCAGG TCGTCGCGCT TTGGCTACCT GCAGTTCGCC
GACAAGGACG GCGTGGTCCA TGTCGCCGAC AGCGACACGC TCTACCGGGC CGCCCCCCGC
CAGCCCCCGA AGGTGATCCG CGACCATCTC GACAACCCGA CCGCGCTTTG CCAGGACAAG
ACCGGCGCGA TCTGGCTCGG CACGGAAAAC ACCCTGGCTC CCTTGGAGGG CGCCAGGCCG
CGCGGGGTTT CCCCGCCCCT CGGGGACAAG CCCTATACCG GGTGCGTGGT GGATCGCCGT
GGAGACCTGT GGTTCGCCCT TTTCTCGAAT GGATACGCCC GACTGGACCA ACGGGGTTGG
ACGATGTTTC CCCTGTCCGC CAGCGTGGCG GCGGCGTTCC TGGCGCTCGA CAATCAGGGC
CGCGTGGTGC TGTCGACCAG CCGGGCGATC TCCCGCGTCA GTCCGGACGG GGCCGTGCAG
ACCTTGCGCG TGGATCCAAA GCTCGCGATG GGCGGCGTGC GCTCGCTCTA CCAGGGACCC
AAGGACTTCC TGATCGGCTG CGAGTTCGGC CTGCTGCGCC TGACTGGCGA CCGCTTCGAG
GCCCTTCAGG TGGCCCGTTT TCCCTGGGCC CAGGGCATCC AGGGCGTGGT CCAGACACCC
CGCGGCGAGA CCTGGATCGT CGGCGCCCTG GGCGTGGTTC GACTGCGGAC GGAAGACCTC
GACAAGGCCT TCGGCGATCC AGGACGCCCT CTGGACTATC AAGAGTTCGA CCTGAAGGAC
GGGCTTCCGG GTCCGCCTCA GCAGAACGGC TCCAAGGACG CCGTCGTGGG CGGCGATGGC
CGCATCTGGG TCCTGACGCT CGAGGGCGTC GGCTGGATCG ATCCGGCCCA TATCGTCAGG
AACACCCTGC CTCCGCCGGT ATCGATCCGC GGCGTGACCG TCGACGGCAA GACCTATGGC
GACCCGCGGG ACCTGACCCT GCCCAGGGGC GCCTCGAAGC TGCAGATCGA CTACACCGCC
CTGAGCCTCA GCATCCCGGA GCGGGTGCGT TTCCGCTACC AACTGGAGGG CGTCGACAAG
ACCTGGGTCG AGGCCGGCGG CCGGCGTCAG GCCTTCTACA CCAACCTCAA GCCCGGCCAT
TACCGCTTCA GGGTCGTCGC CGCCAACAAT GACGGCGTCT GGAACGACAG GGGCGCGGCC
CTGGCCTTCG CCATTCCGCC GACCTTCGTC CAGACCAAGC TGTTCGCCGC CCTGTGCGTT
ATCGCGTTGT CGGGCCTGCT GTGGGGACTC TACGCCCTGC GCCTGCGCCA ACTGTCCGAC
CGCATTCACG GACGGCTGCA GGACCGCCTG GCCGAACGCG AGCGCATCGC CCGCGAGCTT
CACGACACCC TGCTGCAGGG CATCCAGGGC CTGATGCTGC GCCTGCAGTC GGTCGTCGAC
CAGATACCGC CCGACCAGCG GGCTCGCCAG GACCTGGAAC AGGCCCTGGA CCGCGCCGAC
AGCGTCATCG AGGAAGGTCG CGATCGGGTG AAGAGCCTGC GCGCCGACAA CCCCGCCGAT
CTTCCCAAGA TCCTGGCCGA CGTCGCCGAC CGCCTGGGGC TGGAGCCGGC GGTCAAGGTC
CAGGTGATCG CCGAAGGCCC GCCGCGACGC CTTCACCCCC TGGTCTGCGA GGAGATCGAG
CGGATCGCCA CCGAGGCCCT GTTCAACAGC CTGCGCCACG CTCAGGCGCG GAATGTCGAG
ATCTGCGTCA GCTATGGTCG CAGGGCGCTG GGCGTGCGCT TCCGTGACGA CGGCGTCGGG
CTGGACCAGA CGGTGCTCGA CACCGGAGGC CGCGAAGGCC ATTTCGGCCT GAAGGGGATG
AGCGAGCGGG CTCGAAAGAT TCAGGCCGAG TTCGAGATTC GCAGTCGCCC CGGCGCCGGC
GCCGAGATCG CGTTGACCGT GCCGGCCGCC GTCGCCTATC TGGCCGTCGG CCGGCGGTCC
TGGCCGTTCG CCATGCGCCG CGCCCAGCTG AGCGAGATCT GA
 
Protein sequence
MRTAARLLLT LAVLLGPFVG DRTLALDPAR GITQFKHTSW TAEDGAPTNI WDIAQSPDGY 
LWLGAARGLY RFDGVTFEPI PLADNDYSHS QHINALLVAR SGELWVGYRR GGVAVFRDGK
LRELPGTDRV GVLTMAQDQD GVVWVGVSDG QQNLARVVNG KLQSIDSSWN LPPGFVMDLH
VSRDGTLWVA IDGSLSFLRR GAKRFERTDI VLGKGAGLTE DAGGRLWVAD SLGARPLPNV
AQGENPTPGS TLYEVSDSAR YARILFDRDG SLWGTTFATG IFRVAKPAAV SGSERTSPVR
AETYEAKDGL SSNKAVPILQ DREGNVWIGT SAGLDRLRTA NVVVEPGVAR SSRFGYLQFA
DKDGVVHVAD SDTLYRAAPR QPPKVIRDHL DNPTALCQDK TGAIWLGTEN TLAPLEGARP
RGVSPPLGDK PYTGCVVDRR GDLWFALFSN GYARLDQRGW TMFPLSASVA AAFLALDNQG
RVVLSTSRAI SRVSPDGAVQ TLRVDPKLAM GGVRSLYQGP KDFLIGCEFG LLRLTGDRFE
ALQVARFPWA QGIQGVVQTP RGETWIVGAL GVVRLRTEDL DKAFGDPGRP LDYQEFDLKD
GLPGPPQQNG SKDAVVGGDG RIWVLTLEGV GWIDPAHIVR NTLPPPVSIR GVTVDGKTYG
DPRDLTLPRG ASKLQIDYTA LSLSIPERVR FRYQLEGVDK TWVEAGGRRQ AFYTNLKPGH
YRFRVVAANN DGVWNDRGAA LAFAIPPTFV QTKLFAALCV IALSGLLWGL YALRLRQLSD
RIHGRLQDRL AERERIAREL HDTLLQGIQG LMLRLQSVVD QIPPDQRARQ DLEQALDRAD
SVIEEGRDRV KSLRADNPAD LPKILADVAD RLGLEPAVKV QVIAEGPPRR LHPLVCEEIE
RIATEALFNS LRHAQARNVE ICVSYGRRAL GVRFRDDGVG LDQTVLDTGG REGHFGLKGM
SERARKIQAE FEIRSRPGAG AEIALTVPAA VAYLAVGRRS WPFAMRRAQL SEI