Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0622 |
Symbol | |
ID | 5898077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 687161 |
End bp | 690202 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641561104 |
Product | histidine kinase |
Protein accession | YP_001682253 |
Protein GI | 167644590 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4585] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.378276 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACGG CGGCCAGACT GCTCTTGACG CTCGCCGTGC TGCTCGGACC GTTCGTGGGC GATCGGACCC TGGCGCTGGA CCCGGCGCGG GGGATCACCC AGTTCAAGCA CACCAGCTGG ACGGCGGAGG ACGGGGCGCC GACCAACATC TGGGACATCG CCCAGTCGCC CGACGGCTAT CTGTGGCTCG GCGCGGCGCG CGGCCTCTAT CGCTTCGACG GCGTGACGTT CGAGCCGATC CCGCTCGCCG ACAACGACTA TTCCCATTCC CAGCACATCA ATGCCCTGCT GGTGGCCCGA TCCGGCGAAC TCTGGGTCGG CTACAGGCGG GGCGGCGTCG CCGTCTTTCG CGATGGGAAG TTGCGCGAGC TGCCCGGAAC CGATCGCGTC GGCGTGCTGA CGATGGCCCA GGACCAGGAC GGCGTCGTCT GGGTAGGGGT CAGCGACGGC CAGCAGAACC TGGCCCGGGT CGTCAACGGC AAGCTGCAGA GCATCGACTC GTCGTGGAAC CTGCCCCCCG GCTTCGTGAT GGACCTGCAT GTGAGCCGTG ACGGGACGCT TTGGGTGGCG ATCGACGGCA GCCTGTCCTT CCTGCGTCGC GGGGCCAAGC GCTTCGAGCG CACGGATATC GTCCTCGGCA AGGGCGCCGG CCTGACCGAG GATGCGGGCG GCCGGCTGTG GGTGGCCGAT TCGCTCGGCG CCCGCCCGCT GCCCAACGTC GCCCAGGGCG AGAACCCCAC GCCCGGATCG ACGCTCTACG AAGTCTCGGA CTCGGCCCGC TACGCCCGCA TCCTGTTCGA CAGGGACGGC AGCCTCTGGG GCACGACGTT CGCCACTGGA ATCTTCCGCG TGGCCAAGCC GGCGGCCGTC TCCGGGTCCG AACGTACCTC TCCCGTCAGG GCCGAGACCT ACGAGGCCAA GGACGGCCTG AGCTCGAACA AGGCCGTGCC GATCCTGCAG GATCGCGAAG GCAATGTCTG GATCGGGACT TCCGCGGGCC TCGACCGCCT GAGAACGGCC AATGTCGTGG TCGAACCCGG CGTCGCCAGG TCGTCGCGCT TTGGCTACCT GCAGTTCGCC GACAAGGACG GCGTGGTCCA TGTCGCCGAC AGCGACACGC TCTACCGGGC CGCCCCCCGC CAGCCCCCGA AGGTGATCCG CGACCATCTC GACAACCCGA CCGCGCTTTG CCAGGACAAG ACCGGCGCGA TCTGGCTCGG CACGGAAAAC ACCCTGGCTC CCTTGGAGGG CGCCAGGCCG CGCGGGGTTT CCCCGCCCCT CGGGGACAAG CCCTATACCG GGTGCGTGGT GGATCGCCGT GGAGACCTGT GGTTCGCCCT TTTCTCGAAT GGATACGCCC GACTGGACCA ACGGGGTTGG ACGATGTTTC CCCTGTCCGC CAGCGTGGCG GCGGCGTTCC TGGCGCTCGA CAATCAGGGC CGCGTGGTGC TGTCGACCAG CCGGGCGATC TCCCGCGTCA GTCCGGACGG GGCCGTGCAG ACCTTGCGCG TGGATCCAAA GCTCGCGATG GGCGGCGTGC GCTCGCTCTA CCAGGGACCC AAGGACTTCC TGATCGGCTG CGAGTTCGGC CTGCTGCGCC TGACTGGCGA CCGCTTCGAG GCCCTTCAGG TGGCCCGTTT TCCCTGGGCC CAGGGCATCC AGGGCGTGGT CCAGACACCC CGCGGCGAGA CCTGGATCGT CGGCGCCCTG GGCGTGGTTC GACTGCGGAC GGAAGACCTC GACAAGGCCT TCGGCGATCC AGGACGCCCT CTGGACTATC AAGAGTTCGA CCTGAAGGAC GGGCTTCCGG GTCCGCCTCA GCAGAACGGC TCCAAGGACG CCGTCGTGGG CGGCGATGGC CGCATCTGGG TCCTGACGCT CGAGGGCGTC GGCTGGATCG ATCCGGCCCA TATCGTCAGG AACACCCTGC CTCCGCCGGT ATCGATCCGC GGCGTGACCG TCGACGGCAA GACCTATGGC GACCCGCGGG ACCTGACCCT GCCCAGGGGC GCCTCGAAGC TGCAGATCGA CTACACCGCC CTGAGCCTCA GCATCCCGGA GCGGGTGCGT TTCCGCTACC AACTGGAGGG CGTCGACAAG ACCTGGGTCG AGGCCGGCGG CCGGCGTCAG GCCTTCTACA CCAACCTCAA GCCCGGCCAT TACCGCTTCA GGGTCGTCGC CGCCAACAAT GACGGCGTCT GGAACGACAG GGGCGCGGCC CTGGCCTTCG CCATTCCGCC GACCTTCGTC CAGACCAAGC TGTTCGCCGC CCTGTGCGTT ATCGCGTTGT CGGGCCTGCT GTGGGGACTC TACGCCCTGC GCCTGCGCCA ACTGTCCGAC CGCATTCACG GACGGCTGCA GGACCGCCTG GCCGAACGCG AGCGCATCGC CCGCGAGCTT CACGACACCC TGCTGCAGGG CATCCAGGGC CTGATGCTGC GCCTGCAGTC GGTCGTCGAC CAGATACCGC CCGACCAGCG GGCTCGCCAG GACCTGGAAC AGGCCCTGGA CCGCGCCGAC AGCGTCATCG AGGAAGGTCG CGATCGGGTG AAGAGCCTGC GCGCCGACAA CCCCGCCGAT CTTCCCAAGA TCCTGGCCGA CGTCGCCGAC CGCCTGGGGC TGGAGCCGGC GGTCAAGGTC CAGGTGATCG CCGAAGGCCC GCCGCGACGC CTTCACCCCC TGGTCTGCGA GGAGATCGAG CGGATCGCCA CCGAGGCCCT GTTCAACAGC CTGCGCCACG CTCAGGCGCG GAATGTCGAG ATCTGCGTCA GCTATGGTCG CAGGGCGCTG GGCGTGCGCT TCCGTGACGA CGGCGTCGGG CTGGACCAGA CGGTGCTCGA CACCGGAGGC CGCGAAGGCC ATTTCGGCCT GAAGGGGATG AGCGAGCGGG CTCGAAAGAT TCAGGCCGAG TTCGAGATTC GCAGTCGCCC CGGCGCCGGC GCCGAGATCG CGTTGACCGT GCCGGCCGCC GTCGCCTATC TGGCCGTCGG CCGGCGGTCC TGGCCGTTCG CCATGCGCCG CGCCCAGCTG AGCGAGATCT GA
|
Protein sequence | MRTAARLLLT LAVLLGPFVG DRTLALDPAR GITQFKHTSW TAEDGAPTNI WDIAQSPDGY LWLGAARGLY RFDGVTFEPI PLADNDYSHS QHINALLVAR SGELWVGYRR GGVAVFRDGK LRELPGTDRV GVLTMAQDQD GVVWVGVSDG QQNLARVVNG KLQSIDSSWN LPPGFVMDLH VSRDGTLWVA IDGSLSFLRR GAKRFERTDI VLGKGAGLTE DAGGRLWVAD SLGARPLPNV AQGENPTPGS TLYEVSDSAR YARILFDRDG SLWGTTFATG IFRVAKPAAV SGSERTSPVR AETYEAKDGL SSNKAVPILQ DREGNVWIGT SAGLDRLRTA NVVVEPGVAR SSRFGYLQFA DKDGVVHVAD SDTLYRAAPR QPPKVIRDHL DNPTALCQDK TGAIWLGTEN TLAPLEGARP RGVSPPLGDK PYTGCVVDRR GDLWFALFSN GYARLDQRGW TMFPLSASVA AAFLALDNQG RVVLSTSRAI SRVSPDGAVQ TLRVDPKLAM GGVRSLYQGP KDFLIGCEFG LLRLTGDRFE ALQVARFPWA QGIQGVVQTP RGETWIVGAL GVVRLRTEDL DKAFGDPGRP LDYQEFDLKD GLPGPPQQNG SKDAVVGGDG RIWVLTLEGV GWIDPAHIVR NTLPPPVSIR GVTVDGKTYG DPRDLTLPRG ASKLQIDYTA LSLSIPERVR FRYQLEGVDK TWVEAGGRRQ AFYTNLKPGH YRFRVVAANN DGVWNDRGAA LAFAIPPTFV QTKLFAALCV IALSGLLWGL YALRLRQLSD RIHGRLQDRL AERERIAREL HDTLLQGIQG LMLRLQSVVD QIPPDQRARQ DLEQALDRAD SVIEEGRDRV KSLRADNPAD LPKILADVAD RLGLEPAVKV QVIAEGPPRR LHPLVCEEIE RIATEALFNS LRHAQARNVE ICVSYGRRAL GVRFRDDGVG LDQTVLDTGG REGHFGLKGM SERARKIQAE FEIRSRPGAG AEIALTVPAA VAYLAVGRRS WPFAMRRAQL SEI
|
| |