Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5320 |
Symbol | |
ID | 5897084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | - |
Start bp | 29932 |
End bp | 32643 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641550613 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001672099 |
Protein GI | 167621591 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG3300] MHYT domain (predicted integral membrane sensor domain) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.544718 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAAAA TATTGCTGTG CGTCGCCACA GAACACAATT GGTGGCTGGT CGTGGTCGCC GCGCTCGTCT GTGTCCCGGC CACGCTGGCG ACGTTCTTCC TCTATTCGAA GGTTCCAACG TTTCCGGTGT GGAGGCGTTG GACCTGGCTC GCCATGACGG GCTTGGTCGC CGGTTCTGGC ATCTGGACGA CGCATTTTGT CGCCATGCTG GCGTTTAAGA CTGGGCTACC GACGGGTTAT GAGGCCCTGG CCACAATGGG GTCGCTGGCT GTCGCCGTCA TCAGCACCAC CTTGGGCTTT GCCGCGGGTT CAGCGACGAC TGCAGATTCT CGGCGGCGAC TGACGTCGAT CGGCGGCGGC CTCGTCGTCG GCCTGGGCAT CACCTCCATG CACTACGTCG GCATGAGCGG CTACCGGACG ACCGGCGTCT TCCAGTGGGA CGTGAACTAC ATCGTGGCCT CCGTGCTGAT TGGCGCGCTG TTCGCTAGCG CGGCCCTCTT TGTCGCCCGC CCCGGCGCGG GGCTTAAGCA GCAAGCCGCA GGGGGAGGTC TGCTCAGTCT CGCGATCGTC GGGATGCACT TCACGGGGAT GACCGCCGTG ACGATAGTTC CTGATCTCAG CATCGCGGTG CCCGCTTCGC TGATGTCGGA TCCGGTGATG GCGGCCGTCG CCGTGGCGGT CACTGCGCTG ATCCTTATCA CCGCGATCGG CGGCGTGGCC CTCGACGCCG CCAGCCGCAA TGGAAACCTT CGCCGCCTGC GCGAGGCGCT CGATGTCATG CCGGAAGGCT TGGCCTTCTA CGACGCGAAC GATCGGCTGG TGGCGTGGAA CACGCAGTAC GACGATCTTT GCAGGACATC TGGGGCGATC CTCGTCGCTG GCATGCCATT CTCCGACTTG CTGGAATCGA GCCTTGTCCA TGGCGTCTAC CCTGAGGCGG TCGGGCGAGA GACGGAGTGG CTCGCCGAAC GAAACGCAGC TCGCCGTGAC GAGGCGCCCA GCCTGACGCA GCAAACCGCT GGCGGACGAT GGCTGCGGAT CACCGAACGG CGCACTGGCG ATGGGGGCAC TGTCTCGGTC AGCGTGGATA TCACCGACCT GAAGCGCACG GAAGCGGCCA TGGCCCAGGC GCGCGACAAG GCCGAGGAGC AAGCCCGGCG GGCTGAGGTC GCCGAGGGGG TTGCCGGCCT TGGCAACTGG CGCGTGGATG CGCGCACTCG CGACGTCACC TGGTCGACCC AGATGTACCA TATCTTCGGC CTCGCCTCCG ACGCGCCGCT CGACCTCGAG GCGCTGATAG GCATGATCCA TCCTGACGAC GCTGAGGCTG TGGCGGCCCG ACTGAAGCGC CAACTCGCGA CAGGTGAGGT CGACGAGAAT TCGATCTCAC GGATCGTGCG TGCCAACGGC GAGGTCCGAT ACCTGGCGCG CAATTCTCGC GCCGAACACG GTCCCGGCGG CGAGGTCATC GCCATGATCG GAACCATGGT GGACGTGACC GACCAAAAGC TCCTCGAGGC CCGGCTTCGG CTAGCGCGAG CCGAAGCCGA AGCCGCTGCG GGGGTGAAGG CGGAGTTTCT CGCCAATATG AGCCATGAGC TGCGCACGCC GCTGACCAGC ATCATCGGCT TCACGAGTCT CGCGGCCGAG CAGAGCGACC TGACCGACCT GACCCGCACC TATGTGGAAC GCGTCGGCGA CGCCAGCCGT GCGCTCCTGT GCACCGTCAA CGACATCCTG GATTTCTCGA AGCTGGAAGC CGGACAGGTG AGCTTCCAGG TCCAACCGGC CTCCTTGGCC AAGCTCAGCC GCGCCACATT GGACCTGTTC ACGCCACAGG CCGGCGCCAA AGACCTGAAC CTCACCCTGG ATGGGGAGGC CGCAGACGAC GATCTGATCA TTGCAGTCGA TCCGGACCGG ATACGCCAAA TCCTTCTCAA CCTTGTCGGC AATGCGGTGA AGTTCACCAC GGGCGGCAGC GTTACCCTGC GCACGCGCTA CGATCGCGCC GCCGAGGTCC TCAGCGTCGA TGTGATCGAT ACCGGAGAAG GCGTTGCCCC GGACAAGCAG GATCGCCTCT TCAAGCGGTT TTCGCAGGTC GATGGGTCGT TGACGCGGGT TCAAGGCGGA ACGGGCCTGG GCTTGGCGAT CTGCAAAGGC TTGGTGGAGG CCATGGGCGG GGAGATCGGC GTCGAGAGCC GGATGGGTGA GGGCAGTCGG TTCTGGTTCA AGGTCCCCGC GCCCTTGTCG AGCCTTTCGC AAGGCAACAC CGATGGCTTG GCGATGGAAC GGCTGACGTT CGGCGGCGTC CGCGTGCTCG TGGTCGACGA CCACCCGACC AATCGCGAAC TGGCGCGCTT GTTCTTGGCA GGCGTCGGCG CCGAGGTTTC CGAAGCAGTT GATGGCGAAG AGGCTGCACA GATGGCGGCG GAGTGGCCAT ACGACGTGAT CCTCATGGAC CTGCGCATGC CAAGGCTCGA TGGGCTGGGC GCCTTGCGGA GAATACGCGC CTCGCAGGGC CCAAATGACG CCACCCCCAT CCTGGCCTTC ACTGCGGACG CAGACACCAA TATGGCGGAC CGATTGATTT CCGCTGGCTT TCAGGACGTC GTCGCCAAGC CGGTAGGCGC CGGAGCGCTG ATCGCCTCCA TAGCTCGAGC CACGGCGTTC GCGGAGGATC CGCAGCCTCA GGAATTCGCC GATGTCGGTT AG
|
Protein sequence | MFKILLCVAT EHNWWLVVVA ALVCVPATLA TFFLYSKVPT FPVWRRWTWL AMTGLVAGSG IWTTHFVAML AFKTGLPTGY EALATMGSLA VAVISTTLGF AAGSATTADS RRRLTSIGGG LVVGLGITSM HYVGMSGYRT TGVFQWDVNY IVASVLIGAL FASAALFVAR PGAGLKQQAA GGGLLSLAIV GMHFTGMTAV TIVPDLSIAV PASLMSDPVM AAVAVAVTAL ILITAIGGVA LDAASRNGNL RRLREALDVM PEGLAFYDAN DRLVAWNTQY DDLCRTSGAI LVAGMPFSDL LESSLVHGVY PEAVGRETEW LAERNAARRD EAPSLTQQTA GGRWLRITER RTGDGGTVSV SVDITDLKRT EAAMAQARDK AEEQARRAEV AEGVAGLGNW RVDARTRDVT WSTQMYHIFG LASDAPLDLE ALIGMIHPDD AEAVAARLKR QLATGEVDEN SISRIVRANG EVRYLARNSR AEHGPGGEVI AMIGTMVDVT DQKLLEARLR LARAEAEAAA GVKAEFLANM SHELRTPLTS IIGFTSLAAE QSDLTDLTRT YVERVGDASR ALLCTVNDIL DFSKLEAGQV SFQVQPASLA KLSRATLDLF TPQAGAKDLN LTLDGEAADD DLIIAVDPDR IRQILLNLVG NAVKFTTGGS VTLRTRYDRA AEVLSVDVID TGEGVAPDKQ DRLFKRFSQV DGSLTRVQGG TGLGLAICKG LVEAMGGEIG VESRMGEGSR FWFKVPAPLS SLSQGNTDGL AMERLTFGGV RVLVVDDHPT NRELARLFLA GVGAEVSEAV DGEEAAQMAA EWPYDVILMD LRMPRLDGLG ALRRIRASQG PNDATPILAF TADADTNMAD RLISAGFQDV VAKPVGAGAL IASIARATAF AEDPQPQEFA DVG
|
| |