Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5316 |
Symbol | |
ID | 5897133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | - |
Start bp | 24254 |
End bp | 27463 |
Gene Length | 3210 bp |
Protein Length | 1069 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641550609 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001672095 |
Protein GI | 167621587 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.72275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.481403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGAAG CAGAAGCGCG GCGACTTGAA GCGCTTCGTA TAGTCGAGGC GCTGAACAGG CCACCCCAGG AACGGTTCGG CCTTGTGACG GCCTTGGCGG CCGACCTCTT CAAGACCCCG CTCGCCTTGG TGTCGCTGAC CGAGATTGAA GGCTCATCGC CCCTATCCAC GCCCGCCGTA GCAGCCAACA GTCCGCCGCA GGCCTGGACG TTGAGCGCCG AGGCTCTTGG CATGGGCCCC AATGCCCTGT TGGTCGTCGA GGATGCGGCG ACGGACCGGC GTTTCGCAAC CGATCCCCTG GTCACCGGCG CCGCCCATAT CCGTTTCTAC GCCGGCGTGG TGCTGACCAC GCGCGATGGC CGAACCCTCG GCACCTTGTG CGTCATCGAC ACCAAACCCC GTCGCCGTCC AACACCCGCC AAGCTCGACC GCCTGAAGAT GCTGGCCAGG ATCGTCGTTG ATGAACTGGA ACTGGCGCGC GCGCATCGGT TGGCCAGCGA AAAGCAGCGT CTGCTGGAAC TGACCGAGAG CGTGTCGGGC GTCGGCCATT GGCGCATTGA GGTCGGAGCC GGCGATGTGG GCTGGTCGGA CATGGTCTAT ACGATCCATG GGGTCAGCCG CGAAGACTTT GAACCCGACC TCCGGAACTC GCTAGCCCTC TATCATCCCG ACGACCGGGC CAAGTTGACG GCCTGCGTCA GGGAGGCCGT GGCCGCCAAG GGCTCTTTTG AAGCTCAACT GCGCATCAAC CGTCCAGACG GCGAGATTCG CGATGTGATC TCCAGGGGCG TGTGCGAACT CGATGAGCAC GGCGAGGCGA CGGCGGTGTT CGGCGTATTC CAGGACATCA CCGATCAGAC ACGCATCTTG AATGCGATCA AGCGCAGCGA ACGTCGCTAC AAGTTACTCG CCGACAATAT GAGCGACGTC GTCACGCGCA TCCGTCTGGA CGGCGGCAGC GGCTACATCT CGCCGACGAT CGAGCGGCTT CTCGGCTATC GACCTGAGGA GATGACCGGT CGCTCAGCTC AGGACTTTGT CCACCAAGAC GACCGATCCC AGATCCTGGC GATATTCGGA CAGATGGCCC AAGGTTTGGA GCAAAAGACC CTTCAACACA GAGCGATCCA CAAGGATGGG CGGACCGTCT GGGTCGAGAC CAGCTTCCGA TTGGTGCGGG ACGAACAGGA TCAGCCGCTC GAGATCGTGG CGGTAATCCG CGACGCGACA GACCGCAAGG CGCTTGAGGA CGCGACGATC GCCGCGCGCG ATGAGGCCCG AGAACAGGCC CAGAGAGCGG CAACGGCCGA GCGCATGGCC GGACTAGGTC ATTGGCGGTT GGAGGTGGCG ACGCGGTCGG TGACCTGGTC CGAACAGATG TACCAGATCT ACGGTCTCGA CCCCGCGTTG CCGCTCGACC TCGACGCCCT CCTGGCCATG ACCCATCCGG ACGATCAAGC CGAGTCCAGC CAGCGCTTGC ACCGCGCCCT CACGACGGGC CAGCCGACGA TGGAGGCCGT CTTCCGCCTC ATTCGGGCGG ACGGCCAGTT GCGAGACATT ACTGGAAACA TGCTGGTCCA AAAGGACGCC GATGGCCAAG TCATCGCCGT GGTCGGCACG ATGAGCGACG TCACCGAGCA GCAACACGCG CAGGCCGCCT TGGCCCAGAG CGAAGCGCGC TATCGACTCC TGGCGGAAAA CGCCAGCGAC CTGATCATGC ACAGCGACGT CAAGGGGCAG GTGACCTATG TCTCGCCATC AATCCTGCCC ACGACCGGCT ATGCACCCGA GGCGTTGGTC GGGACCAACA TCCTCGATTG GATCGCCCCC GAAGACGTAC CCGGCGTCCA AGCCGCCGTC GCCAAACAGT TCAAATCGCG AGGCGCGGAA CCGCCGATCG CCGTGGAATA TCGCGTGCGG CACAAGGATG GTCGCGAACT GTGGTTGGAA GCCCGCCCGA CCCTGGCCTT CGATCCGGAG ACTGGCGCCA TCACCGGCAT CACCGATGTC GTGCGCGACA TCTCCGCCCG TAGAGTGCTG GAAGCGGAGT TGCGCGCGGC GCGCGCCGAG GCCGAGGCCG CCGCCGCCGT GAAGTCGGAG TTCCTGGCCA ATATGAGCCA TGAACTGCGC ACGCCGCTGA CGGCGGTGCT GGGGTTCTCG CGGCTGGCCG AGGAGCAGCC GGAACTGTCG GACACCACGC GCGGTTATCT GAAGCGCGCC TGCAATGCCG GCCAGGCCTT GCTTCTTACG GTCAATGACA TCCTCGACTT TTCCAAGCTC GAAGCCGGGC AGGTCGAGAT CGCGCCACGA CCGATGTCGC CCGCGAAGTT GGCGACCGAA ACCCTCGACC TCTTCGCGAC CCAAGCCGCC GAGAAGGGGA TCGCGCTTCA CCTGGGCGGG CTGGAGACTC TGCCACGGAC CGTTCGGGCC GACCCCGACC GGGTGCGCCA GATATTGCTC AACCTGATCG GCAACGCCGT GAAATTCACG ATGGTCGGCA GCGTCCGCGT TGACGCCATC TTCGAACCGC TTGGCGGATT CCTGTCATTC GCGGTGACAG ACACTGGCCC GGGCGTTCCC GAGGATCGCG CCGATCAGCT GTTCCAACGT TTCTCCCAAG TTGATGCGTC CTCGACCCGC AAACACGGCG GCACCGGTCT TGGCTTGGCG ATCTGCAAGG GTCTCGCCGA AGCCATGGGC GGCGACATCG GGGCGCAGAG CAAGGTCGGT GAAGGGTCCT GCTTCTGGTT TACAATTCCA GCTCCCGAAC TGGACGTCGC CGCGCCCATC GCCGCTCCGC CACAAGGTCA GATCATGCTG CCCGCCGGCT GCCGAATCCT CGTGGCGGAT GACAATCGGA TCAACCGCGA TCTTGTCCGG GCGATGCTGT CGCCGTTCGA CGTCGAACTC ACCGAAGTCG TGGACGGACT CGGAGCTGTG GCGGCCGCCA ACGCGGCGCC GTTCGACGTC ATTCTCATGG ATCTTCGGAT GCCGGGCCTT GACGGCGCGG GCGCGGCCCG GTGCATCCGC ACCGAGGACG GGCCCAACGC CACCATTCCG ATCCTCGCCT TCTCGGCGGA CGTCGATCAG GCCCAAGCGA CGGGTCTGTT CGATGGCATG GTTGGCAAAC CCCTGACAGG GGTGAATTTA TTGACGGCGA TCGCCAAAGC CATGGCTTGG CCGGAGGATC CCCTTGATGA CGCAGCCTGA
|
Protein sequence | MREAEARRLE ALRIVEALNR PPQERFGLVT ALAADLFKTP LALVSLTEIE GSSPLSTPAV AANSPPQAWT LSAEALGMGP NALLVVEDAA TDRRFATDPL VTGAAHIRFY AGVVLTTRDG RTLGTLCVID TKPRRRPTPA KLDRLKMLAR IVVDELELAR AHRLASEKQR LLELTESVSG VGHWRIEVGA GDVGWSDMVY TIHGVSREDF EPDLRNSLAL YHPDDRAKLT ACVREAVAAK GSFEAQLRIN RPDGEIRDVI SRGVCELDEH GEATAVFGVF QDITDQTRIL NAIKRSERRY KLLADNMSDV VTRIRLDGGS GYISPTIERL LGYRPEEMTG RSAQDFVHQD DRSQILAIFG QMAQGLEQKT LQHRAIHKDG RTVWVETSFR LVRDEQDQPL EIVAVIRDAT DRKALEDATI AARDEAREQA QRAATAERMA GLGHWRLEVA TRSVTWSEQM YQIYGLDPAL PLDLDALLAM THPDDQAESS QRLHRALTTG QPTMEAVFRL IRADGQLRDI TGNMLVQKDA DGQVIAVVGT MSDVTEQQHA QAALAQSEAR YRLLAENASD LIMHSDVKGQ VTYVSPSILP TTGYAPEALV GTNILDWIAP EDVPGVQAAV AKQFKSRGAE PPIAVEYRVR HKDGRELWLE ARPTLAFDPE TGAITGITDV VRDISARRVL EAELRAARAE AEAAAAVKSE FLANMSHELR TPLTAVLGFS RLAEEQPELS DTTRGYLKRA CNAGQALLLT VNDILDFSKL EAGQVEIAPR PMSPAKLATE TLDLFATQAA EKGIALHLGG LETLPRTVRA DPDRVRQILL NLIGNAVKFT MVGSVRVDAI FEPLGGFLSF AVTDTGPGVP EDRADQLFQR FSQVDASSTR KHGGTGLGLA ICKGLAEAMG GDIGAQSKVG EGSCFWFTIP APELDVAAPI AAPPQGQIML PAGCRILVAD DNRINRDLVR AMLSPFDVEL TEVVDGLGAV AAANAAPFDV ILMDLRMPGL DGAGAARCIR TEDGPNATIP ILAFSADVDQ AQATGLFDGM VGKPLTGVNL LTAIAKAMAW PEDPLDDAA
|
| |