Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2768 |
Symbol | |
ID | 5900223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3006612 |
End bp | 3007613 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641563260 |
Product | signal transduction histidine kinase |
Protein accession | YP_001684393 |
Protein GI | 167646730 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000214374 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGTCCGCTT CAGAGTCTAC CCCTCAGGTC TGGGACACAA AGCAGCTCCG ACGGGCTCTC GAGGCGGCGG GCGTCGCCCT CTGGTCGTGG AATGTCGATT CCGATCAGCT GATCATGGAC AAGCAGGGCT ACGACCTGTG GGGCGTGCCG ATCACGACGG CGGTGTGTTT CGAGGATCTC TCCGCCCACA TCCACCCCGC GGATCGCGAC CGGGTTCGAG CCGCCTTCTC GGCGACACGC GGAATTCTTG GACCCTATGA AATAGATTTT CGCGTCACGG TCGAAGGCGA TGTCCGATGG ATTTCGGCCC GTGGCCAGGG CGATGACGAG GGCATTATCG GCCGCGTCAT GGTCGGCGTT TTTCTCGATG TCACCGGCCG CAAGCAGGCG GAGGAGGCCA ACGAACTGCT GGCCGGCGAG ATGAGTCACC GCGTCAAGAA TCTCCTGACA ATCGCCTCGG CCCTGACCGC CATCACCTCG CGTTCGACAG AAACGACGAC GGACATGGCG CGCGAACTGA CCGACCGCCT CACCTCCTTG GGCCGCGCTC ACGACCTCGT TCGCCCGATC CCCGGCCAAG ACGGCAAGGC GGCGCTGCTT GGCGATCTGA TTTCCGTTCT TCTCGCGCCC TATGACGACC TGGACGCCTT CAGCGGTCGC ATCCGCGTCT CCGTCCCTCG CATGGGGGTG GGCGAGACCG CGGCCACCAC CCTGGCGCTG GTCATCCACG AACTGGCGAC CAATTCCGTG AAATACGGGG CGCTCTCGGT CGCGGCCGGC ACGCTGGATG TTTCGTGCAC AGCTCAGGAC CAGGACGTCG TGATAGTCTG GACCGAGCAT GGAGGTCCAC CCGTTGCCGC TCCAGACGGC CCCGGCGGGT TCGGGAGCAA GCTGGTCACC CGGGGAATGT CGGCACAGCT GGGCGGGTCC ATCACCTACG ACTGGCCCGA GCACGGCGTC ATCGCCACGC TGCGGATGCT CAGGGACCGT CTCGCCACCT GA
|
Protein sequence | MSASESTPQV WDTKQLRRAL EAAGVALWSW NVDSDQLIMD KQGYDLWGVP ITTAVCFEDL SAHIHPADRD RVRAAFSATR GILGPYEIDF RVTVEGDVRW ISARGQGDDE GIIGRVMVGV FLDVTGRKQA EEANELLAGE MSHRVKNLLT IASALTAITS RSTETTTDMA RELTDRLTSL GRAHDLVRPI PGQDGKAALL GDLISVLLAP YDDLDAFSGR IRVSVPRMGV GETAATTLAL VIHELATNSV KYGALSVAAG TLDVSCTAQD QDVVIVWTEH GGPPVAAPDG PGGFGSKLVT RGMSAQLGGS ITYDWPEHGV IATLRMLRDR LAT
|
| |