Gene Caul_2768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2768 
Symbol 
ID5900223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3006612 
End bp3007613 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content65% 
IMG OID641563260 
Productsignal transduction histidine kinase 
Protein accessionYP_001684393 
Protein GI167646730 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000214374 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTCCGCTT CAGAGTCTAC CCCTCAGGTC TGGGACACAA AGCAGCTCCG ACGGGCTCTC 
GAGGCGGCGG GCGTCGCCCT CTGGTCGTGG AATGTCGATT CCGATCAGCT GATCATGGAC
AAGCAGGGCT ACGACCTGTG GGGCGTGCCG ATCACGACGG CGGTGTGTTT CGAGGATCTC
TCCGCCCACA TCCACCCCGC GGATCGCGAC CGGGTTCGAG CCGCCTTCTC GGCGACACGC
GGAATTCTTG GACCCTATGA AATAGATTTT CGCGTCACGG TCGAAGGCGA TGTCCGATGG
ATTTCGGCCC GTGGCCAGGG CGATGACGAG GGCATTATCG GCCGCGTCAT GGTCGGCGTT
TTTCTCGATG TCACCGGCCG CAAGCAGGCG GAGGAGGCCA ACGAACTGCT GGCCGGCGAG
ATGAGTCACC GCGTCAAGAA TCTCCTGACA ATCGCCTCGG CCCTGACCGC CATCACCTCG
CGTTCGACAG AAACGACGAC GGACATGGCG CGCGAACTGA CCGACCGCCT CACCTCCTTG
GGCCGCGCTC ACGACCTCGT TCGCCCGATC CCCGGCCAAG ACGGCAAGGC GGCGCTGCTT
GGCGATCTGA TTTCCGTTCT TCTCGCGCCC TATGACGACC TGGACGCCTT CAGCGGTCGC
ATCCGCGTCT CCGTCCCTCG CATGGGGGTG GGCGAGACCG CGGCCACCAC CCTGGCGCTG
GTCATCCACG AACTGGCGAC CAATTCCGTG AAATACGGGG CGCTCTCGGT CGCGGCCGGC
ACGCTGGATG TTTCGTGCAC AGCTCAGGAC CAGGACGTCG TGATAGTCTG GACCGAGCAT
GGAGGTCCAC CCGTTGCCGC TCCAGACGGC CCCGGCGGGT TCGGGAGCAA GCTGGTCACC
CGGGGAATGT CGGCACAGCT GGGCGGGTCC ATCACCTACG ACTGGCCCGA GCACGGCGTC
ATCGCCACGC TGCGGATGCT CAGGGACCGT CTCGCCACCT GA
 
Protein sequence
MSASESTPQV WDTKQLRRAL EAAGVALWSW NVDSDQLIMD KQGYDLWGVP ITTAVCFEDL 
SAHIHPADRD RVRAAFSATR GILGPYEIDF RVTVEGDVRW ISARGQGDDE GIIGRVMVGV
FLDVTGRKQA EEANELLAGE MSHRVKNLLT IASALTAITS RSTETTTDMA RELTDRLTSL
GRAHDLVRPI PGQDGKAALL GDLISVLLAP YDDLDAFSGR IRVSVPRMGV GETAATTLAL
VIHELATNSV KYGALSVAAG TLDVSCTAQD QDVVIVWTEH GGPPVAAPDG PGGFGSKLVT
RGMSAQLGGS ITYDWPEHGV IATLRMLRDR LAT