Gene Caul_5292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5292 
Symbol 
ID5897087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp285 
End bp1859 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content67% 
IMG OID641550585 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_001672071 
Protein GI167621563 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGAA CACGGGAAGC TGCGGAGCCG GACGCCGAGG TCGCTGAGCG CCTGGCGCGG 
ATCATCGAGT TGCTGCCAGC CGGAGCCGTC CACGTGCGGG CCGGCGTCCT GTCGATGAAT
GCGGCGATGG AAGAGGTCAC AGGGTATGCC CGCGCCGAAC TGCCGACGCT GGAAGCCTGG
TTCGAGAAAC TCTACGGCGA CACCTCCGGC AAGCTGATCG CTCACTATCT TGACCAACGG
GCGCAGGGTT TTCCGGCGAC GGTCGAGGGC TCGATCCGCC GCAGGGACGG CGAGGTGCGC
CGCATCGAGT TCCGCGCCTG TAACGACAGT GTAGGCGAAA TCTGGATCGT CAACGACATC
ACCGAGCGCG ACCTGATCGA GCGCGATTTG ATCGCCGCCA AGGACCGCGC GGAGGCCGGC
GCGCGGTCGA AATCCGAGTT TCTAGCCAAC ATGAGCCACG AGCTGCGCAC GCCGCTGACG
GCGATCATCG GGTTTGCAGA ACTGCTCGCC CGGGAGGGGT CGCTAGCTCC TCAGGAACGC
CACTGGCTGG CCCGCATCGA GGACGCCAGC AAGGCGCTGC ATTCGATCGT CAACGACGTC
CTGGACTTTT CCAAGCTGGA AGAGGGCGCC GTCGAGCTGG AGAGCGAGCC ATTCGGTCTG
CGCAAGCTAG TCGAAGACAC CGTGGCCCTT CTGGCGGATC AGGCAGAGCG CAAGGGTGTC
GGCCTGGCGA TTTTCGTGGA TGACGGGCTT TGTGATGGGC TGAGGGGCGA CACCGGGCGG
TTGCGGCAGA TCCTGCTCAA CCTGATCTGC AACGCCGTGA AGTTCACCAA TCATGGCGGG
GTGACGATCC GGGTTAGCGA GGACGTAGGG GCCGGCGGCC CGCCCCAAGT CCGGTTCGCC
ATTTCCGACA CCGGTATCGG CATTCCGGAC GCCGCCCTGG ACCAGGTGTT CCAACGCTTC
GTGCAGGCCG ACGGATCGGT CTCGCGAGAG TTTGGCGGCA CCGGCCTGGG CTTGGCCATC
TGTCGCCGGC TGGTGGAGTT GATGGGCGGA GAGATCGGGG TCGAGAGCCG GGTTGGATCG
GGCTCGACGT TTTGGTTCTC GGTCGCCCTG GCGCCGGCCG CCCATGTCGA ACAGCCGGGC
GCCGAGGACC TTGCCATCGA CGCGGGCGCG GTGCGCCTGC TGCTGGTGGA AGACACCGAG
GCGAACCAGG AGCTGGTCTC CACGATTCTC CGCTCGGTCG GCGTCGAGGT CGACATCGTC
TCGAACGGCG CCGAGGCGGT CGAGGCGGTG CAGGTCTGCG CCTATGACCT GGTGCTCATG
GACGTGCATA TGCCGGTAAT GGGCGGTGTC CAGGCGACCC AGATCATCCG AAGCCTGGGC
GGCGAGTTCG CCACCCTGCC GATCATCGCA CTCACCGCGA ATGTGCTGCC AGCCCAGGTG
GCCGACTACC ATCGCGCCGG CATGAACACG CACCTGGCCA AGCCGATCAA CCCCCGCGAG
ATGCTGGCCG TGATTGGCCG CTGGGCGGCG GCGGATCGCG CCGCCGAGCC CGCGCCGCAA
GACCTGCGGG CCTGA
 
Protein sequence
MARTREAAEP DAEVAERLAR IIELLPAGAV HVRAGVLSMN AAMEEVTGYA RAELPTLEAW 
FEKLYGDTSG KLIAHYLDQR AQGFPATVEG SIRRRDGEVR RIEFRACNDS VGEIWIVNDI
TERDLIERDL IAAKDRAEAG ARSKSEFLAN MSHELRTPLT AIIGFAELLA REGSLAPQER
HWLARIEDAS KALHSIVNDV LDFSKLEEGA VELESEPFGL RKLVEDTVAL LADQAERKGV
GLAIFVDDGL CDGLRGDTGR LRQILLNLIC NAVKFTNHGG VTIRVSEDVG AGGPPQVRFA
ISDTGIGIPD AALDQVFQRF VQADGSVSRE FGGTGLGLAI CRRLVELMGG EIGVESRVGS
GSTFWFSVAL APAAHVEQPG AEDLAIDAGA VRLLLVEDTE ANQELVSTIL RSVGVEVDIV
SNGAEAVEAV QVCAYDLVLM DVHMPVMGGV QATQIIRSLG GEFATLPIIA LTANVLPAQV
ADYHRAGMNT HLAKPINPRE MLAVIGRWAA ADRAAEPAPQ DLRA