Gene Caul_5254 details


Gene Information

Locus tag: Caul_5254
Symbol:
ID: 5897266
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 187524
End bp: 189263
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 63%
IMG OID: 641555357
Product: integral membrane sensor hybrid histidine kinase
Protein accession: YP_001676688
Protein GI: 167621903
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
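
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(187524, 189263)
print(length_bp)                  # 1740
print(protein_length(length_bp))  # 579
```

Both values match the Gene Length and Protein Length fields of the record.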


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.601488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0228125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGAC ATTCGGTCGA CAAATACCGG CTCGCGCTCC CGGTTGATAT CAATCGTTCC 
TTGGCGTTCC GCGCTCTCCA CATCATGTCG GCGGGTCTTG TCTTCGGCGT GAACGTGCGC
TGGACATCAG CGGTGACCTG GACACTCATC GTGCTGTCGA TCGAGGTCTG GGAAGCGCTC
GACGCCGCGA AGCGGGGCAA GTCCGGGGCC CCCCTACGAA GGTCAGGCCG AACGCTCCGG
CGCATCGCGC TCAACATCGT CTGGGTCACG ATGCCGGTCA TCCTGTGGCT GATGGGTGAT
TTCGCGTCGC GCATCGTAGC GCTCGCAATG CTGACCACTC ACCTGGCCGT CGCGCTTAGC
TATTCGTTCA ACACGTCGAG AGCGGCCGTC GCCATCGGCC TGCCGCCAGC CGTGGCCTTC
TTCTTCCTGC CCGTTGCGCT GGGGGGCTTG TCGGGCCTCA AACTCGCCGC GGTGGCCGCC
TGCTTTGGTT TCTGCCTGTT CTATCTGGCC ATAATCGTCG AACAGAACCG CGCCAACGCC
CGCATCCTGC GCGGGGCCCA GGCTGAACTG CTTGAACAGA GGGAGGTCCT GCGCGCCCAG
ACGGAAGCCG CCAACGCGGC GAGCCAGGCC AAGTCGTCGT TCCTGGCGAT GATGAGCCAC
GAGTTGCGCA CCCCCATGAA TGGCGTTTTG GGAATGGCGC ATGCCCTGAC CCTGAGCAAG
CTGGACAGTC ATCAGGTTAG CCATCTCGAC ATGCTGCTGC GCTCGGGGCA GGGGCTCATG
ACGATCCTCA ATGATCTGCT CGACATCTCG AAGATCGAAG CTGGCAAACT GGAATTGGAG
ATCATACCGT TCGATCTGAG CGAGCTTGGT CGCCAGGTCG AGGACCTGTG GCGCGATGCG
GCCCATATCA AGGGGGTGAG CCTGGTCTGC GAAGTGGCGT TCAGTGGGCC GCATTGGGTG
TCGGGCGATC CGACACGCCT ACGCCAGGTG CTTATCAACC TTGTCTCCAA CGCCTTGAAA
TTCACCAGCC AGGGCGAAGT GCGGCTGTCG ATCTGGAGAT CGGAAGACGG CCTTTGCCAA
ATCGCCGTGA CCGACACCGG GCCAGGCATT CCCGTTAACC AGCAGGCATT GCTTTTCCAG
GCGTTTTCCC AGGCGGACGC CTCGATCACC CGCAAGTTTG GCGGAACCGG GCTTGGCCTG
GCGATCTGCA AGCAACTTGT CTCCCTGATG GATGGTCGTA TCGACCTGGA CAGCCGCGAA
GGCGTTGGAT CGACCTTCAC TGTCTCGCTT CCGCTGCCGA CCGCCGAGGC TGTTCAAGAG
GCCGAGCAGC GGGACGGCGC CATCACGCTG GCCGGCCTTG AAATCCTCGT CGCTGACGAT
AACGTCATTA ACCAGGCGGT TGCGCGCGCT ATCCTCGAGG CGTTCGACGC GAAGGTCACG
ATGGCCGGAG ATGGGCGGAT CGCCCTGTCC CGGCTGGCGA CCCGACGGTT TGACGTCGTG
TTGATGGACA TCCACATGCC GCTCATGGGT GGCGTGGAGG CTTTGGGGCG GGTCCGAGCC
GGGGAAGCAG GTCCATCCGA CGTGCCGGTG ATCGCGTTGA CGGCCGACGC CGTCACTGGT
GTCGACACCG TCTTGCTCGC GGCCGGATTC AATGACGTAA TCTCGAAACC GATCAATCCA
GGCGACCTGG TCTCCCGGAT CGCGGCGGCC GTGGCCTCGC GCGACCAACA GTCGATTTAG
 
Protein sequence
MARHSVDKYR LALPVDINRS LAFRALHIMS AGLVFGVNVR WTSAVTWTLI VLSIEVWEAL 
DAAKRGKSGA PLRRSGRTLR RIALNIVWVT MPVILWLMGD FASRIVALAM LTTHLAVALS
YSFNTSRAAV AIGLPPAVAF FFLPVALGGL SGLKLAAVAA CFGFCLFYLA IIVEQNRANA
RILRGAQAEL LEQREVLRAQ TEAANAASQA KSSFLAMMSH ELRTPMNGVL GMAHALTLSK
LDSHQVSHLD MLLRSGQGLM TILNDLLDIS KIEAGKLELE IIPFDLSELG RQVEDLWRDA
AHIKGVSLVC EVAFSGPHWV SGDPTRLRQV LINLVSNALK FTSQGEVRLS IWRSEDGLCQ
IAVTDTGPGI PVNQQALLFQ AFSQADASIT RKFGGTGLGL AICKQLVSLM DGRIDLDSRE
GVGSTFTVSL PLPTAEAVQE AEQRDGAITL AGLEILVADD NVINQAVARA ILEAFDAKVT
MAGDGRIALS RLATRRFDVV LMDIHMPLMG GVEALGRVRA GEAGPSDVPV IALTADAVTG
VDTVLLAAGF NDVISKPINP GDLVSRIAAA VASRDQQSI
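
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The sketch below shows one way to do that and to translate the cleaned DNA with the amino acid assignments of NCBI translation table 11. It is an illustration under stated assumptions: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, alternative start codons are not handled (this gene starts with ATG), and translation stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence as displayed in the record.
first_row = "ATGGCGCGAC ATTCGGTCGA CAAATACCGG CTCGCGCTCC CGGTTGATAT CAATCGTTCC"
print(translate(first_row))  # MARHSVDKYRLALPVDINRS
```

The printed residues agree with the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.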