Gene Caul_1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1238 
Symbol 
ID5898693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1299695 
End bp1301434 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content72% 
IMG OID641561723 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001682866 
Protein GI167645203 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0280549 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCG GCCACAAGCC GGGTCAAGCC TCGTCTCCGA CAGGAGGCTT CGGCGCGGCG 
TCGCTGTGGC GGCTCGGCTG GCTGACGACG GTGATCTTCG CGACCGGCGC GGCGGCCTTC
ACGCCGGGCG TCGTCGGGCT TCATGTCTGG CTGGCCCTGG CGGCCGGCGC GGTTCCGGCC
CTGGCGGCCC TGGTCATAGG GGATCGGGGC GGGGATCGCG GCGGCGACCG GCGAGACGAG
CGCCTGCAAT CCCTGATCCT GGTGCTGTGG GCCACCTGCG GCGCCGGCGC GGCTGTGGTG
ACTGGCGGGG TGTCGGGCGC GATGAGCGCC TGGATCCTGG CCCCGGTCGC GGCGGCCTCG
ACCCTGTCTT CCCCGCGACG CCTGGCCGAG GGCGCGACCC TGGCCCTGAT CGGCGGCGCG
GTGGCGGCCC TGACCCAGCT GTCCGGCCTG GCGCCCAGCG CGCCGACGGG ACCCCTGGGC
TTCATCCTCG GCTTCCTGTC GGTGGCCACC GTCGGCGTCG GCCTGGCGTC CGGCCTGATC
CTCACCCACC GCTGGGCCGG CGAGCGCGAC AGCCGCAATG TCGGCGAGCT CAATATCCTG
CAGGCCCTGG TCGACGGCGC GCCTCAGCTG CTGCTGGTGA TCTCGAGCAA TGGCCGGATC
AAGACCGTGC GCGGCGCCGC GCCGCGCGGC GTCGCGACCC TGGCCCTGAC CAGCGAAGGC
CTGGCCTCGG CGGCCGTGGC CGGCGACCGT CAGAAGGTCA GCGCCGCCCT GGTCGCCGCC
CTCGACGGCG GCGAGGCCAG CGTGTCCTTC GCCCCCCTGC TGGACCCGAC CCGAACCGTG
TGCCTGGACC TGCGCCAGGT CTCGCCCGAT ATGCTGGTCG GGGTGATGCG CGACGCCACG
ATCGACAAGG CCCGCGAGAC GGCCCTGCAG CAGGCCCGCG CCGAGGCCGA GGCCCTGGCC
GCCGGCCGGG CTCGCTTCCT GGCCAATATG AGCCACGAGC TGCGCACGCC GCTCAACGCC
ATCATGGGCT TTTCCGACAT CATGCGCGCC CGGATGTTCG GCCCCCTCAC CGACCGTTAT
GGTGAATACG CCGAGCTGAT CCATGAATCC GGTCGCCACC TGCTGGACCT GATCAACGAC
GTGCTCGACA TGTCGAAGAT CGAGGCCGAG CGCTTCGAAC TTCAACGCGG AGAGTTCGAC
GCCCGCGAGG CGGTGACGGC CGCCATGCGG TTGCTGCGGG TCCAGGCCGA CGCGGCCGGG
GTGCAACTGC GCGGCGTGCT TCCGCCGACC GACCTGGAGG TCGACGCCGA CCGCCGGGCG
CTCAAGCAGA TCGTGCTGAA CCTGGTGTCC AACGCCCTGA AGTTCACGCC GCGCGGCGGC
CAGGTGACGG TCACCGCCGA CGGCCATGAC GGCGAGTTCG AACTGGTGGT CGCCGACACC
GGCGTCGGCA TCAGCCCGGA AGACCTGGAA CGGCTGGGCC GTCCCTACGA GCAGGCCGGC
GGCGTCGACC AGCGGGCCAA GGGCACGGGC CTGGGCCTGT CGCTGGTGCG GGCCTTCGCC
AAGCTGCACG GCGGCGAGAT GCATATCGAA AGCCGGATGG GCGCCGGCAC CAGCGTCTCG
GTGCGCATGC CGGTGCTGCT GAAGCCCTCG CGCCCGGCGA TGGAGGCTCC GTTGCTGGAG
CCCGCGCCGG AGGAGCCGGA GCTTGGTCCG AACGTGATCA AGTTCGCCCC CCCGAGGTAG
 
Protein sequence
MQTGHKPGQA SSPTGGFGAA SLWRLGWLTT VIFATGAAAF TPGVVGLHVW LALAAGAVPA 
LAALVIGDRG GDRGGDRRDE RLQSLILVLW ATCGAGAAVV TGGVSGAMSA WILAPVAAAS
TLSSPRRLAE GATLALIGGA VAALTQLSGL APSAPTGPLG FILGFLSVAT VGVGLASGLI
LTHRWAGERD SRNVGELNIL QALVDGAPQL LLVISSNGRI KTVRGAAPRG VATLALTSEG
LASAAVAGDR QKVSAALVAA LDGGEASVSF APLLDPTRTV CLDLRQVSPD MLVGVMRDAT
IDKARETALQ QARAEAEALA AGRARFLANM SHELRTPLNA IMGFSDIMRA RMFGPLTDRY
GEYAELIHES GRHLLDLIND VLDMSKIEAE RFELQRGEFD AREAVTAAMR LLRVQADAAG
VQLRGVLPPT DLEVDADRRA LKQIVLNLVS NALKFTPRGG QVTVTADGHD GEFELVVADT
GVGISPEDLE RLGRPYEQAG GVDQRAKGTG LGLSLVRAFA KLHGGEMHIE SRMGAGTSVS
VRMPVLLKPS RPAMEAPLLE PAPEEPELGP NVIKFAPPR