Gene Caul_3769 details


Gene Information

Locus tag: Caul_3769
Symbol:
ID: 5901231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4084595
End bp: 4086727
Gene Length: 2133 bp
Protein Length: 710 aa
Translation table: 11
GC content: 70%
IMG OID: 641564292
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_001685394
Protein GI: 167647731
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
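
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4084595
end_bp = 4086727

# Assumption: coordinates are inclusive, so the span is end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# One codon is 3 bases; the last codon is assumed to be the stop codon,
# which does not encode an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2133 0 710
```

Under those assumptions the result matches the listed gene length (2133 bp) and protein length (710 aa).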


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.312808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.729669
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGGA CCGCCGCCCA TCACGCCTTT CGACGGACAA TGACGCTGAT CGCCGGCGGC 
AATCCGTTGC CCATGGTGCT GGAGGCCATC GTCCTGGCGG TCGAGGCCGA GGATCCGGGC
ATACTGTGCA GCGTCCTGCT GGTGGACGAG GCCGGCCAGA ACCTGCTGCT TGGCGCGGCG
CCCAGCCTGC CGGACGACTA CAACCAGGCC ATCGACGGCG TCGCCATCGG TCCCGCGACC
GGCTCCTGCG GCACGGCGGC GGCCCTTGGC CAGCGGGTGA TCGTCAAGAA CATCCAGGCC
GATCCCCTGT GGGCCGACTA CAAGCACCTG GCCGCCGCGG CCGACCTCGC CGCCTGCTGG
TCGCAGCCGA TTCGGGCCGC CAACGGCGCG GTGCTGGGCA CGTTCGCCAT CTATCACCGA
GAGATCGCCG TCCCCGACGA CGAGGACATC GCCTTCATCG AGGCGGCCGC CGACCTGGCG
GCGATCGCCA TCGATCGTCG GCGGGCAGAG GAGAACCTGG CGCTCAGCGA AGCCCGGGCC
CGCGCGGCGG CCGAGGCGGC GCGGGAGACG GCGCGCAACC TGACCACCTT CTTCGACGTC
TCGCTGGACA TGCTGGTCAT CCGCGACATG GAGGGCCGGT TCGTCAAGAG CAGCCGCGCC
TGGGAGACCG CGCTCGGCTA CCCGCTCGAG GCGCTTGAGG GCGCGTCCCT GCTATCCCTG
GTCCATCCGG ACGACGTGGC CGCCACCCAG GACTACATGC GCCACGCCGG TGACTGCGGC
GAGGTGTTCG GGTTCGTCAA CCGCTACCGC CACAGCGACG GCCACTATCG GCAGCTGGAA
TGGCGGGCGC GGCGGTCCGG CGACCTGGTG TTCGGCGTCG CCCGCGACGT CACCGAGCGG
CTGCGCGAAG CGGCGGAGAT GGAGGCGGCC AAGACGGCCG CCGAGGCCGC AAATCGCGCC
AAGAGCGACT TCCTGGCCAA TATGAGCCAC GAGATCCGCA CGCCGCTGAA CGGGGTGATC
GGCGTCGTCG CCGCCCTGGG CCAGACCGGG CTGACGCCGG CCCAGCGCGA GATGGTGGAC
CTGATCCAGA GCTCGGGCGA GACCCTTGAG CGGCTGGTGT CCGACATCCT CGACGTCTCG
AAGATCGAGG CCGGTCACCT GGCGATCGAG GAACGGGAGT TCGACCTCGA GGCGGAGCTG
GGCGGCCTGC TGGACATCGC CCGCCTGCGC GCCGAGGAAA AAGGCCTGAC CTTCCGCGTC
GAAGGGGGCG CCAGCGCGCG AGGCGTGTTC CTGGGCGACA GCATCCGCAT CCGGCAGGTG
CTGGGCAATC TGTTGTCGAA CGCCATGAAG TTCACCGGCC AGGGCGAGAT CGTCGCTCGG
ATCGAGGTCG CCGATCCGCC GGCGGGCGAG CAGGCCTCCC GGCTGACCCT GGAGGTCCAG
GACACCGGCG TGGGGTTCGA TCCGGATCTG GCGACCATGC TGTTCCAGCG TTTCAGCCAG
GCCGACACCA CCATCACCCG CCGGTTCGGC GGCACGGGCC TGGGGCTGTC GATCAGCAAG
ACCCTGGTCG AGATGATGGG CGGCCAGATC TCGGCGCAAT CCGAGCCCGG ACGCGGCAGC
CTGTTCCGGG TGGTCATTCC GTTGGCGCGG ACGGTGTCCC TGGCCGACTA TGACGCGCCG
CGGGGCGACC TCCTCCCGGC GACCCCGTCG TGCGCGCCGT CGTTCGACGA CCGGGGCGGG
CTGCGGGTGC TGCTGGCCGA GGACCACCCG GTCAACCGCA AGGTCGTCCA GCTGATCCTG
ACGCCCTACG ACATCGAACT GACCATGGTG GAGAACGGCG CCCTGGCGGT GGAGGCCTGC
AAGGCGACGC CCTTCGACCT GGTGTTGATG GACATGCAGA TGCCGATCAT GGACGGGCTG
GCCGCGACCC GGGCCATCCG CGCCCATGAA CGCGATCTGA CGACCGGCGC GCGCATCCCG
ATCATCGTGC TCAGCGCCAA CGCCATGGCC CACCACAAGC ACGACGCCCT GGCGGCCGGC
GCCGATCTGC ACGTCGCCAA GCCCGTCACC GCCGACGCCC TGCTCAGCGG CATCGAGCAG
GCCTTGAACG CCGGCCAGCG CCTGGCGATC TGA
 
Protein sequence
MSRTAAHHAF RRTMTLIAGG NPLPMVLEAI VLAVEAEDPG ILCSVLLVDE AGQNLLLGAA 
PSLPDDYNQA IDGVAIGPAT GSCGTAAALG QRVIVKNIQA DPLWADYKHL AAAADLAACW
SQPIRAANGA VLGTFAIYHR EIAVPDDEDI AFIEAAADLA AIAIDRRRAE ENLALSEARA
RAAAEAARET ARNLTTFFDV SLDMLVIRDM EGRFVKSSRA WETALGYPLE ALEGASLLSL
VHPDDVAATQ DYMRHAGDCG EVFGFVNRYR HSDGHYRQLE WRARRSGDLV FGVARDVTER
LREAAEMEAA KTAAEAANRA KSDFLANMSH EIRTPLNGVI GVVAALGQTG LTPAQREMVD
LIQSSGETLE RLVSDILDVS KIEAGHLAIE EREFDLEAEL GGLLDIARLR AEEKGLTFRV
EGGASARGVF LGDSIRIRQV LGNLLSNAMK FTGQGEIVAR IEVADPPAGE QASRLTLEVQ
DTGVGFDPDL ATMLFQRFSQ ADTTITRRFG GTGLGLSISK TLVEMMGGQI SAQSEPGRGS
LFRVVIPLAR TVSLADYDAP RGDLLPATPS CAPSFDDRGG LRVLLAEDHP VNRKVVQLIL
TPYDIELTMV ENGALAVEAC KATPFDLVLM DMQMPIMDGL AATRAIRAHE RDLTTGARIP
IIVLSANAMA HHKHDALAAG ADLHVAKPVT ADALLSGIEQ ALNAGQRLAI
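
The relationship between the two sequences above can be spot-checked by translating parts of the gene sequence. This is a minimal sketch, not the database's own pipeline: it builds the standard codon assignments (which translation table 11 shares for elongation; table 11 differs mainly in the alternative start codons it permits, and this gene starts with ATG anyway). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and the final line of the gene sequence shown above.
first_block = "ATGAGCCGGA CCGCCGCCCA TCACGCCTTT"
last_block = "GCCTTGAACG CCGGCCAGCG CCTGGCGATC TGA"

print(translate(first_block))  # MSRTAAHHAF
print(translate(last_block))   # ALNAGQRLAI
```

The two outputs match the first and last ten residues of the protein sequence. The final line of the gene sequence starts at base 2101, so it is in frame with the start codon.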