Gene Caul_3668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3668 
Symbol 
ID5901123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3961692 
End bp3963653 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content69% 
IMG OID641564179 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_001685293 
Protein GI167647630 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0873112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0648404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGATC GGAGCGATTT GACCGCCTTT CAGACCGATG AGGGCAGGTA CAAGCTCCTG 
GTCGACGCGA TCACCGACTA CGCGGTCTAC ATGCTCGACC CGACGGGCCG GGTGATCAGC
TGGAACGCCG GCGCCGAACG GTTCAAGGGC TACAAGCCTC ACGAGATCAT CGGCCAGCAT
TTTTCGCGCT TCTACACCGA AGAAGATCGC GCGGCGGGCG TTCCCGCGGC GGCTCTGGCT
CAGGCGATCC GCGAAGGGCG CTTCGACCAG GAGGGCTGGC GGGTCCGCAA GGACGGCAGC
CGCTTCTGGG CCCACGTGGT CATCGATCCC ATCGTCCTAC CCGGTGGCGA ACTGGTGGGT
TTCGCCAAGA TCACGCGGGA CCTGACGGAG CGCAGAGCCG CCGAGGAAAC CCTGCGCCGT
AGCGAAGAGC AGTTCCGGCT GCTGGTGGAA GGGGTCACCG ACTACGCCAT CTACATGCTG
GATCCGACGG GGCGGGTGTC GAGCTGGAAC GCCGGCGCCC AACGGATCAA GGGCTACGGA
CCCGACGAGA TCATCGGCGA GCACTTCTCG CGGTTCTATA CCGAAGAGGA TCGCCGCGCG
GGGGGACCGG AGGCCGCCCT GAGGATCGCC GCCGCCGAGG GGCGATGCGA GAAGGAAGGC
TGGCGGTTGC GCAAGGACGG GACCCGCTTC TGGGCCCACG TCATCATCGA CCCCATACGC
GACGACCAGG GCCAGGTGAT CAGTTTCGCC AAGATCACCC GCGACATCAC CGAGCGCCGC
CAAACCCAGC GGGCGCTGGA GGAGGCGCGG GAAGCCCTGT TCCAGGCGCA GAAGCTGGAA
GCCATCGGCC AACTGACCGG CGGCCTGGCT CACGATTTCA ACAACCTGCT GACGGCGGTG
CTGGGCAGCC TGGAGCTTGT CCGCAAGCGG CTTCCCGAGG ACCCGCGGAT TTCGCCCTTG
ATCGACAACG CGATCCACGG CGCCCAGCGC GGCGCGGTGC TGACCCAGCG CATGCTCGCC
TTCGCCAGGA AGCAGGAGCT GAGGCTGGAA CCGGTGGACC TGCCCGCCTT GGCGACCGGC
ATGGCCGGGC TGTTCCAACG CTCGGTCGGC CCGGCGATCC AGATCCAGAC CTCTTTCCCG
GCGGGGCTCG CGCCGGCGCT CACCGACGCC AACCAGCTGG AAAACGCGCT GCTGAACCTG
GTGGTCAACG CCCGGGACGC CATGCCCGAG GGCGGCGTGA TCCGGATCGA GGCCTCCAAC
GAAACCGTCG CCGCGGCGTC CGCCAGCGGC CTGCCGCCGG GCGATTATGT TCGGCTGGCG
GTGATCGACA CCGGCCAGGG CATGGACGCC GAAACAAGGG CGCGCGCCAC CGAACCGTTC
TACACGACCA AGGGCGTCGG CAAGGGCACG GGCCTGGGGC TCTCCATGGT CCACGGCCTG
GCCGAGCAGT CCGGCGGCCG CCTGCTGATC CGCAGCGAGG CCGGCGAGGG AACCGCCATG
GAGATGTGGC TGCCGCTCGC CGAGCGCGGA GGCGAGCCGG CTTCGCTGGC GGCGCCGCCC
GAGCAGGGCG AACCCGACGA CGGCGGCCCT CCGCTGACGA TCCTGGCCGT CGACGACGAC
AGCCTGGTGC TGATGAACAC CTTGGCCATG CTGGAGGACC TGGGCCATCG CGTGCTGCCG
GCGTCATCGG GTCTGGAAGC CTTGGCCATA GCCGAGCGCG AGCAGGTCGA CCTGGTCATC
ACCGACTACG CCATGCCGAC GATGAACGGC GTTCAACTGC TGGAAGCGCT CAGAAAGCGA
AATCCCGATC TGCCCGCCTT GCTGGCGACC GGCTACGCGG AGCTGCCGGC CGGCGGCGGC
GGCGGCGACC TTCCCCGCTT GGCCAAGCCG TACCTGCAGG ACGATCTGAG GCGCGCGCTG
CGGCCGATGG TCGATCGGCG ACGTCGCGAG ACCGCGGCAT AG
 
Protein sequence
MDDRSDLTAF QTDEGRYKLL VDAITDYAVY MLDPTGRVIS WNAGAERFKG YKPHEIIGQH 
FSRFYTEEDR AAGVPAAALA QAIREGRFDQ EGWRVRKDGS RFWAHVVIDP IVLPGGELVG
FAKITRDLTE RRAAEETLRR SEEQFRLLVE GVTDYAIYML DPTGRVSSWN AGAQRIKGYG
PDEIIGEHFS RFYTEEDRRA GGPEAALRIA AAEGRCEKEG WRLRKDGTRF WAHVIIDPIR
DDQGQVISFA KITRDITERR QTQRALEEAR EALFQAQKLE AIGQLTGGLA HDFNNLLTAV
LGSLELVRKR LPEDPRISPL IDNAIHGAQR GAVLTQRMLA FARKQELRLE PVDLPALATG
MAGLFQRSVG PAIQIQTSFP AGLAPALTDA NQLENALLNL VVNARDAMPE GGVIRIEASN
ETVAAASASG LPPGDYVRLA VIDTGQGMDA ETRARATEPF YTTKGVGKGT GLGLSMVHGL
AEQSGGRLLI RSEAGEGTAM EMWLPLAERG GEPASLAAPP EQGEPDDGGP PLTILAVDDD
SLVLMNTLAM LEDLGHRVLP ASSGLEALAI AEREQVDLVI TDYAMPTMNG VQLLEALRKR
NPDLPALLAT GYAELPAGGG GGDLPRLAKP YLQDDLRRAL RPMVDRRRRE TAA