Gene Caul_0932 details

Gene Information

Locus tag: Caul_0932
Symbol:
ID: 5898387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 980595
End bp: 982874
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 71%
IMG OID: 641561415
Product: signal transduction histidine kinase
Protein accession: YP_001682561
Protein GI: 167644898
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
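
The start and end coordinates, gene length, and protein length listed above agree with one another. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon counted in the gene length but not in the protein length. Both assumptions are conventional but are not stated in the record itself.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 980595
end_bp = 982874
gene_length_bp = 2280
protein_length_aa = 759

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # prints: 2280 760 759
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus one matches the listed protein length.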


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGACA CCCGTGAACC GGGGCGCGCC GTCGAGGAAA CGCCGTGCGG CGACATGCGC 
CGGCGGATAG AGGCGTTCGA CTGGGCGGCG ACGCCGCTGG GTCCGCGCGA GACCTGGTCG
CCTCGCCTGA CCTTCGCCGT CGACATGATC GTGGCCAGCC AGTTCCCCAC CGCCCTGCGT
TGGGGTCCCG AGCTGGTGCT GATCTATAAC GACGCCTATG CGCCGATGCT GGGCGATCGC
CATCCCGGGA CGCTGGGCAA GACCTTCGCC GAGGCGCCGC CCAGTTCGGC GGTCGAGTCC
GAGGGGCGGG AACGCGCGGT CGTGGCCGGC GAAAGCGGCG GCGAGATCAT CGAGGACCTG
TTCCTGCTGA CCGAGCACGC CGACGGAACG GTGACCGAGG GCTATTTCAC GATCCGCCTC
AGTCCGCTTC CCGACCCCGC GACCGCCACC GGGGTCGGCG GGGTGCTGAT CGCCCTGTCC
GACACCACCC GCCGCGTGCG GGCCGAGCAG GCGCTGCGGG CCAGCGAGGA GCGCTATCAA
CTGGCGCTCG AGGCGGCCAG CGGCGTGGGC ACCTGGGACT GGGACATCGT CGCCGACAAG
GTCTATGCCG ACGCCCGCTA CGCGACCTTC CACAATGTCG ATCCCGAGCG CGCCGCCGCC
GGGGCGCCCC TGGCCGAATA TAGCCGCGCG CTGCACCCCG ACGATTTCCA GCGGCTGCTG
GACTCGGGGC GCACGCACCT GGAGACCACC GGCGACTTCC TGGAGGAATA CCGGCTGATC
CAGGCCGATG GGTCGGTCCG CTGGGTGCAG GCCCGCGGCC AGGTCTATCG CGACGCCCGG
GGCTGGGCGG TGCGCCATCG CGGGGTGATG GTCGATATCA CCGAGCGCAA GCGGATCGAG
GCCGCGCTCG AAGCCACCGA GGCCGATCTG CGCATCGCCA TCGAGGCCGC GGGCCTGGGC
CGATGGGACA ACAACCCCGC CACCGGCCAG CGGTACTGGG ATCAAAGGAC GCGCGAGATC
TTCGGCCTGC CGCTGGAGGG GACGGCAGCC TCCCCCGAGA TCGTGGCGCG GCTGATCCAT
CCTGACGATC TGGGGCGCTT TCAGGAGGCG GTCCGCGAGG CCATCGATCC CGCCGATCCG
ACCGCCGACC ACGTGTTCAA CCAGGAATAC CGGATCTTCC GCGGCACGGA CGGGGCCCTG
CGCTGGATCG AGGCCTTTGG CCGGGCCTTC TTCCGCGACG ACCAGTGCGT GCGCTTCGTC
GGGGTGGTCT CCGACGTGAC CGAGCGCAAG CAGGCCGACG CCGACCGCGA GCTGCGCGAG
GCCACGATGG CCCTGGCCCT GGACGCCGGC GATGTGGGCA CCTGGGACTT CGACGTCACG
CGGCGCGACC TGCGCTGGTC CGAGCGGGCC TTGGGCATGT TCGGCATGTC CCCGGGCCAG
GACCTGGGGC TGGACGACTT CTACGCCGCC CTGCACCCCG ACGATCGCGA GGCCACGCGG
ACCGCCCTGG TCGCCGCCAT GACCCCCGGT CTGAGCCCCG ACATCGACGT GGAGTTCCGC
ACGATCGGCC TGGAGCGCTG GATCCTGGCC AAGGGCCGCG GCTTCTTCGA CGAGGCCGGC
CAGCCCGTCC GGGTGGTCGG GGCCACGGTC GACATCACCG AGCGCAAGAA GGCCGAGCTG
CACCTGCGGC TGCTGGTCAA CGAGTTGAAC CACCGGGTGA AGAACTCGCT GGCCACGATC
CAGGCCATCG CCGCCCAGAC CTTCCACGCC GCCCGGTCCC TGCCCCAGGC CCAGGAGGCG
TTCTCGGCGC GGATCGTCGC CCTGGCCGAG GCCCACGACC TGCTGACCCG CGAGAACTGG
GAGGGCGCCG ACCTGACGGA CCTGCTGACC CGGCTGGAGA TCCTGCACGG CGGCCCCCCG
CAGGGGGAGA TCCGGCGCTT CATGTTCAGC GGACCGTCCG TGCGGCTGTC GCCGCGCATG
GCCCTGTCGC TGTCGATGGC CCTGCACGAA CTGGCCACCA ACGCGGTCAA GTATGGGGCG
CTGTCGGTTC CCACCGGCCA GGTGCGGATC GTCTGGAGCG TCGCCCCCGG CCCCATCCAG
CCTTTCCTGA CCCTGACCTG GACCGAAACG GGCGGGCCGC CCGTCTCGCC GCCCCACCGG
CGCGGGTTCG GCTCGCGACT GATCGAGCGC GGCCTGGCCT CGGAGCTGTC AGGCGAGGCC
CATATCGATT TCCGGCCCAG CGGCGTGGTC TGCCGGATCT CGGCGGGGTT GGACGGCTGA
 
Protein sequence
MVDTREPGRA VEETPCGDMR RRIEAFDWAA TPLGPRETWS PRLTFAVDMI VASQFPTALR 
WGPELVLIYN DAYAPMLGDR HPGTLGKTFA EAPPSSAVES EGRERAVVAG ESGGEIIEDL
FLLTEHADGT VTEGYFTIRL SPLPDPATAT GVGGVLIALS DTTRRVRAEQ ALRASEERYQ
LALEAASGVG TWDWDIVADK VYADARYATF HNVDPERAAA GAPLAEYSRA LHPDDFQRLL
DSGRTHLETT GDFLEEYRLI QADGSVRWVQ ARGQVYRDAR GWAVRHRGVM VDITERKRIE
AALEATEADL RIAIEAAGLG RWDNNPATGQ RYWDQRTREI FGLPLEGTAA SPEIVARLIH
PDDLGRFQEA VREAIDPADP TADHVFNQEY RIFRGTDGAL RWIEAFGRAF FRDDQCVRFV
GVVSDVTERK QADADRELRE ATMALALDAG DVGTWDFDVT RRDLRWSERA LGMFGMSPGQ
DLGLDDFYAA LHPDDREATR TALVAAMTPG LSPDIDVEFR TIGLERWILA KGRGFFDEAG
QPVRVVGATV DITERKKAEL HLRLLVNELN HRVKNSLATI QAIAAQTFHA ARSLPQAQEA
FSARIVALAE AHDLLTRENW EGADLTDLLT RLEILHGGPP QGEIRRFMFS GPSVRLSPRM
ALSLSMALHE LATNAVKYGA LSVPTGQVRI VWSVAPGPIQ PFLTLTWTET GGPPVSPPHR
RGFGSRLIER GLASELSGEA HIDFRPSGVV CRISAGLDG
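
The sequences in this record are printed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The following is a minimal sketch of a GC-content calculation on such text. The function name `gc_content` is hypothetical, and the database's own method for deriving the 71% figure (for example, how it treats ambiguity codes) is not stated in the record, so this is only one plausible approach.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq, ignoring whitespace."""
    # Remove the block spacing and line breaks used in the record's layout.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied with the record's spacing.
first_block = "ATGGTGGACA CCCGTGAACC GGGGCGCGCC GTCGAGGAAA CGCCGTGCGG CGACATGCGC"
n_bases = len("".join(first_block.split()))
print(f"{gc_content(first_block):.1f}% GC over {n_bases} bases")
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.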