Gene Caul_5316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5316 
Symbol 
ID5897133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp24254 
End bp27463 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table11 
GC content64% 
IMG OID641550609 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001672095 
Protein GI167621587 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.72275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.481403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGAAG CAGAAGCGCG GCGACTTGAA GCGCTTCGTA TAGTCGAGGC GCTGAACAGG 
CCACCCCAGG AACGGTTCGG CCTTGTGACG GCCTTGGCGG CCGACCTCTT CAAGACCCCG
CTCGCCTTGG TGTCGCTGAC CGAGATTGAA GGCTCATCGC CCCTATCCAC GCCCGCCGTA
GCAGCCAACA GTCCGCCGCA GGCCTGGACG TTGAGCGCCG AGGCTCTTGG CATGGGCCCC
AATGCCCTGT TGGTCGTCGA GGATGCGGCG ACGGACCGGC GTTTCGCAAC CGATCCCCTG
GTCACCGGCG CCGCCCATAT CCGTTTCTAC GCCGGCGTGG TGCTGACCAC GCGCGATGGC
CGAACCCTCG GCACCTTGTG CGTCATCGAC ACCAAACCCC GTCGCCGTCC AACACCCGCC
AAGCTCGACC GCCTGAAGAT GCTGGCCAGG ATCGTCGTTG ATGAACTGGA ACTGGCGCGC
GCGCATCGGT TGGCCAGCGA AAAGCAGCGT CTGCTGGAAC TGACCGAGAG CGTGTCGGGC
GTCGGCCATT GGCGCATTGA GGTCGGAGCC GGCGATGTGG GCTGGTCGGA CATGGTCTAT
ACGATCCATG GGGTCAGCCG CGAAGACTTT GAACCCGACC TCCGGAACTC GCTAGCCCTC
TATCATCCCG ACGACCGGGC CAAGTTGACG GCCTGCGTCA GGGAGGCCGT GGCCGCCAAG
GGCTCTTTTG AAGCTCAACT GCGCATCAAC CGTCCAGACG GCGAGATTCG CGATGTGATC
TCCAGGGGCG TGTGCGAACT CGATGAGCAC GGCGAGGCGA CGGCGGTGTT CGGCGTATTC
CAGGACATCA CCGATCAGAC ACGCATCTTG AATGCGATCA AGCGCAGCGA ACGTCGCTAC
AAGTTACTCG CCGACAATAT GAGCGACGTC GTCACGCGCA TCCGTCTGGA CGGCGGCAGC
GGCTACATCT CGCCGACGAT CGAGCGGCTT CTCGGCTATC GACCTGAGGA GATGACCGGT
CGCTCAGCTC AGGACTTTGT CCACCAAGAC GACCGATCCC AGATCCTGGC GATATTCGGA
CAGATGGCCC AAGGTTTGGA GCAAAAGACC CTTCAACACA GAGCGATCCA CAAGGATGGG
CGGACCGTCT GGGTCGAGAC CAGCTTCCGA TTGGTGCGGG ACGAACAGGA TCAGCCGCTC
GAGATCGTGG CGGTAATCCG CGACGCGACA GACCGCAAGG CGCTTGAGGA CGCGACGATC
GCCGCGCGCG ATGAGGCCCG AGAACAGGCC CAGAGAGCGG CAACGGCCGA GCGCATGGCC
GGACTAGGTC ATTGGCGGTT GGAGGTGGCG ACGCGGTCGG TGACCTGGTC CGAACAGATG
TACCAGATCT ACGGTCTCGA CCCCGCGTTG CCGCTCGACC TCGACGCCCT CCTGGCCATG
ACCCATCCGG ACGATCAAGC CGAGTCCAGC CAGCGCTTGC ACCGCGCCCT CACGACGGGC
CAGCCGACGA TGGAGGCCGT CTTCCGCCTC ATTCGGGCGG ACGGCCAGTT GCGAGACATT
ACTGGAAACA TGCTGGTCCA AAAGGACGCC GATGGCCAAG TCATCGCCGT GGTCGGCACG
ATGAGCGACG TCACCGAGCA GCAACACGCG CAGGCCGCCT TGGCCCAGAG CGAAGCGCGC
TATCGACTCC TGGCGGAAAA CGCCAGCGAC CTGATCATGC ACAGCGACGT CAAGGGGCAG
GTGACCTATG TCTCGCCATC AATCCTGCCC ACGACCGGCT ATGCACCCGA GGCGTTGGTC
GGGACCAACA TCCTCGATTG GATCGCCCCC GAAGACGTAC CCGGCGTCCA AGCCGCCGTC
GCCAAACAGT TCAAATCGCG AGGCGCGGAA CCGCCGATCG CCGTGGAATA TCGCGTGCGG
CACAAGGATG GTCGCGAACT GTGGTTGGAA GCCCGCCCGA CCCTGGCCTT CGATCCGGAG
ACTGGCGCCA TCACCGGCAT CACCGATGTC GTGCGCGACA TCTCCGCCCG TAGAGTGCTG
GAAGCGGAGT TGCGCGCGGC GCGCGCCGAG GCCGAGGCCG CCGCCGCCGT GAAGTCGGAG
TTCCTGGCCA ATATGAGCCA TGAACTGCGC ACGCCGCTGA CGGCGGTGCT GGGGTTCTCG
CGGCTGGCCG AGGAGCAGCC GGAACTGTCG GACACCACGC GCGGTTATCT GAAGCGCGCC
TGCAATGCCG GCCAGGCCTT GCTTCTTACG GTCAATGACA TCCTCGACTT TTCCAAGCTC
GAAGCCGGGC AGGTCGAGAT CGCGCCACGA CCGATGTCGC CCGCGAAGTT GGCGACCGAA
ACCCTCGACC TCTTCGCGAC CCAAGCCGCC GAGAAGGGGA TCGCGCTTCA CCTGGGCGGG
CTGGAGACTC TGCCACGGAC CGTTCGGGCC GACCCCGACC GGGTGCGCCA GATATTGCTC
AACCTGATCG GCAACGCCGT GAAATTCACG ATGGTCGGCA GCGTCCGCGT TGACGCCATC
TTCGAACCGC TTGGCGGATT CCTGTCATTC GCGGTGACAG ACACTGGCCC GGGCGTTCCC
GAGGATCGCG CCGATCAGCT GTTCCAACGT TTCTCCCAAG TTGATGCGTC CTCGACCCGC
AAACACGGCG GCACCGGTCT TGGCTTGGCG ATCTGCAAGG GTCTCGCCGA AGCCATGGGC
GGCGACATCG GGGCGCAGAG CAAGGTCGGT GAAGGGTCCT GCTTCTGGTT TACAATTCCA
GCTCCCGAAC TGGACGTCGC CGCGCCCATC GCCGCTCCGC CACAAGGTCA GATCATGCTG
CCCGCCGGCT GCCGAATCCT CGTGGCGGAT GACAATCGGA TCAACCGCGA TCTTGTCCGG
GCGATGCTGT CGCCGTTCGA CGTCGAACTC ACCGAAGTCG TGGACGGACT CGGAGCTGTG
GCGGCCGCCA ACGCGGCGCC GTTCGACGTC ATTCTCATGG ATCTTCGGAT GCCGGGCCTT
GACGGCGCGG GCGCGGCCCG GTGCATCCGC ACCGAGGACG GGCCCAACGC CACCATTCCG
ATCCTCGCCT TCTCGGCGGA CGTCGATCAG GCCCAAGCGA CGGGTCTGTT CGATGGCATG
GTTGGCAAAC CCCTGACAGG GGTGAATTTA TTGACGGCGA TCGCCAAAGC CATGGCTTGG
CCGGAGGATC CCCTTGATGA CGCAGCCTGA
 
Protein sequence
MREAEARRLE ALRIVEALNR PPQERFGLVT ALAADLFKTP LALVSLTEIE GSSPLSTPAV 
AANSPPQAWT LSAEALGMGP NALLVVEDAA TDRRFATDPL VTGAAHIRFY AGVVLTTRDG
RTLGTLCVID TKPRRRPTPA KLDRLKMLAR IVVDELELAR AHRLASEKQR LLELTESVSG
VGHWRIEVGA GDVGWSDMVY TIHGVSREDF EPDLRNSLAL YHPDDRAKLT ACVREAVAAK
GSFEAQLRIN RPDGEIRDVI SRGVCELDEH GEATAVFGVF QDITDQTRIL NAIKRSERRY
KLLADNMSDV VTRIRLDGGS GYISPTIERL LGYRPEEMTG RSAQDFVHQD DRSQILAIFG
QMAQGLEQKT LQHRAIHKDG RTVWVETSFR LVRDEQDQPL EIVAVIRDAT DRKALEDATI
AARDEAREQA QRAATAERMA GLGHWRLEVA TRSVTWSEQM YQIYGLDPAL PLDLDALLAM
THPDDQAESS QRLHRALTTG QPTMEAVFRL IRADGQLRDI TGNMLVQKDA DGQVIAVVGT
MSDVTEQQHA QAALAQSEAR YRLLAENASD LIMHSDVKGQ VTYVSPSILP TTGYAPEALV
GTNILDWIAP EDVPGVQAAV AKQFKSRGAE PPIAVEYRVR HKDGRELWLE ARPTLAFDPE
TGAITGITDV VRDISARRVL EAELRAARAE AEAAAAVKSE FLANMSHELR TPLTAVLGFS
RLAEEQPELS DTTRGYLKRA CNAGQALLLT VNDILDFSKL EAGQVEIAPR PMSPAKLATE
TLDLFATQAA EKGIALHLGG LETLPRTVRA DPDRVRQILL NLIGNAVKFT MVGSVRVDAI
FEPLGGFLSF AVTDTGPGVP EDRADQLFQR FSQVDASSTR KHGGTGLGLA ICKGLAEAMG
GDIGAQSKVG EGSCFWFTIP APELDVAAPI AAPPQGQIML PAGCRILVAD DNRINRDLVR
AMLSPFDVEL TEVVDGLGAV AAANAAPFDV ILMDLRMPGL DGAGAARCIR TEDGPNATIP
ILAFSADVDQ AQATGLFDGM VGKPLTGVNL LTAIAKAMAW PEDPLDDAA