Gene Caul_4307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4307 
Symbol 
ID5901768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4680208 
End bp4682628 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content72% 
IMG OID641564825 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001685925 
Protein GI167648262 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.767019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACAA AAGAACCACT CGCAAGCGGT CGCGGGCGCT CGACGTCCGA TGCGATTAGC 
GCGTGGCTCA TCTCCGGGAC GCCTCCGCCG GCCTGGCGGG CGGCCCTGGT GACAGGGGCG
GCGATCCTGG CGGCGGTCGG GGTGAGACTG GCCCTGCTGG GACTCAGCGG CGGGGTCGGC
GCCACCCAGG CTTTCTTCCC GGCCATGATT CTGGTCACCC TCTACGCCGG CTGGCGCTGG
GGCGCCGCGC CGGTTCTGTT CGGCACGACT TTCGCCTGGT GGTTGTGGGG CGGGACGCGG
AGCAACGGGC TCGACCAGGG CGAGGCGGCC ACGCTGACGC TATTCCTGCT GTCCTCCGCG
ATCACCGTGG CGGTGGCGCA AGCCCTGCGC ACCAGCCTGA TCCACCTGGC CGAGGCCCGC
CGGCGACAGA CCGAGGCCGA GGTCCGCCTG CAGGTCACCC AGACGGCCGC CGGCGTGGGT
CCTTGGGAGT GGGACGTCCA GAACAACGTG CTGGAGCTGT CGCGCACCGC CCGCCGGAAC
CTGGGCGTCG CGGCCGAAGG TCCGCTGGAC CTTGGCGCCC TGGTCGAGGC GGTGCATCCC
GACGATCGCG CGCTGATCCG CGAAAAGATC CGCGAAGCGG TGACCCGCGG TCCGCTCTAC
GAGGTCGAGT ATCGGCTGGC CAACCATCCG GATGGCGAGC GCTGGATCCA TGGCCGCGGC
GAGGTGGTGC GCGACGAGAA CGGCCGCGCG ACCCGCATCC TCGGGGTCAA TTTCGACGTC
ACCAAGCGCC GCAGGGCCGA GGAGAGCCTG CGCGAGAGCG AGGCCCGGTT CCGCGCTCTT
GCCGACAGCG CCCCGGCGCT GATGTGGATC ACCGGCGAGG ACGGCGTCCG GGTGTTCGTC
AACGCCGCCT ATATCGACTT CGCCGGCGTC AGCTATGACG AGGCCCTGGT CCTGGACTGG
CGCTCGCGAC TGCATGCCGA CGACCTGCCT GCCATCCTCA AGGGCCAGAT CGCCGGCGAG
GCGTCGCGCA GGCCGTTCAG CCTGGAGGGC CGCTATCGCC GGGCCGACGG CGAGATGCGC
TGGCTCAAGT CGTTCTCGCA ACCGCGCCTG GGGGCCGGCG GCGAGTTCAT CGGCTTCATC
GGCATCGCCT TCGACGTCAC CGACGCCAAG GAGGCCGAGG TCAAGCTGAC CGGCCTCAAC
GAGCTGCTGG CCGATCGGGT GCAGGAAGCC CTGTCCGAGC GCGACATCGC CCAGGCCGCC
CTCACCCAGT CGCAGAAGCT GGAGGCCATC GGCCAGCTGA CCGGCGGGGT CGCCCACGAC
TTCAACAACC TGCTGACCGT GATCATCGGC GCGCTGGACG TGCTGCAGCG CAATCCCGAC
GACGCGGCGC GCCGCGAGCG CATGCTGGCC GCCGCCCAGG CCGCCGCGCG CCGGGGCGAG
CGGCTGACCC AGCACCTGCT GGCCTTCGCC CGCCGCCAGC CTTTGAACCC GGAGATCTGC
CGGATCGACG CGATGATCGC CGAAAGCGAA AGCCTGCTGC GTCGCGCGGT CGGCGAAGGC
GTGGAGCTGC GGCTGGACCT GAAGGCCGGC GGCCGCACCA CCCTGACCGA CTCCGGCCAG
TTCGAGGCGG CGGTGCTGAA CCTGGTGGTC AACGCCCGCG ACGCCACCCC GGCCGGCGGC
GTGATCACCG TGCTGTCGCG GGGCGTCGAC CTGGCCGAGC CCAAGGGCGA GCTGCCGGCC
GGCCGCTATC TGAGCGTGGC CGTGCGCGAC ACCGGCGAGG GCATGGACGC CGAGACCCTG
CAGCGCGCCT TCGAGCCCTT CTACACCACC AAGCCGGTGG GCAGGGGCAC AGGCCTGGGC
CTGTCGCAGG TCTATGGCTT CGTCCGCCAG AGCGGCGGCG AGGTGACCAT CGAGTCGACC
GTCGGCCAGG GCACGACCGT GACCATGCTG CTGCCCGTCC GCGAGGCCTT CTCGCCGGTC
GAGGTGCTGG CCGTGCGGCC GCCCGCCACC CGCGCCACCT CGCACGTGCT GCTGGTCGAG
GACGACGTCG AGGTCGGCGA CCTGGTCGCG GCGATGATCG ACGAGCTGGG CCACATGGTC
AGCCGGGCGG CCAACGCCGA CGAGGCCCTG GCCATCGCCC GCGCCGACCC CACCCTGGGC
CTGGTGATCA CCGACGTGAT CATGCCCGGC GGCAAGAGCG GGGTCGACCT GGCCATCACC
CTGGCCAGCG AGCGCCCAGA ACTGCCGATC CTGCTCAGCT CGGGCTATAC CGGCCAGGAA
CTGATGCGCG CTCACGACAC CCCCTGGCCC CTGCTGCGCA AGCCCTATGC CCTGGACGCC
CTGGCCCAGG CCATGGCGGA CGCCTGGGAC CGGCATGGGC CGGCGCAGGT GTCGAGCAAG
GGCGGGAAGG CGAAGGGGTA G
 
Protein sequence
MKTKEPLASG RGRSTSDAIS AWLISGTPPP AWRAALVTGA AILAAVGVRL ALLGLSGGVG 
ATQAFFPAMI LVTLYAGWRW GAAPVLFGTT FAWWLWGGTR SNGLDQGEAA TLTLFLLSSA
ITVAVAQALR TSLIHLAEAR RRQTEAEVRL QVTQTAAGVG PWEWDVQNNV LELSRTARRN
LGVAAEGPLD LGALVEAVHP DDRALIREKI REAVTRGPLY EVEYRLANHP DGERWIHGRG
EVVRDENGRA TRILGVNFDV TKRRRAEESL RESEARFRAL ADSAPALMWI TGEDGVRVFV
NAAYIDFAGV SYDEALVLDW RSRLHADDLP AILKGQIAGE ASRRPFSLEG RYRRADGEMR
WLKSFSQPRL GAGGEFIGFI GIAFDVTDAK EAEVKLTGLN ELLADRVQEA LSERDIAQAA
LTQSQKLEAI GQLTGGVAHD FNNLLTVIIG ALDVLQRNPD DAARRERMLA AAQAAARRGE
RLTQHLLAFA RRQPLNPEIC RIDAMIAESE SLLRRAVGEG VELRLDLKAG GRTTLTDSGQ
FEAAVLNLVV NARDATPAGG VITVLSRGVD LAEPKGELPA GRYLSVAVRD TGEGMDAETL
QRAFEPFYTT KPVGRGTGLG LSQVYGFVRQ SGGEVTIEST VGQGTTVTML LPVREAFSPV
EVLAVRPPAT RATSHVLLVE DDVEVGDLVA AMIDELGHMV SRAANADEAL AIARADPTLG
LVITDVIMPG GKSGVDLAIT LASERPELPI LLSSGYTGQE LMRAHDTPWP LLRKPYALDA
LAQAMADAWD RHGPAQVSSK GGKAKG