Gene Caul_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0996 
Symbol 
ID5898451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1053521 
End bp1056019 
Gene Length2499 bp 
Protein Length832 aa 
Translation table11 
GC content70% 
IMG OID641561478 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001682624 
Protein GI167644961 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACAG AGACGACCTC ATCCGCCTCG ACCTGGACGG GCGGTGACAT TCTGAAAACG 
GCGTTGCTCG CCGTGATCGC CGCGGTCGGC TACGCCCTGC TCGCCTATCT CTGCGTCGAT
TTTCCCCGCA ACTACGGCCA GGTGGCCCCG ATCTGGCTGT CCAACGGCTT TGGCGTGGCC
TGCCTGCTGT CGAGCCGAAG CCGCCGCTGG CCCGCCATCC TGCTGGGCTG CATGGTCGGC
GGACTTGCGG CCGGCGCCCA TGTCCAGGAC CTGCTGGCCG TCAATCTCGT GCTGGTCGGC
TGCAACATCG CACAGATCTT CCTCTGCGCC TGGGGCATGC GCCGGGTGGT CGGCGACGAG
GTCGATCTGG GCCGGGTCCG TGACATGATC GCCTTCGCCC TGATCTGCGG CCTGGCCGCG
CCCATCGCCA CCGGCGTGCT GGCCGCCACC CTCCATACGG CGATGCGCGG CGGCGTGCTG
TGGGCCAATA TCGGCGTCTG GTCGCTGGGC GACATCCTGG GCCTGATGAC CATCACCCCC
TGCCTGCTGG CCCTGGTGCG GATTCGCGAC TATCTGCGCG AACGCCCGCT GTCGGCCAGC
GGCGTCGCCA GCCTGCTGCT GCTGCTGGCG GTGACCGTCG GGGTGTTCTC CCAGAACCGC
CCGCTGCTGT TCGTCATCCC GCCGGTGATG CTGCTGGTCG CCTGGCGGCT GGAAGTGCTC
GGCGCGGCGT TGAGCGCGAC CCTGGTGGCG GTGGTGGCCG TGACCTTCAC CATGGCCGGC
CACGGCCCGA TCACCCAGCT CAAGAGCGGA TCGCCGGACC AGGCCATCGT GCTGCAGCTG
TTCCTGGCGG TGGCGATCTT CATCAGCCTG CCCGTCGCCT CGATCCAGCG CCACCGGCGC
AACATGCTGC ACAGCATGAC CGAGGCCAAC GCCGCGGTGG CCCGCAGCGA AGCGCGCTTC
CGCCAACTGG CCGAAAACGC CCAGGACATG ATCATTCAGA GCAACATGCG GGGGATCGTC
CAGTACGCTT CGCCGGGATG CCTGGCGATG ACCGGCTTCA CCCCCGAAGA GGTGGTCGGT
CGCGAGGGCA TGCAGATCGT TTACGAGGAG GATCGCGAAG CCGTGCGCGC CATCGTCATC
GCGCAGTTGG CCCATCTGGC CGCCGATATC GACAACGGCC CCAACCGGGT GGAGTACCGG
GCGGCGCGCA AGGATGGCCG GATCATCTGG CTGGAATCGC GCCCCACCCT GTCGCGCGAT
CCGCTGACCG GCAAGGTCAC GGGCATCACC GACATCGTCC GCGACATCAC CGAGCACAAG
GCCATGGAGC GCCAGCTGCG CGAGGCTCGC AACGAGGCCG AGGCCGCCGC CGCCGTGAAG
GGGGAGTTCC TGGCCAATAT GAGCCACGAG TTGCGCACGC CGCTCACCTC GGTGCTGGGC
TTCGCCCGCC TGGTCGACAA CGAGCCCGAT CTGTCGCCCG ACGCACGGCG GTTCATCAGC
CGGGTGCTGA GCGGCGGCAA GGCCCTGCTG ACCACCATCA ACGACATCCT CGACTTCTCG
AAGCTGGAGG CGGGACAGCT GGAGCTGAAG CTCGAGCCGA CCGCCCCGGC CCAGTTGATC
GACGAGGCGA TGGATCTGTT CTCGCTCGAG ACCGAGGCCA GGGGCGTGGC CCTGCGCGCC
GTAGGCCTGG AGGACCTGCC CGCCGACCTG CTGATCGACG GCGGTCGCCT GCGCCAGGTG
CTGCTGAACC TGATCGGCAA CGCCGTGAAG TTCACGGAAG TTGGGACGAT CACCGTCAGC
GCGGCCTACG AACCCGCCGG CGAACGCCTG AGCCTGTCGG TCGCCGACAC TGGCCCCGGC
ATCGCGCCCG GGGATGTGGA CCTGCTGTTC CGGCGCTTCT CGCAGGTGGA CGCGGGCCTC
ACCCGCAAGC ATGGCGGCAC GGGATTGGGC CTGGCGATCT GCAAGGGCCT GGTTGAGGCG
ATGGGCGGCC AGATCGGCGT CACCAGCGTT CCTGGCCAAG GCGCCTGCTT CAGCTTCGAC
GTCGTCGCGC CGTCCACCGC CCATGCCGCG GTCGATCCCG CCGCGCCGGT CGTTGAGCCG
CCGCTGCCGC CCGCCTGCCG GGTGTTGGTG GTCGACGACA ATCCAGCCAA CCGCGAACTG
GTCACCGCCA TCCTGACCGC CATGGGCGCC GAGGTGATCG AGGCGGTGGA CGGCGTGGAG
GGCGTGACCG CCGCCGCCGC GGCCCCGTTC GACGCCATCC TGATGGACCT GCGCATGCCG
CGCCTGGACG GCGCCAAGGC CGCCTTGCGC ATCCGCGAAG AGGGCGGCCC CAACGCCCGC
ACGCCGATCA TCGCCTTCTC GGCCGACGCC CGCCCCGGCG GTCCCGGCGG GATCTTCGAC
GGGTCGGTGT CCAAGCCGAT GACCGCCCAG GGCCTGGTTG ACGCCCTCAA CGCCGCCATG
GCGGCCTCGC CCGCCCAGCC GTTGGCGGCC TCGGCCTGA
 
Protein sequence
MMTETTSSAS TWTGGDILKT ALLAVIAAVG YALLAYLCVD FPRNYGQVAP IWLSNGFGVA 
CLLSSRSRRW PAILLGCMVG GLAAGAHVQD LLAVNLVLVG CNIAQIFLCA WGMRRVVGDE
VDLGRVRDMI AFALICGLAA PIATGVLAAT LHTAMRGGVL WANIGVWSLG DILGLMTITP
CLLALVRIRD YLRERPLSAS GVASLLLLLA VTVGVFSQNR PLLFVIPPVM LLVAWRLEVL
GAALSATLVA VVAVTFTMAG HGPITQLKSG SPDQAIVLQL FLAVAIFISL PVASIQRHRR
NMLHSMTEAN AAVARSEARF RQLAENAQDM IIQSNMRGIV QYASPGCLAM TGFTPEEVVG
REGMQIVYEE DREAVRAIVI AQLAHLAADI DNGPNRVEYR AARKDGRIIW LESRPTLSRD
PLTGKVTGIT DIVRDITEHK AMERQLREAR NEAEAAAAVK GEFLANMSHE LRTPLTSVLG
FARLVDNEPD LSPDARRFIS RVLSGGKALL TTINDILDFS KLEAGQLELK LEPTAPAQLI
DEAMDLFSLE TEARGVALRA VGLEDLPADL LIDGGRLRQV LLNLIGNAVK FTEVGTITVS
AAYEPAGERL SLSVADTGPG IAPGDVDLLF RRFSQVDAGL TRKHGGTGLG LAICKGLVEA
MGGQIGVTSV PGQGACFSFD VVAPSTAHAA VDPAAPVVEP PLPPACRVLV VDDNPANREL
VTAILTAMGA EVIEAVDGVE GVTAAAAAPF DAILMDLRMP RLDGAKAALR IREEGGPNAR
TPIIAFSADA RPGGPGGIFD GSVSKPMTAQ GLVDALNAAM AASPAQPLAA SA