Gene Caul_3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3549 
Symbol 
ID5901004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3832726 
End bp3834834 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content70% 
IMG OID641564057 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001685174 
Protein GI167647511 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGG CCGTCCAGTT CCCGATCGAG CCGCCTTCCG AGGACCGGCC GATCACCAAG 
ATTCTGATCG TCGACGACGA CGAGCGCAAC GCCTTCGCCG CCATCCAGGC GCTGGAGGCG
CTGGGCCAGG AGCTGGTGGT GGCCCGCTCC GGCGAAGAGG CTCTGCGCAA GCTGCTGGTC
GACGACTACG CGGTGATCCT GCTCGACCTG CACATGCCCG GCATGGACGG CTACGAAACC
GCCGCCCTGA TCCGCCAGCG TCGGCGAAAC CGCGATATCC CGATCGTTTT CCTGACGGCG
GTGTTCCGCG AAGAGACCCA CATCTTCAAG GCCTATTCGG CCGGGGCGGT CGACGTGGTG
TTCAAGCCGG TCGATCCGTT CATCCTGCGC TCCAAGGTGC AGGTTCTGGT CGATCTCCAC
CTGAAGACCC TGGAACTGGC CCGCCAGTCG GAGGGCCGCC GCCTGCTGCT GGAGGAGAAC
GCCCAGGTTC ATGCCGAAAA GCTGGTGGCC GAACGCTCGC TGCGCTCCAG CCAGGAGCGG
CAACAGGCGA TCCTGCGCGC CCTGCCGATC GTCTTCCACT CGCGCCGGCC CGATCCGCCC
TACGCGCCCC TGTCCCTCAG TGAAGGCGTG CTGGGCTTGA CCGGCTTTCC GCCCGCGCGG
TTCCTCGACG AGCCTGCCTT CGCCTTCGAC CGCATCCACC CCGGCGATCG CGCCGGTGTG
ACGGCGGCCC AGGACGAGGC GCGTCGCGTC GGCGCCTATC ATTGCGAATA TCGCTGGCTC
TGCGCGGACG GCCAGTATCG CTCGCTGATC GACCAGGGCG TCATGGTGGT CGATGACGAG
AGCGGCGAGC CGCTGATCTT CGGAACCATC CTCGACAACA CCGAGCGGCG CGACCTGGAA
GAGGCGCTGG TCCAGGCTCG CAAGATGGAG GCGGTGGGCC AGCTGACCGG CGGCGTCGCC
CACGACTTCA ACAACCTGCT GACCGTGATC CTGGGCAATA TCGAACTGAT CCAGCGCCGC
AGCGGCGACG AGCATCCCTT GGCGCGCCAC GTGGCCGCCG TCCGCCAGGC CGCCGAACGG
GGCGGCGCCC TCACCCGCCA GCTCCTGGCC TTCTCGCGCC GCCAGCGCCT CGATCCGGCG
ACGGTGGACA TCATCGACCT CGTCCGGGAG TTCACGCCTC TGCTGCGCCA AGCCGTGGGC
GAGGCCGTGA CGATCGATCT GGAAATCGGC GCGACGCCGG TGTGGGTGCA TGTCGACGCC
GCCCAACTGG AAAGCGCGCT GCTGAACCTG GCCGTCAACG CGCGCGACGC GATGGACGCG
GGCGGCGTTC TGAGCATTTC GGCGCGGGTC GAAGCGTCGG CGGGCGCCGC CCTGGCCGTG
ATCAGCGTGC GCGATACGGG GCCGGGCATG TCCGAAGAGA TCGCGTCGCG GGTCTTCGAA
CCGTTCTTCA CGACCAAGGA GGTCGGCAAG GGCTCTGGAC TTGGCCTGTC GCAGGTCTAT
GGCTTCGTAC GGCAGTCGGG GGGAGAGATC ACCCTGCGCA GCGCTCCCGG CCAGGGGGCG
ACGTTCGAGA TTCGCCTGCC AACCACGACC ACGACCGCCA AGGTCGCGAC AGCCCCGGCG
ACCAGGGCCG AGGACGTCGC GGTCGCCGCG GCCGCGCCAT CCGGCGGCGA GACGATCCTG
GTCGTGGAGG ACGATCCGGC GGTGCTGGCC CTGGCGGTCG ACACCCTGCG GAGCTTCGGC
TATCGCGTGA CCACCGCCAG CAACGCCGCG AGCGCGTTGC GCCGGCTGCG GGGCCGCCAG
GCCTTCGACA TGCTGTTTTC GGACGTCGTC ATGCCCGGCG GCGTCAGCGG CATCGAGCTG
GCCCGGCGCG CGCGGGTGCT GCGGCCCGAC CTCAAGATTC TGCTGACCTC CGGGTTCGTG
GGCGAGGAGG CCGAGGCCTG GGCCAATGAG TTTCCCATGA TCGACAAGCC CTACGAGCCG
TCGCGCCTGG TCAGCCGCGT CCGCGCCGCC TTCGACGGCG ACGGTCCCGC CGAGGGGCGC
GAGAGTCCCG CGACACCCTC TCCCAGCGGG AGAGGGTGGG GTCCGACCGC GAAGCGGTTG
GGAGGGTGA
 
Protein sequence
MTPAVQFPIE PPSEDRPITK ILIVDDDERN AFAAIQALEA LGQELVVARS GEEALRKLLV 
DDYAVILLDL HMPGMDGYET AALIRQRRRN RDIPIVFLTA VFREETHIFK AYSAGAVDVV
FKPVDPFILR SKVQVLVDLH LKTLELARQS EGRRLLLEEN AQVHAEKLVA ERSLRSSQER
QQAILRALPI VFHSRRPDPP YAPLSLSEGV LGLTGFPPAR FLDEPAFAFD RIHPGDRAGV
TAAQDEARRV GAYHCEYRWL CADGQYRSLI DQGVMVVDDE SGEPLIFGTI LDNTERRDLE
EALVQARKME AVGQLTGGVA HDFNNLLTVI LGNIELIQRR SGDEHPLARH VAAVRQAAER
GGALTRQLLA FSRRQRLDPA TVDIIDLVRE FTPLLRQAVG EAVTIDLEIG ATPVWVHVDA
AQLESALLNL AVNARDAMDA GGVLSISARV EASAGAALAV ISVRDTGPGM SEEIASRVFE
PFFTTKEVGK GSGLGLSQVY GFVRQSGGEI TLRSAPGQGA TFEIRLPTTT TTAKVATAPA
TRAEDVAVAA AAPSGGETIL VVEDDPAVLA LAVDTLRSFG YRVTTASNAA SALRRLRGRQ
AFDMLFSDVV MPGGVSGIEL ARRARVLRPD LKILLTSGFV GEEAEAWANE FPMIDKPYEP
SRLVSRVRAA FDGDGPAEGR ESPATPSPSG RGWGPTAKRL GG