Gene Caul_4555 details


Gene Information

Locus tag: Caul_4555
Symbol:
ID: 5902016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4934636
End bp: 4935745
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 66%
IMG OID: 641565074
Product: signal transduction histidine kinase
Protein accession: YP_001686173
Protein GI: 167648510
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
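
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4934636, 4935745)
print(length, protein_length(length))  # prints: 1110 369
```

Both values agree with the Gene Length and Protein Length fields of this record.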


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGACA CATCCCAGTC GGAACGCGCG GTGGGCAAGC CTGTCGCCGA ACACGGCGCC 
TTTGACCCGT TCGCCGCCGC GATGCGGACC GCCCGTCTGC CGATGATCGT CACCGACGCT
CGGCAGAGCG ACAATCCGAT CGTCTTCGCC AACGACGCCT TCCTGGCCCT CACGGGCTAT
GACCTCGACG AGGTGATCGG CCGTAACTGC CGATTCCTGC AGGGTCTGGA AACCGACCCG
GACCAGGTCG ATCGTCTGCG CCAGGCCGTC GCCCAAGGCG AGGAGGTCGC GCTCGAGCTC
CTCAACTACC GCAAGGACGG ATCGACCTTC TGGAACGCCC TCTATCTGTC GCCGGTGCGC
GGCGAGACCG GAGAAGTCCT CTACTTCTTC GGCACACTAC GGGACATCAG CGACCAGAAG
CGGGTCGAGT TCGAACTGAG CGACGCCCGC GATCGGCTGG AAGCGGCGGT CGAGGCCCGC
ACCCGCGACC TCACCCAGGC CCTGGACCAG AAAACCGCCC TGCTCCACGA GGTCGATCAC
CGGGTCAAGA ACAACCTCCA GCTCATCTCT TCGCTGCTGC TGCTGCAGAA CCGCCGGGTC
ACCGACCCGG CGGTGAAGGC CTCGCTGCGC GGCATGCTGG AGCGGGTCAG CGCCATCGCC
ACCGTTCACC GCCGCCTGTT CCAGAGCGAC GATGTCGAGC GCTTCGACGT CTCGGCCTTC
GTCCGCGACC TGGTCAGCGA CATGATGGGC GGCGCGCGCC GCGACGACAT CAAGGTGCGC
CTGGACCTGG AGCGCATCGA CGTGGCCGCC TCCAAGGCCG CGCCCCTGGC CCTGGTGATC
AGCGAACTGT TCTCTAACGC CCTGCGGCAC GCGTTCCCTC CGGGCCACGG TGGCGAGATT
TCCGTCGAAA TCACCCGCGA TCACGGCGAT TTCCGGATCG AAATCGCGGA CAACGGCGTT
GGGGTTGAGA GTTCCGTATC GTCGGGCGGG TTCGGTCTGA CCATCGTGCA ACTGCTCTGC
CAGCAGTTGA AGGCGCGATC CGAGACCACC CCCGCCGATC CCGGAACCCG GGTCGTGGTG
TATCTGCCGG TCAACGGCGC GCATCACTAA
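
The record reports a GC content of 66% for this gene. One way to recompute such a figure from the sequence as displayed (in space-separated blocks of ten bases) is sketched below in Python. The helper name `gc_percent` is hypothetical, and the sketch assumes the value is simply the fraction of G and C bases over the whole sequence, which may differ from how the database rounds or derives it.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten-base block of the gene sequence, used only as a small example.
print(gc_percent("ATGAAGGACA"))  # prints: 40.0
```

Passing the full gene sequence as one string (line breaks and spaces included) would give the whole-gene value to compare with the reported figure.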
 
Protein sequence
MKDTSQSERA VGKPVAEHGA FDPFAAAMRT ARLPMIVTDA RQSDNPIVFA NDAFLALTGY 
DLDEVIGRNC RFLQGLETDP DQVDRLRQAV AQGEEVALEL LNYRKDGSTF WNALYLSPVR
GETGEVLYFF GTLRDISDQK RVEFELSDAR DRLEAAVEAR TRDLTQALDQ KTALLHEVDH
RVKNNLQLIS SLLLLQNRRV TDPAVKASLR GMLERVSAIA TVHRRLFQSD DVERFDVSAF
VRDLVSDMMG GARRDDIKVR LDLERIDVAA SKAAPLALVI SELFSNALRH AFPPGHGGEI
SVEITRDHGD FRIEIADNGV GVESSVSSGG FGLTIVQLLC QQLKARSETT PADPGTRVVV
YLPVNGAHH
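
The protein sequence corresponds to the gene sequence translated with translation table 11. The Python sketch below illustrates that relationship under stated assumptions: it uses the standard codon-to-amino-acid assignments (which table 11 shares with the standard code), reads from the first base in frame, and stops at the first stop codon. It does not model the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between ten-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30 bases of the gene sequence shown above.
print(translate("ATGAAGGACA CATCCCAGTC GGAACGCGCG"))  # prints: MKDTSQSERA
print(translate("TATCTGCCGG TCAACGGCGC GCATCACTAA"))  # prints: YLPVNGAHH
```

The two outputs match the first ten and last nine residues of the protein sequence listed in this record.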