Gene Caul_0191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0191 
Symbol 
ID5897465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp209019 
End bp210614 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content72% 
IMG OID641560675 
Producthistidine kinase 
Protein accessionYP_001681826 
Protein GI167644163 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.363458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACC GTCCTTCCTC CGCCGGGGTC ACGCCCGAAC AGTTGGCCAC GCTAAGCCAT 
GAGTTCCGCA CCCCCCTGAA CGGCGTGCTG GGCATGGCCC GGCTGCTGGA GGGAACGCGG
CTGACCGCCG AGCAGCGAGC CTATGTCGGC GCCCTGCGCG AGAGCGGCGA CCACCTGCTG
TCGCTGGTCA ACGACGTGCT GGATTTCGCC CGGCTGGGGG CCACGGCCAT CGAGCTGCAT
ACAGCCTCGG TCGATGTCGA GAACCTGCTG CGCCAGGTGG CCGAGTTGCT CAGCCCCCGG
GCTCACGAGA AGAACATCGA GATCGCCTGG GCCGCCGCGC CCGGCCTGCC GGCCATCCTG
GCCGACGAAG GGCGCCTGCG GCAGGTGCTG CTGAACTATG CCGGCAACGC CATCAAGTTC
ACCGAGACGG GCGGGGTCCT GCTGAGCGCC GAACTCGTGG TCCCCACCTC CGATTCCGAA
GGCCGTCTTC GCTTCAGCGT CCGCGACACC GGGCCGGGCG TCGCCCCGGA AGCCCGCGCC
GCGATCTTCG AGGCCTTCGT CCAGACCGAT CCCTCGCACC AGGCCCAGCT GGGTGGAGCG
GGCCTGGGCC TGGCCATCGT CGCCCGCCTG GCCGGCGCCA TGAGCGGCGA GGCCGGGGTC
GGGGGCGAGC TGGGCCAGGG CGCCGACTTC TGGTTCGAGG CCCCCTTTGA TTTCGCGCCG
GCCATGCCGG TCGAACTGCC TCTGCACGGA CGCGCTGTCG CCATCGCCTC GCCCAACGCC
ATGGTCCGCG AGGCCGCCAT CCGCCAGATC CGCGCCAGCG GCGGCCAGGC CCTGTCGGGC
AAAACCGTCG TCTCGGCCCT GAAGGGCGCG CCCGCCGACG CGGTGCTGCT GCTCGACGCC
GCCCTCGCCG GATCGCGTGG GGCCGGATCG CGTGGGGCCG GGCCGCGCGG CGCGCTGAAG
CCGCCCATCG GCCGGGCCTG CGTGGTGCTG CTGACCCCCG ACCAGCGCGA CCGCATCCCC
AAGCTGAAGG CCGCCGGCCT CGGTTACCTG ATCAAGCCCC TGCGTCGCGC CTCGCTGATC
GCCCAGGTCC TGGCCGCCCA ATTTTCCGCC AAGGTGGCCG CCAACGAGCG CGAGATCGCC
CCGACCGCCA CGCCGGTCGC CCACGAGGAC GACCGGATCG CCCCGGCCGC CGCCCCCGGC
GTCCGCGTTC TGCTGGCCGA GGACAATCCG ATCAACGCCC TGCTGGCCCG GGCCCTGCTG
GAACGCGAGG GCTGCAAGGT CGACCGCATC GCCAGCGGCG ACGAGGCCGT CTCGGCCCTG
TCGCGCGGCT TCTACGACCT GATCCTGATG GACCTGCGCA TGCCGGGCCT GAACGGCATG
GAGGCCACCA AGGCCCTGCG CGAACGCGGT GTCACCACCC CCATCGTCGC CCTGACCGCC
GACGCCTTCG ACGAGGACCG CCGCGCCTGC CTGGCGGCCG GCATGAACGA CTTCCTGGCC
AAACCCCTGA CCCCGGCGGC CCTGCGCGGC GTGCTGATCA ACTGGACCGG GCTTGGCTGG
ACGAAAGCGG CGACGCGGGC CAAGGTCGCC TCCTAA
 
Protein sequence
MNNRPSSAGV TPEQLATLSH EFRTPLNGVL GMARLLEGTR LTAEQRAYVG ALRESGDHLL 
SLVNDVLDFA RLGATAIELH TASVDVENLL RQVAELLSPR AHEKNIEIAW AAAPGLPAIL
ADEGRLRQVL LNYAGNAIKF TETGGVLLSA ELVVPTSDSE GRLRFSVRDT GPGVAPEARA
AIFEAFVQTD PSHQAQLGGA GLGLAIVARL AGAMSGEAGV GGELGQGADF WFEAPFDFAP
AMPVELPLHG RAVAIASPNA MVREAAIRQI RASGGQALSG KTVVSALKGA PADAVLLLDA
ALAGSRGAGS RGAGPRGALK PPIGRACVVL LTPDQRDRIP KLKAAGLGYL IKPLRRASLI
AQVLAAQFSA KVAANEREIA PTATPVAHED DRIAPAAAPG VRVLLAEDNP INALLARALL
EREGCKVDRI ASGDEAVSAL SRGFYDLILM DLRMPGLNGM EATKALRERG VTTPIVALTA
DAFDEDRRAC LAAGMNDFLA KPLTPAALRG VLINWTGLGW TKAATRAKVA S