Gene Caul_1079 details


Gene Information

Locus tag: Caul_1079
Symbol:
ID: 5898534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1140444
End bp: 1141982
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 72%
IMG OID: 641561561
Product: CBS sensor hybrid histidine kinase
Protein accession: YP_001682707
Protein GI: 167645044
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG0784] FOG: CheY-like receiver
TIGRFAM ID:
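
The length fields in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (an assumption, although it agrees with the listed Gene Length) and that the final codon of the CDS is a stop codon (consistent with the TAG that ends the gene sequence in this record). The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1140444
end_bp = 1141982

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not
# counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1539 512
```

Both results match the Gene Length (1539 bp) and Protein Length (512 aa) fields.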


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.721589
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACGC TCGACCGCCT TATTGACCGC CGAGCCCCGA TCGATCCCGC GACCCCCTGC 
GCGGACGTCC GGGCGATTTT CTTGGCCGAA GCGCACGCCG CCGCCGTGGC CGTCGTCGTG
GCGGGCAAGC CCGTGGGGCT GGTCTATCGC GACGTGTTCC TGGGCCAGAT GGCCGTCGCC
GATCTGGACG CCCGCCCGGT CTCCGAGGTC ATGGACCGCG AACCGCGGAC CGTCGAGTGC
AGCCTCACGG CCACCGCCTT CGTCGAGAGC ATCACGCAGA GCGCCATCCC CGTCTTCCGC
AGCGCCTATG TCAGCGTCGA CGAAGCCGGC GACTATGTCG GCGTTGGCGG CCTGAGCTCG
CTGCTCGCCT CGCACCGCCG CCGCCAGCGC GAGGCCGAGG AGGCCATGGC CCTGGTCGAG
CGCATGGCCG TCGATGTCAG CCACCATCTG GAAGGCGTCC TGGCCTTCAC CGAGCGGCTG
GAGCAGTCGC GCCTGACGCC CGACGCCGCC GCCTATGTCC GCGCCATCGG CGACACCAGC
CGCGACATGA GCCAGGTGCT CGGCCGGGCC ATGGACCTGC GCCGCGCCGC CACCGGCGGC
CTGACCCTGA CCCCCGCCCC GTCCCTGCTG CGCGACCTCA GCGACGCCGT CGAGGCCCGC
TGGAGCGCCC GCGCCGCCGA GGGCGGCTCA ACCCTGCTGT TCTCCTACGA CGGCGACCCC
GAAGCCGCCG CCCTGATCGA CGCCGACCGT GTGTTGCAGG TGTTCGACGC CCTGATCGAC
AGCGCCCTGT CCAGCGGTCG CGGCGTGATC GAGGCCAGCC TCAAGGCCCG TCCGGTCAAT
CTTGAGCATG GGGGCGGCCT GCGGCTTGAA GGCCGCGTGC GCGACAACAC CGCCGGCTCG
CCCGAGGAAC GCCTGGCCCG GGTCTACGAC CCGCTGGGCG CGGGCAGCAT CGAGGATCGC
AACGAACTGG CCCTGGGCGT CAGCATGGCC CTGGCCCACG GCCTGACCCG CGCCATGGGC
GGCCCGCTGC GCGCCGAGGC CAATCTTGGC GCCGGCCTGA CCCTGCACTT CTCGGTGACC
GCCCCGCAGG TCAACATGAT CCAGGGTCCC GCCGAGGAAC CGACGATGGA CGCCCGCTCG
GCCCACATCC TGATCGTCGA TGACAACGCC ACCAACCGCA TGGTCGCCGA GGCCCTGTGC
GAGATGTTCG ACTGCACCTC CGAGCAGGTG GTCGACGGGC TCGAGGCCGT CGAGGCCGCC
AAGTCTGGCC GCTTCGACCT GATCCTGATG GACATCAAGA TGCCGCGCAT GGACGGCGTC
GCCGCCACCC GCGCCATCCG CGAACTGCCC GGCCGGGCCG GCAGCGCCCC GATCGTCGCC
CTGACCGCCA ACGCCGACCC CGCCGACGTC GCCACCTACG TCGCCGCCGG CATGCAGGAC
GTGGTCGAAA AGCCGATCAA GCCCGAACGC CTGGCCGTGG TGCTCAGCGC CCTGCTCGGC
GGCGACAACG AGAACGCGGA CGCCGAAGCG GCGGCCTAG
 
Protein sequence
MDTLDRLIDR RAPIDPATPC ADVRAIFLAE AHAAAVAVVV AGKPVGLVYR DVFLGQMAVA 
DLDARPVSEV MDREPRTVEC SLTATAFVES ITQSAIPVFR SAYVSVDEAG DYVGVGGLSS
LLASHRRRQR EAEEAMALVE RMAVDVSHHL EGVLAFTERL EQSRLTPDAA AYVRAIGDTS
RDMSQVLGRA MDLRRAATGG LTLTPAPSLL RDLSDAVEAR WSARAAEGGS TLLFSYDGDP
EAAALIDADR VLQVFDALID SALSSGRGVI EASLKARPVN LEHGGGLRLE GRVRDNTAGS
PEERLARVYD PLGAGSIEDR NELALGVSMA LAHGLTRAMG GPLRAEANLG AGLTLHFSVT
APQVNMIQGP AEEPTMDARS AHILIVDDNA TNRMVAEALC EMFDCTSEQV VDGLEAVEAA
KSGRFDLILM DIKMPRMDGV AATRAIRELP GRAGSAPIVA LTANADPADV ATYVAAGMQD
VVEKPIKPER LAVVLSALLG GDNENADAEA AA
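
The sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, which shares its codon-to-amino-acid assignments with translation table 11 (table 11 differs only in the start codons it allows). Only the first line of the gene sequence is used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq):
    """Translate seq codon by codon, stopping at the first stop codon.

    Codons with unexpected characters become 'X'; a trailing partial
    codon is ignored.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
fragment = clean_sequence(
    "ATGGATACGC TCGACCGCCT TATTGACCGC CGAGCCCCGA TCGATCCCGC GACCCCCTGC"
)
print(translate(fragment))  # MDTLDRLIDRRAPIDPATPC
print(round(gc_content(fragment), 1))
```

The translated fragment agrees with the first 20 residues of the protein sequence listed above.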