Gene Caul_3301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3301 
Symbol 
ID5900756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3574881 
End bp3576803 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content69% 
IMG OID641563807 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_001684926 
Protein GI167647263 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.763924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCG ACCTGATGGC TCACTTCCTG AGTGAAGGTC GCGAACTGGT CGCCAGCGCC 
GAGCGCGACC TGGCGAGTTT GGCGCGCCAG CCCGACGACG CCAACGCACT GGATGGCTGC
TTTCGGGCGA TCCACACGCT AAAGGGTTCG GCGGGATTGT TCGACCTGCT GCCCATGAGC
GTGATGCTGC ACGCCGCCGA AGATCTGCTC GCGCTGCTGC GCGCCGAGCG CACCGGCGTG
GCCGAGGACT TCGAGGCGCT GTTCAGCGTC GTCGACACCG TCGATCGCTG GCTAGACGCG
CTGGACCAGG CGGGCGCTCT GCCGGCCAAC GCCGAACAGA TCGGGCAGAC GGAAGCCTTG
CGTCTTCGGG ATCTGGTCGC TTCGGTGAGC GCGCCCACGG ATGTCGCATC CTGGGCGTCG
CCGCCGCCGC CGACCTGGCG ACCGCCCCAG ACCTTCAACG GCAAGGGCGC CATGGCCCTT
CGCTATACGC CGCGCGCCGA CAGCTATTTT TCCGGCGACG ATCCCATCGC CATTGTCGCG
GCGACGCCCG GCCTGGCGGG GCTGAAGATA TCGCCGCGCG AGCCCTGGGG CGAGATCGAG
GACTATGACC CCTATGCCTG CAACCTGGTG CTGGAGGCGG TTTCCACCGC CAGCCGGGCA
GAGGTCGAGG CGGCGTTTCG CTTCGTGGCC GACCAGGTCG AGTTCGTCGA TCTGACGTCG
AGCGAGCCGG CGCTCGCGCC CGAAGCGCAG GGCGCACGCA AGACCCTGCG TATCGACGCC
GAGCGCGTGG ACCGATTGGC CGGACTGGCC GGCGATCTGG TCATCGCCAA GAACGGGCTG
TCGGAGTTGG CCGCCCAGGC CGAGGGCCTG CCTGGGGGTC AAGCCCTGGG TCAAGCCTTG
CGCGCGCGGC AAGCCCTGCT TGATCGCCTG GTGGGCGACC TGCACGCGAC CGTGGGAAAG
GTCCGTCTCG TGGCGCTGGG GCCGCTGTTC GCCCGGTTCC ATCGCCTGGC GCGCGAGATC
GCCCGTTCGC TGCACAAGGA GATCTCGCTG GAGGTGGAGG GCGGCGACAT CGAGGTCGAC
AAGACCATTG TCGACGGCCT GTTCGAGCCT CTGTTGCATG TCCTGCGCAA CGCGATCGAT
CATGGTGTCG AGCCCACTGA CGTTCGCGCT GGCGCCGGAA AGCCCGCGAC CGCCACCATC
CGGTTCAAGG CCCGGGCGGC GGCGGATCAG GTGGTGATCG AGGTTCGTGA TGACGGCGCG
GGTATCGATC CGGCCAAGGT CCGCGCCCTG GCCGTCACGC GCGGAGTGTT GACCCAGGAG
GCGGCCGACC GCCTGGATGA TCGCGCATCG ATCGACCTGA TCTTCACTCC CGGCTTCTCG
ACGGCCACCG AGATCAGCTC GGTGTCGGGC CGGGGCGTCG GCATGGACGT CGTGCGCGAC
GCGGCCGGGA AGCTGGGCGG CAAGGTCATC GTCGAAAGCG AGAAGGGGCG GGGCACGACC
GTGCGGTTCA TCCTGCCGGT GACCATGGTC CTCACCAAGG TGATGGTCGT GACCTGCGGC
GAGGAGCGCT ATGGCTTGGC GTTGGACACG GTGGTCGAGA CCGTCCGGGT CGCGGCCGAC
CGCATCGTCG CCGTGCGCGC GGGCAGGGCG TTCCAGTTGC GCGACGCCGT GATCCCCCTA
GTGTCGCTCG GCGACCTCGT GGGGGCCGCC GCGTCTGAAG CCAGATCAGC CGAGCGGGTG
GTCGTGGCGA GGGCTCAAGG CGAATTGGTC GGCTTCGCGG TGGATGCGAT CGTCGATCGC
ATGGACGCCG CCGTGCGGCC CATGACCGGA TTGCTTGCCG GCGCGCCGGG CGTCATGGGC
GCCACGCTGC TCGCCGACGG CGCGGTGTTG ATGATTCTCG ATCCGGCGGA GCTGATCCGG
TGA
 
Protein sequence
MTGDLMAHFL SEGRELVASA ERDLASLARQ PDDANALDGC FRAIHTLKGS AGLFDLLPMS 
VMLHAAEDLL ALLRAERTGV AEDFEALFSV VDTVDRWLDA LDQAGALPAN AEQIGQTEAL
RLRDLVASVS APTDVASWAS PPPPTWRPPQ TFNGKGAMAL RYTPRADSYF SGDDPIAIVA
ATPGLAGLKI SPREPWGEIE DYDPYACNLV LEAVSTASRA EVEAAFRFVA DQVEFVDLTS
SEPALAPEAQ GARKTLRIDA ERVDRLAGLA GDLVIAKNGL SELAAQAEGL PGGQALGQAL
RARQALLDRL VGDLHATVGK VRLVALGPLF ARFHRLAREI ARSLHKEISL EVEGGDIEVD
KTIVDGLFEP LLHVLRNAID HGVEPTDVRA GAGKPATATI RFKARAAADQ VVIEVRDDGA
GIDPAKVRAL AVTRGVLTQE AADRLDDRAS IDLIFTPGFS TATEISSVSG RGVGMDVVRD
AAGKLGGKVI VESEKGRGTT VRFILPVTMV LTKVMVVTCG EERYGLALDT VVETVRVAAD
RIVAVRAGRA FQLRDAVIPL VSLGDLVGAA ASEARSAERV VVARAQGELV GFAVDAIVDR
MDAAVRPMTG LLAGAPGVMG ATLLADGAVL MILDPAELIR