Gene Information |
Locus tag | Caul_3301 |
Symbol | |
ID | 5900756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3574881 |
End bp | 3576803 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563807 |
Product | CheA signal transduction histidine kinase |
Protein accession | YP_001684926 |
Protein GI | 167647263 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
|
|
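The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 1923 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table for Caul_3301.
length = gene_length_bp(3574881, 3576803)
print(length)                     # 1923
print(protein_length_aa(length))  # 640
```

Both results agree with the Gene Length and Protein Length rows, which suggests the record is internally consistent under these assumptions.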
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.763924 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCG ACCTGATGGC TCACTTCCTG AGTGAAGGTC GCGAACTGGT CGCCAGCGCC GAGCGCGACC TGGCGAGTTT GGCGCGCCAG CCCGACGACG CCAACGCACT GGATGGCTGC TTTCGGGCGA TCCACACGCT AAAGGGTTCG GCGGGATTGT TCGACCTGCT GCCCATGAGC GTGATGCTGC ACGCCGCCGA AGATCTGCTC GCGCTGCTGC GCGCCGAGCG CACCGGCGTG GCCGAGGACT TCGAGGCGCT GTTCAGCGTC GTCGACACCG TCGATCGCTG GCTAGACGCG CTGGACCAGG CGGGCGCTCT GCCGGCCAAC GCCGAACAGA TCGGGCAGAC GGAAGCCTTG CGTCTTCGGG ATCTGGTCGC TTCGGTGAGC GCGCCCACGG ATGTCGCATC CTGGGCGTCG CCGCCGCCGC CGACCTGGCG ACCGCCCCAG ACCTTCAACG GCAAGGGCGC CATGGCCCTT CGCTATACGC CGCGCGCCGA CAGCTATTTT TCCGGCGACG ATCCCATCGC CATTGTCGCG GCGACGCCCG GCCTGGCGGG GCTGAAGATA TCGCCGCGCG AGCCCTGGGG CGAGATCGAG GACTATGACC CCTATGCCTG CAACCTGGTG CTGGAGGCGG TTTCCACCGC CAGCCGGGCA GAGGTCGAGG CGGCGTTTCG CTTCGTGGCC GACCAGGTCG AGTTCGTCGA TCTGACGTCG AGCGAGCCGG CGCTCGCGCC CGAAGCGCAG GGCGCACGCA AGACCCTGCG TATCGACGCC GAGCGCGTGG ACCGATTGGC CGGACTGGCC GGCGATCTGG TCATCGCCAA GAACGGGCTG TCGGAGTTGG CCGCCCAGGC CGAGGGCCTG CCTGGGGGTC AAGCCCTGGG TCAAGCCTTG CGCGCGCGGC AAGCCCTGCT TGATCGCCTG GTGGGCGACC TGCACGCGAC CGTGGGAAAG GTCCGTCTCG TGGCGCTGGG GCCGCTGTTC GCCCGGTTCC ATCGCCTGGC GCGCGAGATC GCCCGTTCGC TGCACAAGGA GATCTCGCTG GAGGTGGAGG GCGGCGACAT CGAGGTCGAC AAGACCATTG TCGACGGCCT GTTCGAGCCT CTGTTGCATG TCCTGCGCAA CGCGATCGAT CATGGTGTCG AGCCCACTGA CGTTCGCGCT GGCGCCGGAA AGCCCGCGAC CGCCACCATC CGGTTCAAGG CCCGGGCGGC GGCGGATCAG GTGGTGATCG AGGTTCGTGA TGACGGCGCG GGTATCGATC CGGCCAAGGT CCGCGCCCTG GCCGTCACGC GCGGAGTGTT GACCCAGGAG GCGGCCGACC GCCTGGATGA TCGCGCATCG ATCGACCTGA TCTTCACTCC CGGCTTCTCG ACGGCCACCG AGATCAGCTC GGTGTCGGGC CGGGGCGTCG GCATGGACGT CGTGCGCGAC GCGGCCGGGA AGCTGGGCGG CAAGGTCATC GTCGAAAGCG AGAAGGGGCG GGGCACGACC GTGCGGTTCA TCCTGCCGGT GACCATGGTC CTCACCAAGG TGATGGTCGT GACCTGCGGC GAGGAGCGCT ATGGCTTGGC GTTGGACACG GTGGTCGAGA CCGTCCGGGT CGCGGCCGAC CGCATCGTCG CCGTGCGCGC GGGCAGGGCG TTCCAGTTGC GCGACGCCGT GATCCCCCTA GTGTCGCTCG GCGACCTCGT GGGGGCCGCC GCGTCTGAAG CCAGATCAGC CGAGCGGGTG GTCGTGGCGA GGGCTCAAGG CGAATTGGTC GGCTTCGCGG TGGATGCGAT CGTCGATCGC 
ATGGACGCCG CCGTGCGGCC CATGACCGGA TTGCTTGCCG GCGCGCCGGG CGTCATGGGC GCCACGCTGC TCGCCGACGG CGCGGTGTTG ATGATTCTCG ATCCGGCGGA GCTGATCCGG TGA
|
Protein sequence | MTGDLMAHFL SEGRELVASA ERDLASLARQ PDDANALDGC FRAIHTLKGS AGLFDLLPMS VMLHAAEDLL ALLRAERTGV AEDFEALFSV VDTVDRWLDA LDQAGALPAN AEQIGQTEAL RLRDLVASVS APTDVASWAS PPPPTWRPPQ TFNGKGAMAL RYTPRADSYF SGDDPIAIVA ATPGLAGLKI SPREPWGEIE DYDPYACNLV LEAVSTASRA EVEAAFRFVA DQVEFVDLTS SEPALAPEAQ GARKTLRIDA ERVDRLAGLA GDLVIAKNGL SELAAQAEGL PGGQALGQAL RARQALLDRL VGDLHATVGK VRLVALGPLF ARFHRLAREI ARSLHKEISL EVEGGDIEVD KTIVDGLFEP LLHVLRNAID HGVEPTDVRA GAGKPATATI RFKARAAADQ VVIEVRDDGA GIDPAKVRAL AVTRGVLTQE AADRLDDRAS IDLIFTPGFS TATEISSVSG RGVGMDVVRD AAGKLGGKVI VESEKGRGTT VRFILPVTMV LTKVMVVTCG EERYGLALDT VVETVRVAAD RIVAVRAGRA FQLRDAVIPL VSLGDLVGAA ASEARSAERV VVARAQGELV GFAVDAIVDR MDAAVRPMTG LLAGAPGVMG ATLLADGAVL MILDPAELIR
|
| |
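As a rough check of the Sequence section, the sketch below translates a nucleotide string and computes its GC fraction using only the Python standard library. It assumes the space-separated 10-base blocks are purely a display convention and that the gene sequence is already shown 5' to 3' in the coding orientation (the record lists the strand as "-", so no reverse complement is applied here). The amino acid assignments are those of NCBI translation table 11, which are the same as the standard code; alternative start codons are not handled, which does not matter here because the gene begins with ATG. All helper names are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence listed above.
fragment = "ATGACCGGCG ACCTGATGGC TCACTTCCTG"
print(translate(fragment))             # MTGDLMAHFL
print(f"{gc_fraction(fragment):.0%}")  # 60%
```

The translated fragment matches the first ten residues of the listed protein sequence. The 60% figure applies only to this 30-base fragment; running `gc_fraction` on the full gene sequence is the way to compare against the 69% reported in the Gene Information table.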