Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4471 |
Symbol | |
ID | 5901932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4845369 |
End bp | 4847252 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564990 |
Product | integral membrane sensor hybrid histidine kinase |
Protein accession | YP_001686089 |
Protein GI | 167648426 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.611462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACG ACAGCCGCAG CTCTGGAGCC GCGACTCACG CGCCGTTGCA GGCGCGGATA GCGGCTGTCG CGTTGACCAC CACGGTGGCC GCGCTGCTGG CGGCGTGCCT GTGCTTCATG CTGCAGCAAT GGAGCGTCGC CCGTCAGGAA TCCCAGACCA GCCACGAAAT CCTGAGCCAG ATGGTCGCCG CGGGCGCCGC CGCGCCGATG GCCGATCACA ACACCGTCGG CGCCTATCGC GCCCTCGAGG CGGTGGCCAA GGCCCCCAAC GTCTCCGCCG TGCGCCTGAC GGACATGCAT GGCCAGGTCG TCGCCCAGAT CGGCGCCGAG GGCGACGCCG CCGTCGACAC CCTGATCCGC CCGATCAAGC TGGGGTCTCG CCAGGTCGGA ACCCTGGCCC TGATGGCCAA GCGGCCCAGC CTGACCGACA TCATCGCCCG CGACCTGGCC CTGACCGGCA CCCTGTTCTT CGGCTCGGCC GGCCTGGCCA TCCTGCTGGC CAACGGCCTG GCCCGGCGCA TGACCAAGCC GATGGAGCGC CTGTCCCGGG CCATGCGCGA AGTCGCCGCC GACGGCCGCT TCAAGCCGGT GGAGCAGGCG GCCGACGACG ACGTCTTCCA GAGCCTGACC GACAGTTTCA ATCACCTGGT CTCGCGCCTG GAGATCAATG ACCACGACCT GCGCGGGGCG ATGGCCGAGC TGGTCAAGGC CCGCGACGAC GCCAACGCCG CCAATGTCCT GAAGTCGCAC TTCCTGGCCA ATATGAGCCA CGAGATCCGC ACGCCGCTGA ACGGCGTGCT GGCCATGACC GAGGTGATGG CCATGGGCGA GCTCAGCGCC GCCCAGCGCG AACGCCTGTC GGTGATCCGC GAATCGGGCG ACTTGCTGCT GTCGGTGCTC AACGACGTGC TCGACCTGTC GAAGATCGAA GCCGGCCGGC TGGACCTGGT CGAACGCGAC TTCGACCTGG CCAGCCTGGC CCTGTCGATC CGCGAATCCT ACGCCACCCA GGCCCGCCAG AAGAACCTGG AGTTTGGCGT CTTCGTCGCG CCAGAGGCCC TGGGTCCGTG GCGCGGCGAC GCCGACCGCC TGCGCCAGAT CCTCGGCAAC CTGGTCTCAA ACGCCCTGAA GTTCACGCTG GAGGGCTCGG TCTCGGTGCG GTTCGCCTCG GCCGACGACG GCAGCGGCCT GCGGATCGAC GTGGTCGACA CCGGCATCGG CATCGCCGCC GAAAGCCTGC CGCGCCTGTT CGACAAGTTC GTCCAGGCCG ACAGCTCGAC CACCCGCCGG TTCGGCGGCT CGGGCCTGGG CCTGTCGATC TGCAACGAAT TGGCGGCCCT GATGGGCGGC GGCGTCCATG TCCAGAGCCG CGAGGGCCAG GGCTCGACCT TCACCGTCGT GGTCGCCATG CCGCGCGGCG AGGCCACCGT CCACGTCCCG GTCGAGGCGG TCCCGCCGCC GATCGAATCG GAGCGCCGGC TGCGCGTGCT GGCCGCCGAC GACAATCCGA CCAACCAGAA GGTCATCGCC GCCGTCCTGG CCCCGCTGGG CGCCGAGGTC GAACTGGTCG CCGACGGCGC GGCCTGCGTC GAGGCCTGGA AGCGCGGCGG CTTCGACATC GTGCTGATGG ACATCCACAT GCCGGTGATG GACGGGGTCG AGGCCGCCCG CACCATCCGT TCGCTCGAGG TCAGCGAGGG CCGCAAGCGC ATCCCGATCG TGGCGGTCAC CGCCAACGCC CTGGTCCATC AGGTCGAGGG CTACATGGCC GCCGGCATGG ATGGCCATGT CGCCAAGCCG ATCGAGGTGA CCAAACTCTA CGACGCGATC GAGACCGCCG TGGCGATCGC CCGCCGGGAC GGCTCGAAGG TCGCGGCGGC TTAG
|
Protein sequence | MIDDSRSSGA ATHAPLQARI AAVALTTTVA ALLAACLCFM LQQWSVARQE SQTSHEILSQ MVAAGAAAPM ADHNTVGAYR ALEAVAKAPN VSAVRLTDMH GQVVAQIGAE GDAAVDTLIR PIKLGSRQVG TLALMAKRPS LTDIIARDLA LTGTLFFGSA GLAILLANGL ARRMTKPMER LSRAMREVAA DGRFKPVEQA ADDDVFQSLT DSFNHLVSRL EINDHDLRGA MAELVKARDD ANAANVLKSH FLANMSHEIR TPLNGVLAMT EVMAMGELSA AQRERLSVIR ESGDLLLSVL NDVLDLSKIE AGRLDLVERD FDLASLALSI RESYATQARQ KNLEFGVFVA PEALGPWRGD ADRLRQILGN LVSNALKFTL EGSVSVRFAS ADDGSGLRID VVDTGIGIAA ESLPRLFDKF VQADSSTTRR FGGSGLGLSI CNELAALMGG GVHVQSREGQ GSTFTVVVAM PRGEATVHVP VEAVPPPIES ERRLRVLAAD DNPTNQKVIA AVLAPLGAEV ELVADGAACV EAWKRGGFDI VLMDIHMPVM DGVEAARTIR SLEVSEGRKR IPIVAVTANA LVHQVEGYMA AGMDGHVAKP IEVTKLYDAI ETAVAIARRD GSKVAAA
|
| |