Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0631 |
Symbol | |
ID | 5898086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 698559 |
End bp | 700064 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561113 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001682262 |
Protein GI | 167644599 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.22133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.283314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGAAGTG GAGACCGCTA CCACGTGATG AACACGGTTT CGGCCGGCTC GCTGCACAGC GACCTGCGGC CCCGGGCCTA TCTGGCGGCG ATAGTCACGG TGCTGGCGTG CTGCCTGGTG CAATTGGCCT TGCCGCCCGG CTTCACCGCC CCTTCGGCCT TCCTGCTGTT CGTGCCGGCG GTGCTGATCA GCGCGGCGGT CGGCGGCCTG GCGCCGGGTC TGTTCGCCAC GGTGCTGGCG GCCGGGGCGG TGTGGCTGCT GAAGGTGCGC GGCGTTCCCG ACCAGGCCAC GGCGTTGTCC AGCCTGGTGT TCGTGATGAT CGGTTTCGGC ATGTCGGTCG GCGGCGGCTG GTTCCACGCC GCCCGGGCCC GCGCCGCCGC CATGACCCAT CACCTGCAGT CGATCCTCGA CTCGGCCCCC GACGCGGTGA TCGTCATCGA CCCGGCCGGC CTGATGACCT CGTTCAGCCC CGCCGCCGAG CGGCTGTTCG GCTGGACCTC GGCCGAGGCG ATCGGCCGGA ACGTCAGCCT GCTGATGCCC GACCCCGACG GCGCCGGCCA CGACGGCTTC CTGGCCAACT ACAGCCGCAG CGGCGAGAAG CGGATCATCG GCACGGGCCG CGTCGTGGTG GGCAAGCGCA GGGACGGCTC GACCTTCCCG ATGGAGCTGG CGGTCGGCGA GACGCGGGGC GCGCGGCCGT TCTACACAGG CTTCATCCGT GACCTGACCG ATCGCCAGCA GACCGAGGCC CGGCTGCGCG ACCTGCAGAC CGAACTGGTC CACGTCTCGC GCCTGACCGC CATGGGCGAG ATGGCCTCGA CCCTGGCCCA CGAGTTGAAC CAGCCGCTGT CGGCGATCGC CAACCTGCTG ACCGGCTCGC GCCGCCTGCT CGACCGCGGC CGCCCCGAGG ACCAGGCCAA GGTGCGCGAC GCCGTCGACA AGGCCTCGGC CCAGGCCCTG CGCGCCGGCG ACGTCATCCA CCGCATGCGC GAGTTCGTCC GGCGGGGCGC GACCGAACGC GCGCCGGAAA GCCTGTCCAA GGTGGTCGAG GACGCCGCCG CCCTGGCCTT GATCGGGGCT CGCGAGCACT TGGTCCAGAC ACGCCTGCAA CTGGATCCCG CCGCCGACGC CGTCTATGCC GACCGCGTGC AGATCCAGCA GGTTCTGGTC AATCTGATCC GCAACGCCGT CGACGCCATG GCCGACTCGC CGCGCCGCGA ACTGACCATC GCCAGCCAGC GGCTCGCCAA CGGTTCGGTC CAGGTGAGCG TCACCGACAC CGGCTCGGGG ATCAGCGACG ACTTCCGCGA GCGCCTGTTC CAGCCGTTCA TGACCACCAA GGCCGAGGGC ATGGGGGTGG GCCTGTCGAT CTCGCGCTCG ATCGTCGAGG CGCATGGCGG CAAGATCTGG GCCGACGCGA ACCCCACGGG CGGGACGGTG TTCCACTTCA CCCTGCCGCC CCGCCGCGAC AAAATCGAAG AGCATGGGAA ACCGATCGAT GAGTGA
|
Protein sequence | MRSGDRYHVM NTVSAGSLHS DLRPRAYLAA IVTVLACCLV QLALPPGFTA PSAFLLFVPA VLISAAVGGL APGLFATVLA AGAVWLLKVR GVPDQATALS SLVFVMIGFG MSVGGGWFHA ARARAAAMTH HLQSILDSAP DAVIVIDPAG LMTSFSPAAE RLFGWTSAEA IGRNVSLLMP DPDGAGHDGF LANYSRSGEK RIIGTGRVVV GKRRDGSTFP MELAVGETRG ARPFYTGFIR DLTDRQQTEA RLRDLQTELV HVSRLTAMGE MASTLAHELN QPLSAIANLL TGSRRLLDRG RPEDQAKVRD AVDKASAQAL RAGDVIHRMR EFVRRGATER APESLSKVVE DAAALALIGA REHLVQTRLQ LDPAADAVYA DRVQIQQVLV NLIRNAVDAM ADSPRRELTI ASQRLANGSV QVSVTDTGSG ISDDFRERLF QPFMTTKAEG MGVGLSISRS IVEAHGGKIW ADANPTGGTV FHFTLPPRRD KIEEHGKPID E
|
| |