Gene Information |
Locus tag | Caul_4725 |
Symbol | |
ID | 5902187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5112197 |
End bp | 5114041 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641565244 |
Product | integral membrane sensor hybrid histidine kinase |
Protein accession | YP_001686343 |
Protein GI | 167648680 |
COG category | [K] Transcription; [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
|
|
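The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(5112197, 5114041)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the result agrees with the listed Gene Length (1845 bp) and Protein Length (614 aa).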
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.269083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0435637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTGG GAGAACGGGA CGACCGACTG CGAAGCGACC CAATCAGGCT GGCGCTGCTT GGGCACGCCC ATCGCAACGC CTTGGCGACC ATGGCGGTGC AGGCGGCCGC CGCGATTGGC GTCTGCCTGG TCGCCCAGCC CGTGGAGCGA CCCTTTCGGC TGATGTGGCT GGCGGTCGTT CTGGCGCTGT TGGCGGTCAG GCTTTTCACT GATCGTCTGC TCGGCGCGGC CCTTGGCGGT CGGTTGATGG CCGACCGGCT TTTGCTGCTG GCGCGAGTGC ACAGCCTGGG CCTGATCCTG AGCGCGGGAT TGTGGGCGCT GCTGGCTTGC GTGCGAATTC CCCAGGACAG CGTCAGCGCC CGCTACGTGC TGATCATCGT CTTGTCCGCC TTGGCCGGAG GCGCGGTGGG CGTCCTGTCG CCGCTCAAGT GGACGGGCCG GATCTATGTT TCCCTGATCC TTCTGCCGGC CAGCCTGACC CTGATTCTCA ATCGCGGCGT GGACGCCACC TTGGGCGTTC TCGGCGTGAT CTTCTGGATC GTGATGATCG TGGGTCATCG CAACAACCAT GCCCTTTTGG TCGACGCCTT GCGACTGCGC GACGAGAACC GCGAACTGTT GGCGGACGTC GCGCGGCGCA ATCACGCGAC CCTTCGCCTG AACCATGATC TCGAAAGCAG CGTTCGCGCC CGCACGATCG AGCTTGAGCG CATGACCGAA GAGGCCAAGG TCGCCAACCG CGCCAAGTCG CAATTCCTGG CCACGGTCAG CCACGAAATG CGCACGCCGT TGAACGCCAT TCTGGGCGAG GGTCAATTGC TGGCGCGGGA GGCGCTGACG CCGTCCCAGC GGAGTCGCCT CCAAGTCATC GACACCGCGT CCCGGGCGAT GCGGCACCTG ATCGACGATG TGCTCGACAT CTCGCAGATC GAGGCGGGCG CGCTTCGGCT GAGACCTAAG GTGTTCGCGC TCGCCACGCT CGTGGACGAT ATCCAGCAGA TCTACCGACC GCTTGCCGAA GGGCGCGGTT TGTCGCTGAC GGTCTCGCTG CAACCGGAGA CGGCGCCGTT CCGGCGCGGC GACCCTGATC GGCTTCGTCA GATCGTCGGT AACCTGATCG CCAACGCCTT GAAGTTCACG CGCCAGGGCG GCGTGACCGT CAAGATCGGC GGCGACGACG AACAGCTCAC GGTTTCGGTG AGCGATACGG GGATCGGCAT CGACGCCAAG GATCACGAGA CGATCTTCCA GCGCTTCGTT CAGGTCGATA GTTCATCGAC GCGGGAGGCT GGCGGCATCG GCCTGGGCCT GGCCATCTGT CGCGAACTTT CCGAACAGAT GGGAGGCTCT CTGAAGGTGA TCTCGGCGCG GGGCATCGGC GCGCGGTTCG ATTTCAGCGC GCCGATCCCA TGCGTCCTGG CCTCCGCCCC GCTAGCCGTC GCCGAGGACG CCGTGTCCGA CGATGGAGCG CCGGGCTCGG TCTTAGTGGT TGATGATAAT CCCGTGAACC GCCGCATCCT GGCCGCCCTG ATGGAGCCGT TCGGCGTCGA ATGCGGCTTC GCGACCAGCG GGAAGGAAGC CGTGGAGGCG TGGCGTCGCC AGCCCTGGGA CGCGATCTTC ATGGACGTGC ACATGCCGGA CATGGACGGC GTCGAAGCCT CGCGGACGAT CCGCGCCGAA GAGATTGTCG CCGGCCGCGG CAGGACGCCG ATCGTCGCCG TCACCGCCAG CGTGCTCACC CATGAGGTGG AAGCCTATCG GCAAGCCGGT ATGGACGATG TGCTGCCCAA GCCCGTAGAC 
GCTTCGGCCT TGGCCAGCAT GTTGTCGCGC TGCGCCGCGG CCTGA
|
Protein sequence | MQLGERDDRL RSDPIRLALL GHAHRNALAT MAVQAAAAIG VCLVAQPVER PFRLMWLAVV LALLAVRLFT DRLLGAALGG RLMADRLLLL ARVHSLGLIL SAGLWALLAC VRIPQDSVSA RYVLIIVLSA LAGGAVGVLS PLKWTGRIYV SLILLPASLT LILNRGVDAT LGVLGVIFWI VMIVGHRNNH ALLVDALRLR DENRELLADV ARRNHATLRL NHDLESSVRA RTIELERMTE EAKVANRAKS QFLATVSHEM RTPLNAILGE GQLLAREALT PSQRSRLQVI DTASRAMRHL IDDVLDISQI EAGALRLRPK VFALATLVDD IQQIYRPLAE GRGLSLTVSL QPETAPFRRG DPDRLRQIVG NLIANALKFT RQGGVTVKIG GDDEQLTVSV SDTGIGIDAK DHETIFQRFV QVDSSSTREA GGIGLGLAIC RELSEQMGGS LKVISARGIG ARFDFSAPIP CVLASAPLAV AEDAVSDDGA PGSVLVVDDN PVNRRILAAL MEPFGVECGF ATSGKEAVEA WRRQPWDAIF MDVHMPDMDG VEASRTIRAE EIVAGRGRTP IVAVTASVLT HEVEAYRQAG MDDVLPKPVD ASALASMLSR CAAA
|
| |
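The gene sequence is displayed in space-separated blocks of ten bases, so it needs light cleaning before any calculation. The sketch below is one possible way to strip the spacing, compute the GC fraction for comparison against the reported GC content, and check that the sequence ends in a stop codon. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the display."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the final three bases form a stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS


# Short fragments copied from the start and end of the gene sequence above;
# pass the full sequence to compare against the reported GC content.
first_blocks = "ATGCAATTGG GAGAACGGGA"
last_blocks = "TGCGCCGCGG CCTGA"
print(gc_fraction(first_blocks), ends_with_stop(last_blocks))
```

Because the displayed sequence starts with ATG and ends with a stop codon, it appears to be given in coding orientation even though the gene lies on the minus strand, so no reverse complement is applied here.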