Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0079 |
Symbol | |
ID | 5897791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 94113 |
End bp | 96032 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641560562 |
Product | signal transduction histidine kinase |
Protein accession | YP_001681715 |
Protein GI | 167644052 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0784] FOG: CheY-like receiver [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.464916 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGATG CGGGCGGAAC GCTGGCAGAT GATCACGAGG CGCGCCGGCT GCGCGCCCTC GACGCCCTTC GCGCCCTCGA CGGCGACCCG TGCGACCCCC GATTCGATCG GATCGTCCGC TTGGCCTCGC GCCTGTTCAA CGCGCCGCGC GCCGCGATCC GGCTGATCGG CAAGGACCGG GTCTGGTTGA AGGCCAGGGT GGGCTTTGAC CATGTCGAAG AAGCTCGCCC CCCCGGCCTG AGCGAGCGCC TGCGCGAAAC CGGCGTGGTC TCGCATCCCG ATCTCGCCCA CGCCGCCTCG GACCAAACCC TACGGCCCTG GTGCGCCGAC AGCCGCTTCT TCGCCTGCGC GCCGCTCAAG AGCGCGGCGG GGGATATCGT CGGCCTGCTG ACCGTCGAGG ACCCCCGGCC CCGCGATGCG GTCGATGCGG GCCTGACCGA GGCCCTGGCC GACCTCGCGG CCCTGGCGAT GGAAGAACTG CTGCATGACG CCGAGACGGC CCGCAACGCC GCCGAGCGCG CGCTCGACAG CGAGCGCATC GCCCTGGCCC TGCGGGCGGC CAATCTGGGC GAGTTCGTCT GGGACATCGT CGCCGACACG GTGCGGGTCA GCCCGCGGAT GTCGCGGATC ACCGAGATTC CGGAAGGGGT GGCCCCGGCG GACGGCGGCA AGGCCCTCTA CGCCTTCATC CACCCTGACG ATCGCGAGGC CACCCGCGCC GAGATCGAAG CCCAGCTGAA GGCCCAGGGC CGCTACGAGG TCGAGTTCCG GCGCGTGACC TCGGATCCGG ACCGGGTGAT CTGGAACCGC GTGGCCGCCC TGATGGTGCT CGACGCGGCC GACCAGCCCG TGCGGCTGAT CGGCGTGGTG CAGGACGTCA CCGCGCGCCG CGACGCCGAC GATCAGCGCG AGAACCTGCT CACCGAGCTG GATCACCGGA TCAAGAACAT CCTGGCCGCC GTGCTGTCGG TGGCCGGCCA GTCGGCCCGC AAGGCCTCGT CGCTGGATGG CTTCTTAAAG GCCTTCACCG GCCGGCTGAA ATCGATGAGC TCGGCCCATG ACCTGCTCAG CGCCGCGCGC TGGCGCGGGG CCACCCTGGC GCGGATCGCC GCCGCCGAGC TGGGCGGTCT GGCCCCCAAC CAGACCCGCT GGGACGGGCC GGAGCTGTTC CTGACGCCCC GCGCGGCGGC CGCCCTGTCG CTGACCCTGC ACGAACTGGC CGTCAACGCC GTGAAGTTCG GGGCCCTGTC CTCGGAGAGC GGCCGGGTCG AGGTCGTCTG GCGCGGCTCG CCCGAAGGCG GCTTCAACCT CGAATGGCTG GAGACCGGTG GACCCATGAC CTCGCCGCCA GCCACCCGCG GCTTCGGCAT GACCCTGATC GAGGACGTGG TCGGTCGCGA ACTGGGGGGG CGGGCCAAGA TCGAATACAA GCGCAGCGGC GTCACGGCGA TGATCCACGC CGCCGCCGAC GCCCTGGTCG AGACGCCCGA ACCCGAGCCG GCCGCGCCCC CGAACGAACG CATCGTCGAG ACCGTGGGCG GCGGCGACGA CAGCTTCCGG GCCGGCGACA TCGCGGGCCT GCGCGTGCTG ATCGTCGAGG ATTCGCTGCT GCTGGCCATG GAGTTGGAGG CGGGGCTGGA GGATTCCGGC GTCGAGGTGG TGGGGTGCGC CGCCGAACTG TCCGAGGCCC TGCAGATGCT GGAGCTGTCG TTCGACGCCG CCGTGCTCGA CGCGGACCTC AACGGCCAGT CGGTGGCGCC GGTCGCCGAG ATCCTACGTC GCGAGGGCCG GCCCTTCGTG TTCGCCACCG GCTACGCCGA CAAGGCCGCC CCGATGGGGT TCGACGCCCC GATCGTCCGC AAGCCCTACA ACGTCCACCA GATCGCCCGG GCGCTGGCGT CGGTGACGGG GCGCGGCTGA
|
Protein sequence | MDDAGGTLAD DHEARRLRAL DALRALDGDP CDPRFDRIVR LASRLFNAPR AAIRLIGKDR VWLKARVGFD HVEEARPPGL SERLRETGVV SHPDLAHAAS DQTLRPWCAD SRFFACAPLK SAAGDIVGLL TVEDPRPRDA VDAGLTEALA DLAALAMEEL LHDAETARNA AERALDSERI ALALRAANLG EFVWDIVADT VRVSPRMSRI TEIPEGVAPA DGGKALYAFI HPDDREATRA EIEAQLKAQG RYEVEFRRVT SDPDRVIWNR VAALMVLDAA DQPVRLIGVV QDVTARRDAD DQRENLLTEL DHRIKNILAA VLSVAGQSAR KASSLDGFLK AFTGRLKSMS SAHDLLSAAR WRGATLARIA AAELGGLAPN QTRWDGPELF LTPRAAAALS LTLHELAVNA VKFGALSSES GRVEVVWRGS PEGGFNLEWL ETGGPMTSPP ATRGFGMTLI EDVVGRELGG RAKIEYKRSG VTAMIHAAAD ALVETPEPEP AAPPNERIVE TVGGGDDSFR AGDIAGLRVL IVEDSLLLAM ELEAGLEDSG VEVVGCAAEL SEALQMLELS FDAAVLDADL NGQSVAPVAE ILRREGRPFV FATGYADKAA PMGFDAPIVR KPYNVHQIAR ALASVTGRG
|
| |