Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4629 |
Symbol | |
ID | 5902091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5006987 |
End bp | 5008666 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641565148 |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_001686247 |
Protein GI | 167648584 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.119322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCGC ATCCCGGGCC AGAGGGCAGG CCGAAGCGCC GGTTGACCTG GCCCGGGTCC GCCCGCTTCA GGCTTGCCTG GCCCAGGGGT TCGCGCCTGG GCCGGCTGAT CATCGGGCTC AACCTGCTGG CCCTGGCGGT CCTGCTGGGC GGAGCCCTGG TGCTCAACGA GCTGCGCAGC GGCCTGATCA AGGCGCGCAT CGACAGCCTG ACGACCCAGG GCGAGCTGAT CGCCAATGTG ATCGACCTAG CCGCGACGGT GGGCGATCCC GAACCGCGGC TGGCGCCCGA CCAGGCCAGC GACATCCTGC AGAGCCTGTT CATCCCGCGC TCGCAGCGGG CGCGGCTGTT CGACGCTGAC GGCAACCAGC TGGCCGACTC CTACGTCGTC GCCGACCGGG TGGAGTCCAA GGTCCTGCCC CCGGCTCGCA AGCCGGGCCA GCCGGCCTTC GGCCTGCCGG CCCCGGACGC CGGCGCCAAG CCCAAGGCCG CCGAGGCCGC GCGCAAGGCC CTGGCCGCCG AGATCGCCCA GGCCAAGCTG GGCGAGCCGG TGGCGGGAAT GCGCCGGGCC GAGAACGGCG AGCGGGTGGT GTCGGTGTCG ATCCCCATCC AGCATGTGCG CGCGGTCCTG GGCGTCCTGA CGCTTGAAGC GGGGGATGTC GACGAGATCA TCGCCGCCGA GCGCAAGGCC CTGCTGCCCT TCGCCCTGAT CGCCATCGCC ACCACCCTGA TCTCGTCGTT CCTGCTCAAC CGTCTGATCG CCCAGCCGGT GCTGCGGATG GCCAGCGCCG CCGACCGCGT GCGGCTGGAC GGGGCGCGGG CCATCTCCCT GCCAGACATC TCCGGCCGCA AGGACGAGCT GGGCGACCTG TCGCGCGCCC TGGAGGAGAT GACGGATTCA ATGTCCGAGC GGATGGACGC CATCGAGCGC TTCGCCGCCG ACGTCGCCCA CGAGATCAAG AACCCCCTGA CCTCGATCCG CTCGGCGATC GAGACCCTGG ACCTGGTGTC CGACCCTGCC GCCCGGGCCC GGCTGCTGGC CATCCTGCAG CAGGACGTCA CCCGGCTGGA CCGGCTGGTC ACCGATATCT CCAACGCCTC GCGGCTGGAC GCCGAGCTGT CGCGCGAGGC GCCGCGGGCC TTCGAGTTGA ACCTGCTGCT GGCCGAGGTG ATCCATCTCT ACGAGGCCCA ACTGCGGCCG GGTCCCGCGC CGGGCAGCGT GCGGGTCACC TTCGACGCCC AGGCCGCGCC GCAGCATGTC CGCGTGCTGG CCCGCGAGAC CCCGATCGGC CAGGTGTTCC GCAACCTGAT CGACAACGCC CGCTCGTTCA GCGCCGCGGA CGGCGAGGTG CGGGTGACCC TGACCCGAGA TCCGGGCCTG CGCGACCATC ACAACCGGCT GATCGTCACG GTGGATGACG ACGGGCCGGG GATCCCGCCC GACAATCTCG AGACCATCTT CGAGCGGTTC TACACCTCGC GGCCCAAGGG CAAGGCGTTC GGCGGCAATT CGGGCCTGGG CCTGTCGATC GCCCGGCAGA TCGTCGAGGC GCACGGCGGG ACGATGCGGG CCGAGAACCG CATGGACGCC GAAGGCAAGG TGGTGGGCGC GCGCTTCCGG GTCGACCTGC CCGAAGCCAG CGCCCATCAG GCCAATCACC ATGGCGGCTC GCGCGAATGA
|
Protein sequence | MAAHPGPEGR PKRRLTWPGS ARFRLAWPRG SRLGRLIIGL NLLALAVLLG GALVLNELRS GLIKARIDSL TTQGELIANV IDLAATVGDP EPRLAPDQAS DILQSLFIPR SQRARLFDAD GNQLADSYVV ADRVESKVLP PARKPGQPAF GLPAPDAGAK PKAAEAARKA LAAEIAQAKL GEPVAGMRRA ENGERVVSVS IPIQHVRAVL GVLTLEAGDV DEIIAAERKA LLPFALIAIA TTLISSFLLN RLIAQPVLRM ASAADRVRLD GARAISLPDI SGRKDELGDL SRALEEMTDS MSERMDAIER FAADVAHEIK NPLTSIRSAI ETLDLVSDPA ARARLLAILQ QDVTRLDRLV TDISNASRLD AELSREAPRA FELNLLLAEV IHLYEAQLRP GPAPGSVRVT FDAQAAPQHV RVLARETPIG QVFRNLIDNA RSFSAADGEV RVTLTRDPGL RDHHNRLIVT VDDDGPGIPP DNLETIFERF YTSRPKGKAF GGNSGLGLSI ARQIVEAHGG TMRAENRMDA EGKVVGARFR VDLPEASAHQ ANHHGGSRE
|
| |