Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0222 |
Symbol | |
ID | 5897496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 237725 |
End bp | 239653 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641560706 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_001681857 |
Protein GI | 167644194 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.414219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.376796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACTTTG GCGACCTGAA AATCCCCCAC AAGCTTCTGG CGGTGTTCGC CGTGATGCTG ACCGCCATCG CGGTCATGGG GGTTACGCTC TATCTCAATC AGCGCGGGTA TGAGGCTTCG GTCGACCGCA CCGAGCGGGC CTATGAGTCC GTGCGCGCCG CCGATACCGC GGCCTTCCGC CTGACACGCC AGGAGAACTC GCTGCGGGGC TTCCTGCTCT CGGGCGACGA CTACTACGTC AAGCGCCTGG AAGAGGCCCA CAAGCCCAAG TTCCTGGCGG CCCTCGACAA GCTGCGCGCC CTGGCCAAGG GCGACGAGGC CGACCTGGCT CGCATCGCCG CCGTCGACGC CGCCTATGCC GACTACCGCA AGCTGGCCAT CGAGCCGGGC GAAGCCCTGG GCCGAGACCC GGTCACCCGG CCGCAGGCCG TTGAGCTGGT CAAGCATGAC GGCGTCGCCG ACCAGGCCGT GACGCCGGTC GAGGACGCCA TCGAAGCCAT CGTCAAGAGC TCCGAGGGCG TGCTGGCCGC CGAGGCCGCG GCGCAGAAAA AGGCCTCGTT GGCGACAACC CTGGCCCTGG CCATCGGCAT CGCCATGACC ATCGCCATCG CCCTGGGCGG CGGCTTGTTG CTGTCGGGCG TCATCGCCGC CCCTGTGGCC GCCATGACCG CCGCCATGCG TCGCCTGGCC TCGGGCGACA ACGGCGTCGA GGTACCGGCG GGCGGACGCA AGGACGAGAT CGGCCAGATG GCCGCCGCCG TCGCCACCTT CAAGGAAGCG GCGATCGAGA AGATCAGGAT CGAGGGTGTC GCCGCCGACC AGCGCCAGGC CGCCGATGCC GAGCGCGCCG CCGTGGCCGA CGAAAAGGCG CAGGTCGACC GCAGCGACGC CATCAATATC GGCGCCTTGA ACGAGGCCCT GGACCGCCTG GCGAGCGGCG ACCTGACCCA CCGGATCACC ATTCCGTTCT CGCCCAAGGC CGAGAGCCTG AAAGCCAACT TCAATGCCGC GGCCGATCGG GTGCAGGACG CCATGAAGGC CATCGCCGCC GCCACTGGCG GAGTCAACAG CGGCGCTGAC GAGATCGCCG TGGCTTCGGA CGACCTGTCG CGCCGCACCG AGCAGCAGGC CGCCAGCCTG GAAGAGACCG CCGCCGCCCT CGACGAGATC ACCGCCACGG TGCGCAAGAC CGCCGCCGGC GCCAAGGAAG CCTCGTCCGT GGTCGCCGTC GCCCGCAGCG ACGCCGAGAA GTCCGGCAGG ATCGTCAGCC AGGCGGTCAG CGCCATGACC GAGATCGAGA CCTCGTCCAA TCAGGTCAGC CAGATCATCG GCGTGATCGA CGAGATCGCC TTCCAGACTA ATCTCCTGGC CCTGAACGCC GGCGTCGAGG CGGCGCGGGC CGGCGATGCG GGCCGAGGCT TCGCGGTGGT CGCGCAGGAA GTGCGGGCCT TGGCCCAGCG CTCGGCCGAA GCGGCCCGGG AGATCAAGAC CCTGATCTCG ACCTCGACCC AACAGGTTGG GGCCGGCGTG GACCTAGTCG GCCAGACCGG CGAGGCCCTG CAGCGGATCG TCGACCAGGT GGCGTCGATC GACATCCTGG TCAACGAGAT TTCCGCCTCG GCGTCCGAGC AGTCCACCGG CCTGCATGAG GTCAACACCG CCGTGAACCA GATGGACCAG GTCGTGCAGC GCAACGCCGC CATGGTCGAG GAAGCCACCG CCGCCGCCCA TTCGCTGAAG GGCGAGGCCA ACCAGCTGTC GGCCTTGGTC GGCCGCTTCA AGGTCGGGAC CGAAGCCGCG GCGCCGGCCT CGCGCGCCAG CGCGCCAGCC CGCCCCAACA CCTCCGTCCG TCCCGGCCGC GCCGCAGCTC CCGCCTCGCG GGGCAACACG GCGCTGGCGA CCAAGAGCGA CGAATGGGAA GAATTCTGA
|
Protein sequence | MNFGDLKIPH KLLAVFAVML TAIAVMGVTL YLNQRGYEAS VDRTERAYES VRAADTAAFR LTRQENSLRG FLLSGDDYYV KRLEEAHKPK FLAALDKLRA LAKGDEADLA RIAAVDAAYA DYRKLAIEPG EALGRDPVTR PQAVELVKHD GVADQAVTPV EDAIEAIVKS SEGVLAAEAA AQKKASLATT LALAIGIAMT IAIALGGGLL LSGVIAAPVA AMTAAMRRLA SGDNGVEVPA GGRKDEIGQM AAAVATFKEA AIEKIRIEGV AADQRQAADA ERAAVADEKA QVDRSDAINI GALNEALDRL ASGDLTHRIT IPFSPKAESL KANFNAAADR VQDAMKAIAA ATGGVNSGAD EIAVASDDLS RRTEQQAASL EETAAALDEI TATVRKTAAG AKEASSVVAV ARSDAEKSGR IVSQAVSAMT EIETSSNQVS QIIGVIDEIA FQTNLLALNA GVEAARAGDA GRGFAVVAQE VRALAQRSAE AAREIKTLIS TSTQQVGAGV DLVGQTGEAL QRIVDQVASI DILVNEISAS ASEQSTGLHE VNTAVNQMDQ VVQRNAAMVE EATAAAHSLK GEANQLSALV GRFKVGTEAA APASRASAPA RPNTSVRPGR AAAPASRGNT ALATKSDEWE EF
|
| |