Gene Caul_0222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0222 
Symbol 
ID5897496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp237725 
End bp239653 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content70% 
IMG OID641560706 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001681857 
Protein GI167644194 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.414219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.376796 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTTTG GCGACCTGAA AATCCCCCAC AAGCTTCTGG CGGTGTTCGC CGTGATGCTG 
ACCGCCATCG CGGTCATGGG GGTTACGCTC TATCTCAATC AGCGCGGGTA TGAGGCTTCG
GTCGACCGCA CCGAGCGGGC CTATGAGTCC GTGCGCGCCG CCGATACCGC GGCCTTCCGC
CTGACACGCC AGGAGAACTC GCTGCGGGGC TTCCTGCTCT CGGGCGACGA CTACTACGTC
AAGCGCCTGG AAGAGGCCCA CAAGCCCAAG TTCCTGGCGG CCCTCGACAA GCTGCGCGCC
CTGGCCAAGG GCGACGAGGC CGACCTGGCT CGCATCGCCG CCGTCGACGC CGCCTATGCC
GACTACCGCA AGCTGGCCAT CGAGCCGGGC GAAGCCCTGG GCCGAGACCC GGTCACCCGG
CCGCAGGCCG TTGAGCTGGT CAAGCATGAC GGCGTCGCCG ACCAGGCCGT GACGCCGGTC
GAGGACGCCA TCGAAGCCAT CGTCAAGAGC TCCGAGGGCG TGCTGGCCGC CGAGGCCGCG
GCGCAGAAAA AGGCCTCGTT GGCGACAACC CTGGCCCTGG CCATCGGCAT CGCCATGACC
ATCGCCATCG CCCTGGGCGG CGGCTTGTTG CTGTCGGGCG TCATCGCCGC CCCTGTGGCC
GCCATGACCG CCGCCATGCG TCGCCTGGCC TCGGGCGACA ACGGCGTCGA GGTACCGGCG
GGCGGACGCA AGGACGAGAT CGGCCAGATG GCCGCCGCCG TCGCCACCTT CAAGGAAGCG
GCGATCGAGA AGATCAGGAT CGAGGGTGTC GCCGCCGACC AGCGCCAGGC CGCCGATGCC
GAGCGCGCCG CCGTGGCCGA CGAAAAGGCG CAGGTCGACC GCAGCGACGC CATCAATATC
GGCGCCTTGA ACGAGGCCCT GGACCGCCTG GCGAGCGGCG ACCTGACCCA CCGGATCACC
ATTCCGTTCT CGCCCAAGGC CGAGAGCCTG AAAGCCAACT TCAATGCCGC GGCCGATCGG
GTGCAGGACG CCATGAAGGC CATCGCCGCC GCCACTGGCG GAGTCAACAG CGGCGCTGAC
GAGATCGCCG TGGCTTCGGA CGACCTGTCG CGCCGCACCG AGCAGCAGGC CGCCAGCCTG
GAAGAGACCG CCGCCGCCCT CGACGAGATC ACCGCCACGG TGCGCAAGAC CGCCGCCGGC
GCCAAGGAAG CCTCGTCCGT GGTCGCCGTC GCCCGCAGCG ACGCCGAGAA GTCCGGCAGG
ATCGTCAGCC AGGCGGTCAG CGCCATGACC GAGATCGAGA CCTCGTCCAA TCAGGTCAGC
CAGATCATCG GCGTGATCGA CGAGATCGCC TTCCAGACTA ATCTCCTGGC CCTGAACGCC
GGCGTCGAGG CGGCGCGGGC CGGCGATGCG GGCCGAGGCT TCGCGGTGGT CGCGCAGGAA
GTGCGGGCCT TGGCCCAGCG CTCGGCCGAA GCGGCCCGGG AGATCAAGAC CCTGATCTCG
ACCTCGACCC AACAGGTTGG GGCCGGCGTG GACCTAGTCG GCCAGACCGG CGAGGCCCTG
CAGCGGATCG TCGACCAGGT GGCGTCGATC GACATCCTGG TCAACGAGAT TTCCGCCTCG
GCGTCCGAGC AGTCCACCGG CCTGCATGAG GTCAACACCG CCGTGAACCA GATGGACCAG
GTCGTGCAGC GCAACGCCGC CATGGTCGAG GAAGCCACCG CCGCCGCCCA TTCGCTGAAG
GGCGAGGCCA ACCAGCTGTC GGCCTTGGTC GGCCGCTTCA AGGTCGGGAC CGAAGCCGCG
GCGCCGGCCT CGCGCGCCAG CGCGCCAGCC CGCCCCAACA CCTCCGTCCG TCCCGGCCGC
GCCGCAGCTC CCGCCTCGCG GGGCAACACG GCGCTGGCGA CCAAGAGCGA CGAATGGGAA
GAATTCTGA
 
Protein sequence
MNFGDLKIPH KLLAVFAVML TAIAVMGVTL YLNQRGYEAS VDRTERAYES VRAADTAAFR 
LTRQENSLRG FLLSGDDYYV KRLEEAHKPK FLAALDKLRA LAKGDEADLA RIAAVDAAYA
DYRKLAIEPG EALGRDPVTR PQAVELVKHD GVADQAVTPV EDAIEAIVKS SEGVLAAEAA
AQKKASLATT LALAIGIAMT IAIALGGGLL LSGVIAAPVA AMTAAMRRLA SGDNGVEVPA
GGRKDEIGQM AAAVATFKEA AIEKIRIEGV AADQRQAADA ERAAVADEKA QVDRSDAINI
GALNEALDRL ASGDLTHRIT IPFSPKAESL KANFNAAADR VQDAMKAIAA ATGGVNSGAD
EIAVASDDLS RRTEQQAASL EETAAALDEI TATVRKTAAG AKEASSVVAV ARSDAEKSGR
IVSQAVSAMT EIETSSNQVS QIIGVIDEIA FQTNLLALNA GVEAARAGDA GRGFAVVAQE
VRALAQRSAE AAREIKTLIS TSTQQVGAGV DLVGQTGEAL QRIVDQVASI DILVNEISAS
ASEQSTGLHE VNTAVNQMDQ VVQRNAAMVE EATAAAHSLK GEANQLSALV GRFKVGTEAA
APASRASAPA RPNTSVRPGR AAAPASRGNT ALATKSDEWE EF