Gene Caul_0236 details

Gene Information

Locus tag: Caul_0236
Symbol:
ID: 5897510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 258686
End bp: 260668
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 69%
IMG OID: 641560720
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_001681871
Protein GI: 167644208
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
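
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(258686, 260668)
print(length)                  # 1983
print(protein_length(length))  # 660
```

Both values agree with the Gene Length and Protein Length fields of this record.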


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.105118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCG ACGATCTGAA GATCTCCACC AAGGTGGCCC TGCCGGCCGT CATCCTGACC 
GTGGTGGCCC TGTCCATCAC CGGTGTCGGC GCCTGGCAGT CCAAGGTTTC GGAAGCCGCC
ACCAAGGTGC TCGTCGAACA GCGCGCCCCG GCCGAGCTGG AAGGCTCGCG GTTCAACCGC
CGGGTCGCGA CCATTGGCTA CGCCGCCTAT CGCACCATCT CCAACGACGC CGCCTCGCCC
GAGGCCAAGC AGGCCAGCGA CGAGATCGAC CTCGCCTACA AGGAGGGCAA GATCGCCCTC
GGCAAGATCA AGGCCGCCGA CCCGGCCGCC GCCAAGAAGG TCGCCGACTA TCAGGTCCGC
CTGGACCGCA TCTATTCCAG CGCCCGGCAG GGCGCGGACC TGGGCCTGCA GAACGCCAAC
GATGCGGCCA AGATGGTCAT GGGCGTGATC GATCCGGACA TCGCCAGCCT GAGCAAGGAC
GTCTCGACCT ATACCAACAC CCATAGCGAC CAGACCCGCG CCATGGTGGC CAAGGCCGCC
AAGGCGGCGT CGGCCGGCAC GCTGATGACC ATCCTGTTTG GCCTGATCGC CTCGGCCTCC
GCCCTGGTCT TCGCCCTGTG GATCGGTCGT TCGAAGATCT CGGCCCCGCT GGCCGGCCTG
TCCAAGACCA TGGAAGTCCT GGCCCAGGGC TCGGTGGACG TCGAGGTGGT GGGCGCCCTG
CGCAAGGACG AGGTCGGCGC CATGGCCCGC TCGGTCCAGG TGTTCAAGGA CAACGCCCTG
GCCCTGCGCA CCGCCGAGGC CGCCCAGCAG CGCCTGAGCG CCGAAACGGA AACCGAGCGT
CAACGCAACC AGGAAGCCGC CGAGGCCGCC GCCCGCGAGC AGGCCTTCGT GATGGAGAAC
ATCGCCACGG GCCTGACCAA GCTGGCCGAG GGCGATCTGA CCTATCGCGT CGACGCCCAG
TTCCCGCAGG CCTACCAGCG CCTGCAGAGC GACTTCAACG GCGCCATCGC CCAGATGGAA
GAGGCGATGC GCACCATCGT CCACGCCGCC AGCAGCATCG GCTCGGGCAG CGACGAGATC
GCTTCGGCCG CCGACGACCT GTCGCGCCGC AGCGAGCAGC AGGCCGCCAG CCTGGAAGAA
ACCGCCGCCG CCCTCGACGA GATCACCGCC ACGGTGAAGC GCTCGTCGGC CGGCGCCGTC
GAGGCCTCGC GCGTCGTCAC CTCGACCCGC GCCGATGCCG AACGCTCCAG CGTCGTGGTG
CGCGGCGCCG TCGAGGCCAT GAACCAGATC GAGAAGTCGT CGCAGTCGAT CAGCCAGATC
ATCGGCGTCA TCGACGAAAT CGCCTTCCAG ACCAACCTCC TGGCCCTGAA CGCCGGGGTC
GAGGCGGCTC GGGCCGGCGA TGCTGGCCGC GGCTTCGCGG TCGTGGCCCA GGAAGTGCGG
GCCCTGGCCC AGCGCTCGGC CGACGCGGCC AAGGAAATCA AGACCCTGAT CTCGACCTCC
TCGCAGCAGG TCGGCCAGGG CGTGTCGATG GTCGGCCAGA CCGGCGATGC TCTGCAGGCC
ATCGTCGGCA AGGTCAGCGA GATCGACGCC CTGGTCAGCG AGATCGCCGC CGGCGGGGCC
GAGCAGGCCA CCGGCCTCAA CGAGGTCAAC GCCGCCGTCA ACCAGATGGA CCAGACCGTC
CAGCAGAACG CCGCCATGGT CGAGCAATCG ACGGCCGCCA GCCACGCCCT GAAGGGCGAG
GCCAACAACC TGATGCAAAT GATCGGGCGT TTCCAAGTTA GCGGCGCCAG CGCCGCCGTG
CGCTCCACCA CTCGCCGCGC CGCGCCGCCG ACCCAGGTGA CCCGTCCGGC TCCGCGCCCG
ACGCTCGCCC CGGCCACCGC GGCCAACCGT CCCGGCGCCA ACCCGGTTCG CGCCGCCCAG
GCCAAGCTGG CGGCCTTCGC CGGCTCGGCC CAGCCCAGCA GCGACGACTG GGAAGAATTC
TAG
 
Protein sequence
MKFDDLKIST KVALPAVILT VVALSITGVG AWQSKVSEAA TKVLVEQRAP AELEGSRFNR 
RVATIGYAAY RTISNDAASP EAKQASDEID LAYKEGKIAL GKIKAADPAA AKKVADYQVR
LDRIYSSARQ GADLGLQNAN DAAKMVMGVI DPDIASLSKD VSTYTNTHSD QTRAMVAKAA
KAASAGTLMT ILFGLIASAS ALVFALWIGR SKISAPLAGL SKTMEVLAQG SVDVEVVGAL
RKDEVGAMAR SVQVFKDNAL ALRTAEAAQQ RLSAETETER QRNQEAAEAA AREQAFVMEN
IATGLTKLAE GDLTYRVDAQ FPQAYQRLQS DFNGAIAQME EAMRTIVHAA SSIGSGSDEI
ASAADDLSRR SEQQAASLEE TAAALDEITA TVKRSSAGAV EASRVVTSTR ADAERSSVVV
RGAVEAMNQI EKSSQSISQI IGVIDEIAFQ TNLLALNAGV EAARAGDAGR GFAVVAQEVR
ALAQRSADAA KEIKTLISTS SQQVGQGVSM VGQTGDALQA IVGKVSEIDA LVSEIAAGGA
EQATGLNEVN AAVNQMDQTV QQNAAMVEQS TAASHALKGE ANNLMQMIGR FQVSGASAAV
RSTTRRAAPP TQVTRPAPRP TLAPATAANR PGANPVRAAQ AKLAAFAGSA QPSSDDWEEF
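
As a sanity check on how the two sequence blocks relate, the following minimal sketch translates a nucleotide string codon by codon. It assumes that translation table 11 assigns the same amino acids as the standard genetic code and differs only in which codons may act as start codons, so the standard codon-to-amino-acid mapping is used. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence listed above.
print(translate("ATGAAATTCG ACGATCTGAA GATCTCCACC"))  # MKFDDLKIST
```

The output matches the first ten residues of the protein sequence. Pasting the full gene sequence into `translate` is the natural next step for checking the whole record.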