Gene Caul_1640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1640 
Symbol 
ID5899095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1720686 
End bp1722488 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content73% 
IMG OID641562129 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001683267 
Protein GI167645604 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0736059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0470065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCT CCGGCGTGAT GCGCGCGTTC GGCGGGGCGC TGGCCGTGGG GGTGATCCTG 
GCCGGCGGCG TCGCCTTGTT CGCGATCCAG GGCCTGAGGG TCGGCGGGCC GTTGTTCCGC
CAGATCGTCG ACAACAAGGA TATCGTCGCC GACATCCTGC CGCCGCCGCT CTACGTGGTC
GAGGCCGACC TCGTCGCCAC CAAGCTGGTG GCCCATCCCG AGACCGTGGA CGCGGCCGCC
GACAAGCTGG CGGCCCTGCG CAAGGACTAC GACACCCGCC GCGCCTACTG GACCACCGCG
CCGATCGACC CGGCCATGCG CGAGATGCTG GTCAACAAGT CCGCCGCGCC CGCCGCCGAT
TTCTGGCGCG AGATCGACCA GGAAATCCTG CCGGCCGTCC GGGCCGGCGA CCTGGACCGC
GCTCGCCAGG GCCTGGCCCG CGCCGACGCC GCCTACGAGG CTCACCGCGC CGTGATCGAC
AAGGTCGTGG TCATGGCCAA TCGCGACGCC GCGGCGGTCG AGGCCGCGGC CGTGCGCCAG
ACCCGTCTGG CCTTCATCCT GCTGGCCGGG GCGGGCGCCC TGGTGTTGCT GGTGGTCGGG
GGCGGGATCG CCCTGATGAG CCGGCGCATC GTCCAGCCCA TCCAGGCCAT GACCGGCTTC
ATGGGCCGGC TGGCGGCCGG CGACTACGAC AGCGCCGTGC CGTTCGCCGG TCGCGGCGAC
GAGATCGGCG GCATGGCCAC CTCGGTGGCG GTGTTCCGCG AGGCGATCGT CGCGCGGCGG
GCGGCCGAGG ATCGTATCGC CGCCGAACGC GACGCGACCG AGTCCGCCAA GACCCGGCAT
GACGCCGAGC GGCGCGCCGA GGAGGCCGGA CGCCTGCAGG TCGTCGAGGC CCTGGGCGCC
GCGGTCGCCA GGCTCAAGGC CGGCGACCTG ACGGCGCGGC TCGATAAGGC CTTCCCCAAC
GCCTATGAGG GTCTGCGGGG CGACTTCAAC CGCGCCATCG CCGGGCTGGA GGCCGCCCTG
AGCGAGGTGG TCGGCGGAGC CGGCGGCCTG AGAGCCGGGG TCGGCGAGAT CAGCGGCGCG
ATCAATGATC TATCGGCCCG CACCGAGCGC CAGGCGGCCA ACCTGGCCGA GACGGCGGCG
GCCCTGGGCG AAGTCAGCGC CAGCGTCAGG CGAACCGCCG AGGGAGCGGA CGAGGCCAAT
GCCGCGGTGA TCGACAGCCG GGCGGTGGCC GATCGCTCCA GCCAGGTGGT CGGCCGCGCG
GCCCAGGCCA TGACCCAGAT CGAAACCTCG TCGGCCAAGG TCTCGCAGAT CCTGGGCGTG
ATCGACGAGA TCGCCTTCCA GACCAATCTT CTGGCCTTGA ACGCCGGGGT CGAGGCGGCC
CGGGCCGGAG ACGCGGGGCG CGGCTTCGCG GTCGTCGCCC AGGAAGTCCG GGCCCTGGCC
CAGCGCTCGG CCGAGGCGGC CAAGGAGATC AAGGTGCTGA TCTCGGAATC GTCGGACCAT
GTCCGCGCGG GCGTGACCCT GGTCGGCGAG GCCGGGACCG CCCTGGGCGA CATCGTCGAG
CGGATCGGCC GGATCGGCGG ACTGATGGGT CAGATCGCCC AGTCCAGCCG CGAACAGGCG
GCGGGCCTGG CCGAGGTCGA CCTGGCGGTT GGCCAGATGG ACCAGGTCAT CCAGCAGAAC
GCCGCCATGG CCGAGCAGGC CAGCGCCGCC AGCCGTGGCC TGGCCGACGA CGCCGTGCGC
CTGGAGGCCC AGGTCGAGCA CTTCCAGGTC TCGACCGACC AGATGGCCTG GGCGGCTGCC
TAA
 
Protein sequence
MKISGVMRAF GGALAVGVIL AGGVALFAIQ GLRVGGPLFR QIVDNKDIVA DILPPPLYVV 
EADLVATKLV AHPETVDAAA DKLAALRKDY DTRRAYWTTA PIDPAMREML VNKSAAPAAD
FWREIDQEIL PAVRAGDLDR ARQGLARADA AYEAHRAVID KVVVMANRDA AAVEAAAVRQ
TRLAFILLAG AGALVLLVVG GGIALMSRRI VQPIQAMTGF MGRLAAGDYD SAVPFAGRGD
EIGGMATSVA VFREAIVARR AAEDRIAAER DATESAKTRH DAERRAEEAG RLQVVEALGA
AVARLKAGDL TARLDKAFPN AYEGLRGDFN RAIAGLEAAL SEVVGGAGGL RAGVGEISGA
INDLSARTER QAANLAETAA ALGEVSASVR RTAEGADEAN AAVIDSRAVA DRSSQVVGRA
AQAMTQIETS SAKVSQILGV IDEIAFQTNL LALNAGVEAA RAGDAGRGFA VVAQEVRALA
QRSAEAAKEI KVLISESSDH VRAGVTLVGE AGTALGDIVE RIGRIGGLMG QIAQSSREQA
AGLAEVDLAV GQMDQVIQQN AAMAEQASAA SRGLADDAVR LEAQVEHFQV STDQMAWAAA