Gene Caul_0631 details


Gene Information

Locus tag: Caul_0631
Symbol:
ID: 5898086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 698559
End bp: 700064
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 71%
IMG OID: 641561113
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001682262
Protein GI: 167644599
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
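
The start, end, and length fields above agree with each other if the coordinates are read as inclusive 1-based positions and the protein length excludes the stop codon. A minimal Python sketch of that arithmetic follows; both reading conventions are assumptions, and the function names are illustrative rather than part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = gene_length(698559, 700064)
print(length, protein_length(length))  # 1506 501
```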


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.22133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.283314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGAAGTG GAGACCGCTA CCACGTGATG AACACGGTTT CGGCCGGCTC GCTGCACAGC 
GACCTGCGGC CCCGGGCCTA TCTGGCGGCG ATAGTCACGG TGCTGGCGTG CTGCCTGGTG
CAATTGGCCT TGCCGCCCGG CTTCACCGCC CCTTCGGCCT TCCTGCTGTT CGTGCCGGCG
GTGCTGATCA GCGCGGCGGT CGGCGGCCTG GCGCCGGGTC TGTTCGCCAC GGTGCTGGCG
GCCGGGGCGG TGTGGCTGCT GAAGGTGCGC GGCGTTCCCG ACCAGGCCAC GGCGTTGTCC
AGCCTGGTGT TCGTGATGAT CGGTTTCGGC ATGTCGGTCG GCGGCGGCTG GTTCCACGCC
GCCCGGGCCC GCGCCGCCGC CATGACCCAT CACCTGCAGT CGATCCTCGA CTCGGCCCCC
GACGCGGTGA TCGTCATCGA CCCGGCCGGC CTGATGACCT CGTTCAGCCC CGCCGCCGAG
CGGCTGTTCG GCTGGACCTC GGCCGAGGCG ATCGGCCGGA ACGTCAGCCT GCTGATGCCC
GACCCCGACG GCGCCGGCCA CGACGGCTTC CTGGCCAACT ACAGCCGCAG CGGCGAGAAG
CGGATCATCG GCACGGGCCG CGTCGTGGTG GGCAAGCGCA GGGACGGCTC GACCTTCCCG
ATGGAGCTGG CGGTCGGCGA GACGCGGGGC GCGCGGCCGT TCTACACAGG CTTCATCCGT
GACCTGACCG ATCGCCAGCA GACCGAGGCC CGGCTGCGCG ACCTGCAGAC CGAACTGGTC
CACGTCTCGC GCCTGACCGC CATGGGCGAG ATGGCCTCGA CCCTGGCCCA CGAGTTGAAC
CAGCCGCTGT CGGCGATCGC CAACCTGCTG ACCGGCTCGC GCCGCCTGCT CGACCGCGGC
CGCCCCGAGG ACCAGGCCAA GGTGCGCGAC GCCGTCGACA AGGCCTCGGC CCAGGCCCTG
CGCGCCGGCG ACGTCATCCA CCGCATGCGC GAGTTCGTCC GGCGGGGCGC GACCGAACGC
GCGCCGGAAA GCCTGTCCAA GGTGGTCGAG GACGCCGCCG CCCTGGCCTT GATCGGGGCT
CGCGAGCACT TGGTCCAGAC ACGCCTGCAA CTGGATCCCG CCGCCGACGC CGTCTATGCC
GACCGCGTGC AGATCCAGCA GGTTCTGGTC AATCTGATCC GCAACGCCGT CGACGCCATG
GCCGACTCGC CGCGCCGCGA ACTGACCATC GCCAGCCAGC GGCTCGCCAA CGGTTCGGTC
CAGGTGAGCG TCACCGACAC CGGCTCGGGG ATCAGCGACG ACTTCCGCGA GCGCCTGTTC
CAGCCGTTCA TGACCACCAA GGCCGAGGGC ATGGGGGTGG GCCTGTCGAT CTCGCGCTCG
ATCGTCGAGG CGCATGGCGG CAAGATCTGG GCCGACGCGA ACCCCACGGG CGGGACGGTG
TTCCACTTCA CCCTGCCGCC CCGCCGCGAC AAAATCGAAG AGCATGGGAA ACCGATCGAT
GAGTGA
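
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a small sketch, assuming GC content means the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper, and the example uses only the first block of the gene rather than the full sequence.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


first_block = "TTGCGAAGTG"  # first 10 bases of the gene sequence
print(f"{gc_content(first_block):.0%}")  # 50%
```

Passing the complete gene sequence, spaces and line breaks included, gives the value to compare against the reported percentage.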
 
Protein sequence
MRSGDRYHVM NTVSAGSLHS DLRPRAYLAA IVTVLACCLV QLALPPGFTA PSAFLLFVPA 
VLISAAVGGL APGLFATVLA AGAVWLLKVR GVPDQATALS SLVFVMIGFG MSVGGGWFHA
ARARAAAMTH HLQSILDSAP DAVIVIDPAG LMTSFSPAAE RLFGWTSAEA IGRNVSLLMP
DPDGAGHDGF LANYSRSGEK RIIGTGRVVV GKRRDGSTFP MELAVGETRG ARPFYTGFIR
DLTDRQQTEA RLRDLQTELV HVSRLTAMGE MASTLAHELN QPLSAIANLL TGSRRLLDRG
RPEDQAKVRD AVDKASAQAL RAGDVIHRMR EFVRRGATER APESLSKVVE DAAALALIGA
REHLVQTRLQ LDPAADAVYA DRVQIQQVLV NLIRNAVDAM ADSPRRELTI ASQRLANGSV
QVSVTDTGSG ISDDFRERLF QPFMTTKAEG MGVGLSISRS IVEAHGGKIW ADANPTGGTV
FHFTLPPRRD KIEEHGKPID E