Gene Caul_3968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3968 
Symbol 
ID5901430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4296608 
End bp4298188 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content75% 
IMG OID641564489 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001685591 
Protein GI167647928 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.179224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGC GGTCGCGGGC CTGGGCCAGG ACCTGGTGGC CGGCTCTGCG GCTGCGCACC 
ATCCTGCTGA GCGTGCTGCT GTTCGCCGCC GCCATGCCGG CCATCGGGGC GGTGTTCCTG
CGCACCTACG AGAACACCCT GGTGCGCCAG ACCGAGGCCG AGCTGGCCAG CCAGGGCGCG
GCCCTGGCGG CGACGGCCGG CGCCCTGTGG CCCGGAGCCA TCCGCGACAC CACCCCGGCC
GATCCCGACG CCCGCGACGA CCCCGGCTAC TACCGGCCCG AGGCCACCAG CATCGACCTG
CGCGACTCGC CGGTGCTGCC CGAGCGTCCG GCCGCGCCGC CCGGCCCGCC CGCCGATCCC
CAGGCTGAGG CGGCGGCCGA GGTGCTGGAG CCGATCCTCG ACCGGACCAG CCGCAGCACC
CTGGCCTCGA TCCTCATCGT CGACCGTCAC GGCGTGGTGG TGCGCGGCCT GGGGCAGGGC
GGGAGTCTCG CGGCCCTGCC GGAGATCCAG GCGGCGCTGA AGGGCCGGTC GCGCACGGTC
CTGCGCCGCA ATGGCGGCTA TCATCCGCGC TACCGGTTCG AGTGGCTCAG TCGCGCCTCG
GCCGTGCGCC TGCACCATGC CCGGCCGATC ATCGTCAACG GCAAGGTGCA GGGGGCGCTG
CTGCTGTCGC GCTCGCCCCG GGCGCTATTC CGAGGCGTCT ACCAGGACCG GGGCAAGATC
GCGATCGGTG CGGGGGCGAC GATCCTGCTG CTGGTGCTGC TGTCGGGCCT GGTGTCGCGC
GGCGTGACCC GGCCGATCGA GGCCCTGAGC GCGGCGACCC GCGGCGTGGC CAGCGGCCAG
GGGACCGTGC CGGAGACCCC CGTCACCGCC GCCGTCGAGA TCCGCGACCT CTACCAGGAC
TTCCGGGTGA TGGCCGACGC CATCGCCGTG CGCTCGCGCT ACCTGCGCGA CTTCGCCGCC
GCCGTCAGCC ACGAGTTCAA GACCCCGCTG GCCGGGATCA CCGGGGCGGT CGAGCTGCTG
GACGACCATT TCGACACCAT GACCCCGGAC GAGCGCCGGC GGTTCCTGGG CAACATCTCG
GCCGACAGCG CCCGGCTCTC GCACCTGGTG GGCCGGCTGA TGGACCTGGC GCGGGCCGAC
ATGGCCATGC CGCAGGCGGG GGTGACGTCC GAGCTGGCCG CCGCCGCGCG GCGGGTGGCC
GACGCGCAGG GGCGCGACAT CGCCGTGGTG CTGGACCTGC CGGCTGGCCT GCCGCGGGTG
GCCGCGCCCG AGGCGACGGT GGAGACGGTG CTGACGACGC TGGTGGAGAA CAGCCGGCAG
GCGGGCGCGC GGACGGTTCG GATCGTCGCG CGGGTCGTGG GCGAGGAGGT GGTGCTGCGG
GTCAGCGACG ACGGCCCCGG CGTGCCGCCG GCTGACCGCG ACCGCCTGTT CGAGCCGTTC
TTCACCAGCC GGCGGGAGAC GGGCGGCACG GGGCTCGGGC TGTCGATCGC GCGGTCGCTG
CTGGCGGCGA GTTCCGGGCG GGTGGGGTTG GTCGAGGGCG AGGCGGGGGC GGTGTTCGAG
GTGGGGTTGT TGTGGGGGTA G
 
Protein sequence
MIERSRAWAR TWWPALRLRT ILLSVLLFAA AMPAIGAVFL RTYENTLVRQ TEAELASQGA 
ALAATAGALW PGAIRDTTPA DPDARDDPGY YRPEATSIDL RDSPVLPERP AAPPGPPADP
QAEAAAEVLE PILDRTSRST LASILIVDRH GVVVRGLGQG GSLAALPEIQ AALKGRSRTV
LRRNGGYHPR YRFEWLSRAS AVRLHHARPI IVNGKVQGAL LLSRSPRALF RGVYQDRGKI
AIGAGATILL LVLLSGLVSR GVTRPIEALS AATRGVASGQ GTVPETPVTA AVEIRDLYQD
FRVMADAIAV RSRYLRDFAA AVSHEFKTPL AGITGAVELL DDHFDTMTPD ERRRFLGNIS
ADSARLSHLV GRLMDLARAD MAMPQAGVTS ELAAAARRVA DAQGRDIAVV LDLPAGLPRV
AAPEATVETV LTTLVENSRQ AGARTVRIVA RVVGEEVVLR VSDDGPGVPP ADRDRLFEPF
FTSRRETGGT GLGLSIARSL LAASSGRVGL VEGEAGAVFE VGLLWG