Gene Caul_4629 details


Gene Information

Locus tag: Caul_4629
Symbol:
ID: 5902091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5006987
End bp: 5008666
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 71%
IMG OID: 641565148
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001686247
Protein GI: 167648584
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
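
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5006987
end_bp = 5008666

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1680 bp) and Protein Length (559 aa).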


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.119322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCGC ATCCCGGGCC AGAGGGCAGG CCGAAGCGCC GGTTGACCTG GCCCGGGTCC 
GCCCGCTTCA GGCTTGCCTG GCCCAGGGGT TCGCGCCTGG GCCGGCTGAT CATCGGGCTC
AACCTGCTGG CCCTGGCGGT CCTGCTGGGC GGAGCCCTGG TGCTCAACGA GCTGCGCAGC
GGCCTGATCA AGGCGCGCAT CGACAGCCTG ACGACCCAGG GCGAGCTGAT CGCCAATGTG
ATCGACCTAG CCGCGACGGT GGGCGATCCC GAACCGCGGC TGGCGCCCGA CCAGGCCAGC
GACATCCTGC AGAGCCTGTT CATCCCGCGC TCGCAGCGGG CGCGGCTGTT CGACGCTGAC
GGCAACCAGC TGGCCGACTC CTACGTCGTC GCCGACCGGG TGGAGTCCAA GGTCCTGCCC
CCGGCTCGCA AGCCGGGCCA GCCGGCCTTC GGCCTGCCGG CCCCGGACGC CGGCGCCAAG
CCCAAGGCCG CCGAGGCCGC GCGCAAGGCC CTGGCCGCCG AGATCGCCCA GGCCAAGCTG
GGCGAGCCGG TGGCGGGAAT GCGCCGGGCC GAGAACGGCG AGCGGGTGGT GTCGGTGTCG
ATCCCCATCC AGCATGTGCG CGCGGTCCTG GGCGTCCTGA CGCTTGAAGC GGGGGATGTC
GACGAGATCA TCGCCGCCGA GCGCAAGGCC CTGCTGCCCT TCGCCCTGAT CGCCATCGCC
ACCACCCTGA TCTCGTCGTT CCTGCTCAAC CGTCTGATCG CCCAGCCGGT GCTGCGGATG
GCCAGCGCCG CCGACCGCGT GCGGCTGGAC GGGGCGCGGG CCATCTCCCT GCCAGACATC
TCCGGCCGCA AGGACGAGCT GGGCGACCTG TCGCGCGCCC TGGAGGAGAT GACGGATTCA
ATGTCCGAGC GGATGGACGC CATCGAGCGC TTCGCCGCCG ACGTCGCCCA CGAGATCAAG
AACCCCCTGA CCTCGATCCG CTCGGCGATC GAGACCCTGG ACCTGGTGTC CGACCCTGCC
GCCCGGGCCC GGCTGCTGGC CATCCTGCAG CAGGACGTCA CCCGGCTGGA CCGGCTGGTC
ACCGATATCT CCAACGCCTC GCGGCTGGAC GCCGAGCTGT CGCGCGAGGC GCCGCGGGCC
TTCGAGTTGA ACCTGCTGCT GGCCGAGGTG ATCCATCTCT ACGAGGCCCA ACTGCGGCCG
GGTCCCGCGC CGGGCAGCGT GCGGGTCACC TTCGACGCCC AGGCCGCGCC GCAGCATGTC
CGCGTGCTGG CCCGCGAGAC CCCGATCGGC CAGGTGTTCC GCAACCTGAT CGACAACGCC
CGCTCGTTCA GCGCCGCGGA CGGCGAGGTG CGGGTGACCC TGACCCGAGA TCCGGGCCTG
CGCGACCATC ACAACCGGCT GATCGTCACG GTGGATGACG ACGGGCCGGG GATCCCGCCC
GACAATCTCG AGACCATCTT CGAGCGGTTC TACACCTCGC GGCCCAAGGG CAAGGCGTTC
GGCGGCAATT CGGGCCTGGG CCTGTCGATC GCCCGGCAGA TCGTCGAGGC GCACGGCGGG
ACGATGCGGG CCGAGAACCG CATGGACGCC GAAGGCAAGG TGGTGGGCGC GCGCTTCCGG
GTCGACCTGC CCGAAGCCAG CGCCCATCAG GCCAATCACC ATGGCGGCTC GCGCGAATGA
 
Protein sequence
MAAHPGPEGR PKRRLTWPGS ARFRLAWPRG SRLGRLIIGL NLLALAVLLG GALVLNELRS 
GLIKARIDSL TTQGELIANV IDLAATVGDP EPRLAPDQAS DILQSLFIPR SQRARLFDAD
GNQLADSYVV ADRVESKVLP PARKPGQPAF GLPAPDAGAK PKAAEAARKA LAAEIAQAKL
GEPVAGMRRA ENGERVVSVS IPIQHVRAVL GVLTLEAGDV DEIIAAERKA LLPFALIAIA
TTLISSFLLN RLIAQPVLRM ASAADRVRLD GARAISLPDI SGRKDELGDL SRALEEMTDS
MSERMDAIER FAADVAHEIK NPLTSIRSAI ETLDLVSDPA ARARLLAILQ QDVTRLDRLV
TDISNASRLD AELSREAPRA FELNLLLAEV IHLYEAQLRP GPAPGSVRVT FDAQAAPQHV
RVLARETPIG QVFRNLIDNA RSFSAADGEV RVTLTRDPGL RDHHNRLIVT VDDDGPGIPP
DNLETIFERF YTSRPKGKAF GGNSGLGLSI ARQIVEAHGG TMRAENRMDA EGKVVGARFR
VDLPEASAHQ ANHHGGSRE
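
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in the set of permitted start codons). The `translate` helper is hypothetical and written only for this illustration; the input is the first 30 bases of the gene sequence above with the spacing removed.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Hypothetical helper for illustration: whitespace is ignored and any
    trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence listed above.
first_30 = "ATGGCCGCGC ATCCCGGGCC AGAGGGCAGG"
print(translate(first_30))  # MAAHPGPEGR
```

The result matches the first ten residues of the protein sequence. The gene's final codon, TGA, translates to `*` under the same table, which fits a protein that is one residue shorter than the number of codons.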