Gene Caul_4551 details

Gene Information

Locus tag: Caul_4551
Symbol:
ID: 5902012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4926712
End bp: 4928154
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 72%
IMG OID: 641565070
Product: histidine kinase
Protein accession: YP_001686169
Protein GI: 167648506
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
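
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4926712
end_bp = 4928154
gene_length_bp = 1443
protein_length_aa = 480

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp, residues == protein_length_aa)
```

Under these assumptions the span is 1443 bp, which is 481 codons, giving 480 residues plus one stop codon.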


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.412829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATGG CCTCGCCGTC TTCGTCAGCC CGTTCCGACG GGTCCGTTCC CACGCCGCTC 
GGCCTGTGGG CCGCGGTGCT GGTGGGACCC GCGGCGCTTC TGGTCCTGGC CGCGACGGGG
GCGGCCAATC CCGGCGCGGC GGTTCCGTTC GCCCTGGCCA GCGGCGCGGC CGGGCTGTTG
CTGGTGCGCC GAGCCCAGCG TCGGACGCCG GACCGGGCGG CCCCGGTCCC GCTGCCGGCC
GTGGTCGACC AACCGCCGCC GTTCGGCCTG ATTCTCGAAA CCCTGCCCGA TCCGCTGATG
GTCATCGCCG CCGAGGAGGC CGACGACCTG ACCGGCCGGC GATTCGTGTT CGCCAACGCC
GCGGCGCGCG ACCTGTTCAA GCTGCAGCCG CGCGGCGGGC TGCTGGTCTC GGCCATGCGC
AGCCCGCAGG TGCTGGAAGC GGTGGACGAA AGCCTGTTCG GCGGCGTGCG GCGCTCGGTC
GACTATGTCG GCGGCGGCGC CCAGGGGCGG GAATGGGCGG CGCACTCCGC GCCGCTGGGC
GTCGATGAGC GCGGCTCGCG CCTAGCCCTG CTGGTGCTCA GCGACGAGAC CGACACCCGT
CGCAGCGAGC GCACCCGGGC CGACTTCCTG GCCAACGCCA GCCACGAGCT GCGCACGCCC
TTGGCCTCGC TGTCGGGCTT CATCGAGACC CTGCGCGGCC ACGCCAAGGA CGATGTCGGG
GCGCGCGACA AGTTCCTGGG CATCATGCAG GCCCAGGCCG AACGGATGGC CCGGTTGATC
GACGACCTGA TGAGCCTGTC GCGCATCGAG CTCAACGAGC ACATCGCGCC GCTTGGCCAG
GTCGACCTGG CCATGGCGAC GATCGACGTG CTCGACGCCC TGGCTCCCCA GGCCAAGGAC
AAGGCCGTGA GCTTCGATCC CATCCTGCCG CCGCGCGGCG CGGCCGTGGT CGAGGGCGAT
CGGGACCAGA TCGTCCAGGT GATCCAGAAC CTCATCGACA ACGCCATCAA ATATACGCCC
CGCCACGGCG CGGTGCGGGT GGAGGTGTTT TCGGGCCTGA CCGCCGACAT GGCCGCCGCG
CCGCGCGACC CCGCCGCCGC GCGGATGTCG CTGCTGACCC CCGATCACGC GGTCGAGGAG
CGCTACGCGT CATTCCGGGT CAGCGACAAG GGGCCAGGCA TGGCCCGCGA GCACCTGCCG
CGCCTGACCG AGCGATTCTA TCGGGTCGAG GGCCAGAAGA GCGGCGAACG CTCGGGCACG
GGCCTGGGCC TGGCCATCGT CAAGCACATC ATGAACCGCC ACCGCGGCGG CATGACGGTG
GAGAGCGTGC AGGGCGCGGG CGCGACGTTC GGGGTCTATT TTCCCATGGC CAAGGTGGTC
CCGGAGAAGA TCCGCGCCTT GCCGGAGGCC GCCGGGACGG ACGCTGTCGC AAAACCGTCG
TGA
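
The gene sequence is printed in blocks of ten bases, so it has to be joined before any composition statistic such as GC content can be computed. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not state how its own GC content value was calculated.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a small example.
first_line = "ATGCCGATGG CCTCGCCGTC TTCGTCAGCC CGTTCCGACG GGTCCGTTCC CACGCCGCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field of the record.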
 
Protein sequence
MPMASPSSSA RSDGSVPTPL GLWAAVLVGP AALLVLAATG AANPGAAVPF ALASGAAGLL 
LVRRAQRRTP DRAAPVPLPA VVDQPPPFGL ILETLPDPLM VIAAEEADDL TGRRFVFANA
AARDLFKLQP RGGLLVSAMR SPQVLEAVDE SLFGGVRRSV DYVGGGAQGR EWAAHSAPLG
VDERGSRLAL LVLSDETDTR RSERTRADFL ANASHELRTP LASLSGFIET LRGHAKDDVG
ARDKFLGIMQ AQAERMARLI DDLMSLSRIE LNEHIAPLGQ VDLAMATIDV LDALAPQAKD
KAVSFDPILP PRGAAVVEGD RDQIVQVIQN LIDNAIKYTP RHGAVRVEVF SGLTADMAAA
PRDPAAARMS LLTPDHAVEE RYASFRVSDK GPGMAREHLP RLTERFYRVE GQKSGERSGT
GLGLAIVKHI MNRHRGGMTV ESVQGAGATF GVYFPMAKVV PEKIRALPEA AGTDAVAKPS