Gene Caul_4725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4725 
Symbol 
ID5902187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5112197 
End bp5114041 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content66% 
IMG OID641565244 
Productintegral membrane sensor hybrid histidine kinase 
Protein accessionYP_001686343 
Protein GI167648680 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.269083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0435637 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTGG GAGAACGGGA CGACCGACTG CGAAGCGACC CAATCAGGCT GGCGCTGCTT 
GGGCACGCCC ATCGCAACGC CTTGGCGACC ATGGCGGTGC AGGCGGCCGC CGCGATTGGC
GTCTGCCTGG TCGCCCAGCC CGTGGAGCGA CCCTTTCGGC TGATGTGGCT GGCGGTCGTT
CTGGCGCTGT TGGCGGTCAG GCTTTTCACT GATCGTCTGC TCGGCGCGGC CCTTGGCGGT
CGGTTGATGG CCGACCGGCT TTTGCTGCTG GCGCGAGTGC ACAGCCTGGG CCTGATCCTG
AGCGCGGGAT TGTGGGCGCT GCTGGCTTGC GTGCGAATTC CCCAGGACAG CGTCAGCGCC
CGCTACGTGC TGATCATCGT CTTGTCCGCC TTGGCCGGAG GCGCGGTGGG CGTCCTGTCG
CCGCTCAAGT GGACGGGCCG GATCTATGTT TCCCTGATCC TTCTGCCGGC CAGCCTGACC
CTGATTCTCA ATCGCGGCGT GGACGCCACC TTGGGCGTTC TCGGCGTGAT CTTCTGGATC
GTGATGATCG TGGGTCATCG CAACAACCAT GCCCTTTTGG TCGACGCCTT GCGACTGCGC
GACGAGAACC GCGAACTGTT GGCGGACGTC GCGCGGCGCA ATCACGCGAC CCTTCGCCTG
AACCATGATC TCGAAAGCAG CGTTCGCGCC CGCACGATCG AGCTTGAGCG CATGACCGAA
GAGGCCAAGG TCGCCAACCG CGCCAAGTCG CAATTCCTGG CCACGGTCAG CCACGAAATG
CGCACGCCGT TGAACGCCAT TCTGGGCGAG GGTCAATTGC TGGCGCGGGA GGCGCTGACG
CCGTCCCAGC GGAGTCGCCT CCAAGTCATC GACACCGCGT CCCGGGCGAT GCGGCACCTG
ATCGACGATG TGCTCGACAT CTCGCAGATC GAGGCGGGCG CGCTTCGGCT GAGACCTAAG
GTGTTCGCGC TCGCCACGCT CGTGGACGAT ATCCAGCAGA TCTACCGACC GCTTGCCGAA
GGGCGCGGTT TGTCGCTGAC GGTCTCGCTG CAACCGGAGA CGGCGCCGTT CCGGCGCGGC
GACCCTGATC GGCTTCGTCA GATCGTCGGT AACCTGATCG CCAACGCCTT GAAGTTCACG
CGCCAGGGCG GCGTGACCGT CAAGATCGGC GGCGACGACG AACAGCTCAC GGTTTCGGTG
AGCGATACGG GGATCGGCAT CGACGCCAAG GATCACGAGA CGATCTTCCA GCGCTTCGTT
CAGGTCGATA GTTCATCGAC GCGGGAGGCT GGCGGCATCG GCCTGGGCCT GGCCATCTGT
CGCGAACTTT CCGAACAGAT GGGAGGCTCT CTGAAGGTGA TCTCGGCGCG GGGCATCGGC
GCGCGGTTCG ATTTCAGCGC GCCGATCCCA TGCGTCCTGG CCTCCGCCCC GCTAGCCGTC
GCCGAGGACG CCGTGTCCGA CGATGGAGCG CCGGGCTCGG TCTTAGTGGT TGATGATAAT
CCCGTGAACC GCCGCATCCT GGCCGCCCTG ATGGAGCCGT TCGGCGTCGA ATGCGGCTTC
GCGACCAGCG GGAAGGAAGC CGTGGAGGCG TGGCGTCGCC AGCCCTGGGA CGCGATCTTC
ATGGACGTGC ACATGCCGGA CATGGACGGC GTCGAAGCCT CGCGGACGAT CCGCGCCGAA
GAGATTGTCG CCGGCCGCGG CAGGACGCCG ATCGTCGCCG TCACCGCCAG CGTGCTCACC
CATGAGGTGG AAGCCTATCG GCAAGCCGGT ATGGACGATG TGCTGCCCAA GCCCGTAGAC
GCTTCGGCCT TGGCCAGCAT GTTGTCGCGC TGCGCCGCGG CCTGA
 
Protein sequence
MQLGERDDRL RSDPIRLALL GHAHRNALAT MAVQAAAAIG VCLVAQPVER PFRLMWLAVV 
LALLAVRLFT DRLLGAALGG RLMADRLLLL ARVHSLGLIL SAGLWALLAC VRIPQDSVSA
RYVLIIVLSA LAGGAVGVLS PLKWTGRIYV SLILLPASLT LILNRGVDAT LGVLGVIFWI
VMIVGHRNNH ALLVDALRLR DENRELLADV ARRNHATLRL NHDLESSVRA RTIELERMTE
EAKVANRAKS QFLATVSHEM RTPLNAILGE GQLLAREALT PSQRSRLQVI DTASRAMRHL
IDDVLDISQI EAGALRLRPK VFALATLVDD IQQIYRPLAE GRGLSLTVSL QPETAPFRRG
DPDRLRQIVG NLIANALKFT RQGGVTVKIG GDDEQLTVSV SDTGIGIDAK DHETIFQRFV
QVDSSSTREA GGIGLGLAIC RELSEQMGGS LKVISARGIG ARFDFSAPIP CVLASAPLAV
AEDAVSDDGA PGSVLVVDDN PVNRRILAAL MEPFGVECGF ATSGKEAVEA WRRQPWDAIF
MDVHMPDMDG VEASRTIRAE EIVAGRGRTP IVAVTASVLT HEVEAYRQAG MDDVLPKPVD
ASALASMLSR CAAA