Gene Caul_0620 details

Gene Information

Locus tag: Caul_0620
Symbol:
ID: 5898075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 684376
End bp: 686313
Gene length: 1938 bp
Protein length: 645 aa
Translation table: 11
GC content: 69%
IMG OID: 641561102
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001682251
Protein GI: 167644588
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase; [COG0784] FOG: CheY-like receiver
TIGRFAM ID: [TIGR00229] PAS domain S-box
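
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 684376
END_BP = 686313
STOP_CODONS = 1  # assumption: one terminal stop codon, excluded from the protein


def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - STOP_CODONS
    return gene_length, protein_length


gene_bp, protein_aa = cds_lengths(START_BP, END_BP)
print(gene_bp, protein_aa)  # 1938 645
```

Both results match the Gene length and Protein length fields reported in the record.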


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGGCC AAGGCTCAGG GGACCACGAC GGGTACGACA CCCTCGACAT CCTGAACGCC 
AGCCAGGACT GCATCTGGAT TCTCGGACTC GACGGCGCCG TCGAACAGAT GAACCACCGC
GCCGAGACAT TGTTTCCGGC CCAGACCGTC GACAGGGCCA ACTGGCGGCG TATCTGGCCC
GAGGAAAGCC GGTTCTCCCT GGATCGGGCG CTCCGCGCCG CCCTGACCGG GCAGGTCGCC
AAGTTCCGGG CTTTCCTGGG CCACGCCCGG GACTCGCGCA CCTATTGCGA CACCACCATC
GCGCCGGTTC GCGACCGCGA CGGCTTGGTC ATCCGCCTGC TGGCGACGGC GCGAGACGTC
ACCCGCGAGG TCGAGACCCA GTCCTTCCTG CGCACCGTGA TCCAGATGCT GCCGTCGCCG
CTGACGGTGA AAAACGTCGG GGACGGCCGC TATGTCCTCG TCAACCGCGC CGCCGAGGAC
GCCTTCGGCC TAGTCGCCGA CGAGGCGCTG GGCCGGACGG CCGCCGAGGT GCTCGATCCC
GACCGCGCCG GGCGGATAGC CATGGCCGAA GCCCTGGTGC TGAGCACCGG AGAAATGCAG
ATGTCCGAGG ATCGGGTCGG CGAGGACGCC GACGCGGCCA CGCGCCACTT CCTGACCAAG
GTCCTGGCGA CCTATGACGA CATGGGTCCC CGGCACCTGA TCACCTTGAG CACCGACGTG
ACGGCGCAGC GGGCCGCCGC CACGTCCCTG CGCCTGGCGC TGGAGGCGGC CGAGCAGGCC
AGCCTCGCCA AGAGCACCTT CCTGGCCAAT ATGAGCCACG AGATCCGCAC GCCGCTGAAC
GGCATCGTCG CCGGGGCCGA CATCCTGGCC CGCAGCGAGC TGACGCCCCG AGTCCGCGAG
TTGGTGGACA TCATCCAGAC GTCCGGCAAG AGCCTGGAGC GACTGCTGTC GGAGGTGCTC
GACCTGGTCC GCATCGAGGC CGGCCAGGTG ACGATCGAGA CCGGCGTCTT CCATCTGGGC
GACCTGACGC GCTCGGTCGC CGCGCTCTGC GCCCTGCACG CCGGCGAAAA GGGCGTGGCG
TTGGAGACCC GGATCGCGCC CGCCGCCGAC ATCGCCGTTA TCGGCGACGG CGCCCGGGTG
CGTCAGGTCC TGACCAACCT GGTCAGCAAC GCCGTCAAGT TCACCGACCA GGGACAGGTG
GCGGTCGAGG TCGATCTCGG CGCGGACGGC CAGACCCGGA TCAGCGTCCG CGACACTGGC
GTCGGCTTCG ACCCCGCCGA GAAGGCGCGG ATCTTCGGTC GGTTCCAGCA GGCCGACGCC
TCGTTCACCC GCCGGTTTGG CGGGACGGGC CTGGGCCTGA CCATCTCGCG CGAGCTGGTG
GAGCTGATGG GCGGGACGCT GTCTTGTGAC AGCCGTCCGG GCGCCGGCTC GACCTTCTGG
TTCGACCTGC CGCTGGCCGC CGTCGACGGA TCAACCGCGA ACGCCGCTAT CGACGACGAC
CCGGCCGAGC CGCCGCAAGG CGTGCCTCGC ATCCTGGTCG CCGACGACCA TCCGACCAAC
CGCAAGATCG TCGAGTTGAT GCTGGCCGAG GTGGCCGAGA TCTTCACCGC CGAGAACGGT
CAGGAGGCGG TAGACCTCTT CGAGGTCGCC CAGCCCGACC TGATCCTGAT GGACATGCAG
ATGCCGGTGA TGGACGGGCT GGACGCGGTC CGCGAAATCC GCCGCCTGGA GGCCGCCGCC
GGCCGGGCGC GGGTTCCGAT CGTGATGCTG ACGGCCAACG CCCGACCCGA GCACGTCCGC
GCCAGTCAGG AAGCCGGGGC CGACTTGCAT CTCGAAAAGC CGATCACCCG CGCCACGCTT
CTGGCCGCCA TCCAGCGCGC CTTCGAGACC GTCGAGGCCG AAAATCACCC CCTCCGGTCG
AAGCCTAAGA CCTATTAA
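
The GC content field in the record can in principle be recomputed from the gene sequence as the fraction of G and C bases. The following is a minimal sketch, assuming the sequence is pasted as shown (blocks of ten bases separated by whitespace); `gc_content` is a hypothetical helper name, and the example only uses the first 20 bases of the sequence.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces, newlines) between blocks of bases is ignored.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First 20 bases of the gene sequence above.
first_20 = "GTGAACGGCC AAGGCTCAGG"
print(f"{gc_content(first_20):.0%}")  # 65%
```

Passing the full gene sequence to the same function gives the value to compare against the reported GC content.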
 
Protein sequence
MNGQGSGDHD GYDTLDILNA SQDCIWILGL DGAVEQMNHR AETLFPAQTV DRANWRRIWP 
EESRFSLDRA LRAALTGQVA KFRAFLGHAR DSRTYCDTTI APVRDRDGLV IRLLATARDV
TREVETQSFL RTVIQMLPSP LTVKNVGDGR YVLVNRAAED AFGLVADEAL GRTAAEVLDP
DRAGRIAMAE ALVLSTGEMQ MSEDRVGEDA DAATRHFLTK VLATYDDMGP RHLITLSTDV
TAQRAAATSL RLALEAAEQA SLAKSTFLAN MSHEIRTPLN GIVAGADILA RSELTPRVRE
LVDIIQTSGK SLERLLSEVL DLVRIEAGQV TIETGVFHLG DLTRSVAALC ALHAGEKGVA
LETRIAPAAD IAVIGDGARV RQVLTNLVSN AVKFTDQGQV AVEVDLGADG QTRISVRDTG
VGFDPAEKAR IFGRFQQADA SFTRRFGGTG LGLTISRELV ELMGGTLSCD SRPGAGSTFW
FDLPLAAVDG STANAAIDDD PAEPPQGVPR ILVADDHPTN RKIVELMLAE VAEIFTAENG
QEAVDLFEVA QPDLILMDMQ MPVMDGLDAV REIRRLEAAA GRARVPIVML TANARPEHVR
ASQEAGADLH LEKPITRATL LAAIQRAFET VEAENHPLRS KPKTY
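
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is one way to reproduce that relationship, not the database's own pipeline. It assumes that the amino acid assignments of table 11 are the same as in the standard code and that an initiation codon such as GTG at the first position is read as methionine; the names `translate` and `is_cds_start` are illustrative.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    If is_cds_start is True, a recognized initiation codon at position 0
    is translated as M regardless of its usual amino acid.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAACGGCC AAGGCTCAGG GGACCACGAC"))  # MNGQGSGDHD
```

The output matches the first ten residues of the protein sequence. Translating the last 18 bases with `is_cds_start=False` gives `KPKTY`, the final five residues, followed by the TAA stop codon.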