Gene Caul_3431 details

Gene Information

Locus tag: Caul_3431
Symbol:
ID: 5900886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3712006
End bp: 3714540
Gene Length: 2535 bp
Protein Length: 844 aa
Translation table: 11
GC content: 71%
IMG OID: 641563937
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_001685056
Protein GI: 167647393
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
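
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3712006, 3714540)
print(length, protein_length(length))  # prints: 2535 844
```

Both values agree with the Gene Length and Protein Length fields of this record.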


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.737925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGAGCGTG GCGGGGCGAG AACCGGGAGG CTGGCAGGAG CCGGTCTTTC GGCGGCGGGC 
GCGGCTCGCT CGCGCGCCGC GCGGTCGCCG TCCCAGACCT ATGTCCGTCT GGCCATTCTG
GCGGCCCTGC TGCTGTTGGC CATCTACACC GCCTTTGGGA TCAGCCGCCT CAAGCAGGAG
GCCAACGTGC CGCCGGGCGG CGCGTCCTTG GCCGCCGAGG CCAAGTTGGT GGCCGCCGCC
GCCGAAACCA ACCTGGCCGC CCAGCGCGCG GGGCTGTCGG CCGCCGCCGA CCTGCTGCAA
CGGGATCCGA GCGCACCGAT CGACGCCGCG GAAACCGCCC TGCGCGCCGC CGGCGGCGAG
GCCATGGCCG TGGCCGTGCT GGGTCCTGCC GACGTTCTGG CGGTCGCCGG CCGCGATCCG
GCGGCCGACT GGAAGGCCGC CGCCCGCGCC GCCGGAACCT CGGGCCGCAG CCTATGGATC
GGCGGCGGAA CCACTGGCCG CCTCTATGTG GTCATGGCCA CGCCGATCGA AGGCGGACGC
GGCTTCGTCA TCGCCTCCAG CGACCCATCC CGCCTAGTCG CCGAACCGGC CAAGGGCGCG
GCCAGCGCCC TGATGCTGGC CAACGGCAAG ATCCTCGCCG CCGCCGGCCG CCCCATCGAG
GGCGCGACCA CCCTGCGCGA AGCCTTTTCG CTGTCGATCG AGGATCTTGG CGACGGCCCG
GCGGCCCTCC GCGGCCAATC GGTCGATGGT CGCGCCCTCG ACGTCGCCGC GCAGCCCCTG
GCCCAGGGCG ACCTGCTGGC CGCCGCCGCC GCCCAGCCGC GCACCGTCGC CAATGTCGAC
CGCCAGATCA TGGAAGGCGC CGTTTCGCTG CTGGCCCCGC TGGCCGTCGG CATCGCCCTG
GCCCTGATGC TGATGCTGCA GAGCCGCCGG GTGGAGATGG CCCAGCGTGA ATTCGTCGAC
AGTGAGCAGC GCTTCCGCCT CGCCGTCGAG GCCGCCCGCT GCGGCATCTG GGAATGGGAG
CTGGGCGCCG ACCAGGTGTT CATGTCCGAC GTCACCGGCG CGATGTTCGG CTGGGGCGGG
GGCGGCGTAG TGGCCGGCCA GGAGTTGCTG GACCGCGTCT CGGTCGACCA TCGCGACAAG
GTCCGCCAGG CCCTGGCCAA CGCCGCGACC TATGGCGCCT TCGACGTCTC GTTCCGCGTC
CCGTCGCGCG ACGGCGGCCG CGCCATCTGG ATCGACGCTC GTGGGCAGGG CTTCGGCAAG
CCGGGCGAGG ACGGCTACTC GCGGATCATC GGCGTGGCGC TGGACGTCAC CGAGGAGCGT
CTGGCCCAGG CCCGCGCCCA GGCCGCCGAG AACCGCCTGC GCGACGCCAT CGAGAGCGTG
TCGGAGGCCT TCGTACTCTG GGACCGCCAG GGCCGGTTGC TGATGTGCAA CCGCAACTAC
CGCAACGTCT TCAATCTCGA GCCCAAGCTG CTCAAGCCCG GCGCGCCGCG CAACGAGGTC
AACCGCTTCG CCGCCCTGGC CATCAAACAC GACCAGCCCG CGCCAGACGG CGCCAAGGGC
GTACGCGAGG CCGAGCTGAA TGACGGCCGC TGGATCCAGA TCAGCGAGCG CCGCACGGCC
GAGGGCGGCT TGGTGATGAC CGCCGCCGAC ATCACCGCCA TCAAGAACCA GGAAGAGGCC
CGCCGCCTCA ACGAGGAGCA ACTCCAGCAC GCGGTCGCCG GCCTCGAGCG CTCGCAGGAA
CAACTGGCCG AACTGGCCCG CAAATACGAG ATGGAGAAGG TCAAGGCCGA GGGCGCCAAC
AAGGCCAAGA GCGAGTTCCT GGCCAATATG TCCCACGAGC TGCGCACCCC ACTGAACGCC
ATCAACGGTT TCTCCGAGAT CATGATGAAC GAGATGTTCG GCCCGCTCGG CGACCAGCGC
TACAAGGGCT ACAGCCTCGA CATCCACAAC TCCGGCCAAC ACTTGCTGGC CCTGATCAAC
GACATCCTCG ACATGTCGAA GATCGAGGCC GGCAAGATGA ACCTCAAGTT CGAGCCGCTG
AGCCTGGAGG ACGTGACCGA GGACGCCGTT CGCCTGGTGC GCAATCGCGC CGAGGCCGCC
GGCCTGAAGC TGGAGATCGA CTTCCCGCCC CTGCCCGAGG TCGAGGCCGA CTACCGCGCG
GTCAAGCAGG TGCTGCTGAA CCTGCTGTCC AACGCCATCA AGTTCACGCC CCGCGCCGGC
CGCATCGTCG TGCGGGCCGA GGTGCGTCGA GATCCGCTGG GCGAGCGCGT CCGCGTCTCG
GTCACCGACA CCGGCATCGG CATCGCCGCC GAGGACCTGG CCCGCTTGGC CCGCCCCTTC
GAGCAGGTCG AGAGCCAGCA CGCCAAGACC ACCCAGGGCA CGGGCCTGGG CCTGGCCCTG
ACCAAGTCGC TGGTCGAGAT GCACGACGGC GCGCTGGAAA TGACCTCCAC CCCGGGCGAA
GGCACCACCG TCAGCTTCAT CCTGCCGATC AGCCAGAGCG GCCTGGCGTC GCTGCGCGAC
TTCGCGGCGG CTTGA
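
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as a GC-content check. The following is a minimal sketch, assuming the sequence contains only A, C, G, and T; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is just the first two blocks of the sequence above.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
example = "TTGGAGCGTG GCGGGGCGAG"
print(clean_sequence(example))
print(gc_fraction(example))
```

Applying `gc_fraction` to the full sequence text gives a value that can be compared with the GC content field of this record.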
 
Protein sequence
MERGGARTGR LAGAGLSAAG AARSRAARSP SQTYVRLAIL AALLLLAIYT AFGISRLKQE 
ANVPPGGASL AAEAKLVAAA AETNLAAQRA GLSAAADLLQ RDPSAPIDAA ETALRAAGGE
AMAVAVLGPA DVLAVAGRDP AADWKAAARA AGTSGRSLWI GGGTTGRLYV VMATPIEGGR
GFVIASSDPS RLVAEPAKGA ASALMLANGK ILAAAGRPIE GATTLREAFS LSIEDLGDGP
AALRGQSVDG RALDVAAQPL AQGDLLAAAA AQPRTVANVD RQIMEGAVSL LAPLAVGIAL
ALMLMLQSRR VEMAQREFVD SEQRFRLAVE AARCGIWEWE LGADQVFMSD VTGAMFGWGG
GGVVAGQELL DRVSVDHRDK VRQALANAAT YGAFDVSFRV PSRDGGRAIW IDARGQGFGK
PGEDGYSRII GVALDVTEER LAQARAQAAE NRLRDAIESV SEAFVLWDRQ GRLLMCNRNY
RNVFNLEPKL LKPGAPRNEV NRFAALAIKH DQPAPDGAKG VREAELNDGR WIQISERRTA
EGGLVMTAAD ITAIKNQEEA RRLNEEQLQH AVAGLERSQE QLAELARKYE MEKVKAEGAN
KAKSEFLANM SHELRTPLNA INGFSEIMMN EMFGPLGDQR YKGYSLDIHN SGQHLLALIN
DILDMSKIEA GKMNLKFEPL SLEDVTEDAV RLVRNRAEAA GLKLEIDFPP LPEVEADYRA
VKQVLLNLLS NAIKFTPRAG RIVVRAEVRR DPLGERVRVS VTDTGIGIAA EDLARLARPF
EQVESQHAKT TQGTGLGLAL TKSLVEMHDG ALEMTSTPGE GTTVSFILPI SQSGLASLRD
FAAA