Gene Caul_2345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2345 
Symbol 
ID5899800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2542533 
End bp2544062 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content64% 
IMG OID641562836 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_001683970 
Protein GI167646307 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTA GCGATGGGAA TGCCCGCGCG GCCTCGAAGG CGAGCGCGCT TGGGGCCCTG 
ATGATGCGCC AGGCGAGCGG CTACGCCATC GTGTTGCTGG ATCCGCAAGG CGTCATCCTC
GATATGAACA ACGCGACCGA GGTCGTCGGA TATTGGCCTC CCGAACAAGT GCGTGGGCGT
TCCCACGAAC TCTTCTACCC GCCCGACGGA ATTGCCGAGG GACGGCCCGC GGCCGATCTG
TCTGCCGCGG TGAAGGACAG CCTCCTGGAA CGTGACGCTT GGCGCGTTTG TCACGACGGT
TCCGAGTATC TTGCACGGCT GACGATAAAC GCGCTGTGGG ACGAGAGCGG CGTCCTGCGG
GGCTTTAGCT GCATCGCGCG CGACATCACC GTGGATGTTG CCGTGCGGGC GTCGATCGAG
GCCCGAGAGC AGCATCTGCA GTCCATACTC GCCACCGTTC CCGACGCCAT GATCATCATC
GACGAGCGAG GTACGATTAC CTCGTTCAGC GCGGCGGCCG AGCGCCTGTT CGGCTACGGG
GAAAGCGAGC TGCTCGGCAG GAACATCTCC TGTCTCATGC CGGAGCCTGA CCGCGGCCGC
CACGACCAGT ATATCGCCCA CTATCTCGAA ACCGGCGAGC GCCGCGTCAT CGGAATGGGT
CGCGTGGTGG TGGGCCAGCG CCGTGACGGG ACCACCTTTC CCATGGAGCT ATCCGTCGGC
CAGGCCGGCG AGGACGGCAG CCGGATCTTC ACCGGCTTCG TTCGGGACCT GACCGCGAAG
GAGCGTGACG AGCTCAGGCT CAAGGAACTG CAGGCGGAAC TCGTTCACGT CTCGCGACTG
AGCGCCATGG GCACTATGGC CTCCACCCTC GCCCACGAAA TCAACCAGCC TTTGACGGCG
GTGGCCAACT ACCTTGAGAC AATCCGCGAT CTGCTGATCG GCGACGCCGA GATCGACCGC
TCCCTATTGC GCGAGGCCGT TGGGGAAGCC GCGAGCGAGA CGTTGCGCGC AGGATTTATC
GTTCGCCGGC TTCGGGATTT CGTCGCGCGC GGCGACGTGG ACAAGAGCAT CGAGGACCTA
CCCCGGCTGA TCGAGGAAGC CGGCAATCTG GCTCTGGTCG GTGCGCGAGA GCGCGGCGTC
AGGAGCTTTT TCAGGTTCGA TCTCGAGGCT ACGCCGGCCC TCGTCGATCG CGTGCAGATC
CAGCAAGTCC TCGTGAACCT CATGCGCAAC GCCGTAGAGG CCATGGCGGA ATCAGAGGTG
CGAGAACTGA CCGTGTCGAC CTTGCTCCGG CCCGACGGCG CGATCGAGGT GTCGGTCGAG
GATACGGGGC CCGGCATTTC CGACGAGATC GCGCCTCGAC TGTTCCAGGC TTTCGTCAGC
AGCAAGGCCG AAGGCATGGG TCTTGGCCTG TCGATCTGCC GAACGATCAT CGAGGCGCAT
GGCGGACGCA TCTTGGCCGA CGCGTTGCCG GGCGGCGGCA CGGCCTTCCG ATTCACACTC
ATTCATGGGC GGGCGGACGA AGAGAAATGA
 
Protein sequence
MSVSDGNARA ASKASALGAL MMRQASGYAI VLLDPQGVIL DMNNATEVVG YWPPEQVRGR 
SHELFYPPDG IAEGRPAADL SAAVKDSLLE RDAWRVCHDG SEYLARLTIN ALWDESGVLR
GFSCIARDIT VDVAVRASIE AREQHLQSIL ATVPDAMIII DERGTITSFS AAAERLFGYG
ESELLGRNIS CLMPEPDRGR HDQYIAHYLE TGERRVIGMG RVVVGQRRDG TTFPMELSVG
QAGEDGSRIF TGFVRDLTAK ERDELRLKEL QAELVHVSRL SAMGTMASTL AHEINQPLTA
VANYLETIRD LLIGDAEIDR SLLREAVGEA ASETLRAGFI VRRLRDFVAR GDVDKSIEDL
PRLIEEAGNL ALVGARERGV RSFFRFDLEA TPALVDRVQI QQVLVNLMRN AVEAMAESEV
RELTVSTLLR PDGAIEVSVE DTGPGISDEI APRLFQAFVS SKAEGMGLGL SICRTIIEAH
GGRILADALP GGGTAFRFTL IHGRADEEK