Gene Caul_4759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4759 
Symbol 
ID5902221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5143818 
End bp5145599 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content71% 
IMG OID641565278 
Productintegral membrane sensor hybrid histidine kinase 
Protein accessionYP_001686377 
Protein GI167648714 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0993289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAAGCGC ATGGCGGGGG GCCGCGCGCG ACCGGGGGCG GCGTGGACGA CAGGACGCAT 
CAGTCGGGCG AGGTGAGGGT CGCGCTCGCA CGGCGGGGCA ACCTTTACCT GCGCGTCGTG
GCGACGGTGA TGATCGCGCT GATGCTGCAG GTCTTCATCG GCATGAGCTG GACCTGGATC
TGGGCCGGCG TCTACGCGGC GGCGCAACTG CTGGAACTGC TGCTGATCCG CGCCCTGGTC
CATCGACCGC AGGCGCGACG CGGTCGCGCC CTGCGGATCG CTGTCGCCGT GATGCCGGTG
GTCACCTCGG CGATCTTCGG CTTCCTGGCC CTGCCATTGT TCGCCTCGCA CGAGCGCTTC
GCGCCCACCC TGGGCGGCCT GCTGCTGGCC GGCGCCCTGC TCAACGTCAT CATCGTCAAC
AGCAGCCTGA GGTCGGCGAC GCTCTCGGCC GCCGCGCCGC ACGTCATCTA CCTGCTGATC
GTGCCGCTGG TCGCCCGGAC GGCTAATCCG GACGCACGGC TGGCCAACGC CCTGTGGCTG
GGCGTGGGGC TGCTGATCAT CTCGGTGTTC GTGGCCGCGC GGACCCTGGA GCGGGCGCTG
AGGGCCGAGG CCGAGGCCAA GGAGGAGGCC GAGCGCCGCC GCCACGAGGC CGAAGAGGCC
GTCGCCGCCA AGTCGGCCTT CGTGGCGATG ATCAGCCACG AGCTGCGCAC GCCGATCAGC
GCCATCCTGG CCGGCGCGGC GCGCCTGCAT AGCGACGTGC CCGACGCCGC CTCCAAGAGC
CACGCCCAGC TGATCGCCGA CGCCGGGGCG ATGATGCGCA CCCTGCTCAA TGACCTGCTG
GACCTGTCGC GCCTCGACGC CGGGCGGATG GCGGTCGATC AGAGCCCGTT CGACCTGCGC
CAGGCCATGG CCGACACCCT GCGCCTGTGG CGTCCGGACG CCCAGCGCAA GGGCCTGCGT
CTGCGGGTCG AAGGCGCCGC GAATCTGCCG CGCTGGGTCG CGGGCGACTC CCTTCGCCTG
CGCCAGGTGC TCAACAACCT GCTGTCCAAC GCGATCAAGT TCACGCCGCG GGGCGCGGTC
ACCGTGCGCC TGGCCGCGCG CGACGAGGGC GAACACCTGA TCTTCGAGGC GATCGTCGCC
GACACCGGTC CGGGCCTGAC CGAGGAGCAG CTGTCGCGCC TGTTCACGCC GTTCGACCAG
TTGGCTTCCA GCGTGGTGCG CGAGCACGGC GGTTCGGGCC TGGGCCTGGT GATCAGCCGC
GAGCTGACCC GGCTGATGGG CGGCGACCTG ACCGTGACCA GCAAGCCCGG CAAGGGCTCG
CGCTTCCGCC TGCAGATCCG GGTCCTGGCC GCCGAGCCCG AGGACGACCG GACGGGCGGG
GTCTCGATCG AGGGCGCGCG GGTGCTGGTG GTCGACGACC ATGTCGTCAA TCGTCGGGCC
ATCGAGCTGG TCCTGCAGTC GTTCGGCATC CAGCCCACCC TGGCCGAGTC CGGCGAGCGG
GCCCTGGAGC TGCTGCATTC CGAGGTGTTC GACGTGCTGC TGATGGACGT CTACATGCCT
GGCATGGACG GCCGCGACGC CACGCGCCAG CTACGGGCCG GCGAGGGTCC CAATCGCGAC
ATCCCGGTGA TCGCCGTCAC GGCCTCGGCC ACCCCCAAGG ATTGGGAGGC CTGTCACGCC
GCCGGCATGA ACGCCCATGT CGCCAAGCCG ATCGACCCCA GCCAGCTGCA CGCGGCGCTC
AGCGAAGTGC TGCCGGCGGG CGCGGAGCGG GCGGTGGCTT AG
 
Protein sequence
MQAHGGGPRA TGGGVDDRTH QSGEVRVALA RRGNLYLRVV ATVMIALMLQ VFIGMSWTWI 
WAGVYAAAQL LELLLIRALV HRPQARRGRA LRIAVAVMPV VTSAIFGFLA LPLFASHERF
APTLGGLLLA GALLNVIIVN SSLRSATLSA AAPHVIYLLI VPLVARTANP DARLANALWL
GVGLLIISVF VAARTLERAL RAEAEAKEEA ERRRHEAEEA VAAKSAFVAM ISHELRTPIS
AILAGAARLH SDVPDAASKS HAQLIADAGA MMRTLLNDLL DLSRLDAGRM AVDQSPFDLR
QAMADTLRLW RPDAQRKGLR LRVEGAANLP RWVAGDSLRL RQVLNNLLSN AIKFTPRGAV
TVRLAARDEG EHLIFEAIVA DTGPGLTEEQ LSRLFTPFDQ LASSVVREHG GSGLGLVISR
ELTRLMGGDL TVTSKPGKGS RFRLQIRVLA AEPEDDRTGG VSIEGARVLV VDDHVVNRRA
IELVLQSFGI QPTLAESGER ALELLHSEVF DVLLMDVYMP GMDGRDATRQ LRAGEGPNRD
IPVIAVTASA TPKDWEACHA AGMNAHVAKP IDPSQLHAAL SEVLPAGAER AVA