Gene Caul_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1074 
Symbol 
ID5898529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1132131 
End bp1133576 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content72% 
IMG OID641561556 
Productsignal transduction histidine kinase 
Protein accessionYP_001682702 
Protein GI167645039 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.255381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.984755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCAAGC CGGTGAGACC CATCGATCCC GTCGGGGCGC CCGACCCGTC GCCGGTGCTG 
ACCGCCCTGC GCCAGGCCCC GATCGGCATC GCCATCTTCG ACCGCGACAT GCGCTATCTG
TTCGCGTCGG GGCGCTATCT GTCCGATCAG GATCTACCGA TCGACCTGCC CCTGGTCGGA
CGGCTGCACT ACGACGTCTT CCCCGGCATT CCCCAGGTCT GGCGCGACAT CCACGCCCAG
GCGCTCCGCG ACGGGATCGA GCGCAACCAC CCGGGCGAGC GCTTCGTCAG GCCCGACGGC
GGGATCGACT GGGTCCGCTG GTCGGTGGCC CCCTGGCGCG CCGACGGCGG CGAGATCGGC
GGGCTGGTGC TCTATACCGA GTTGGTGACG GCCGACGTCG AGGCGCGGAT GGCCCTGGAA
GCGGCCGAGG CCCGCTACCG GGCGGTGTTC GACCAGGCGG CGATGGGCGT GGCCCGCGTG
GCGCTGGACG GCCGGTTCCT GGAGGTCAAC GACCGCTACT GCCAGATCGT CGGCCACGAC
CGCGACGCGC TGCTGGCCGG AGACTTCCAG ACGATCACCC ACGCCGACGA CCTGGAGAAG
GACATGGCCC TGGTCCAGGC CCTGTTGGCC GGCGAGCGGC AGACCTTCGC GATGGAGAAG
CGCTACGTCA CCGCCGCCGG CGAAACCGCC TGGGTCGGCC TGACCGTGTC GATGGTGCGT
ACGGCCGACG GCCGGCTCGA TCATTTCGTG GCGATCATCC AGGACATCGC CGAGCGCAAG
GCGTCCGAGG GCCAGCAACT GCGTCACCAC CAGCAGCTGC GGCTGATGAT CAACGAGCTG
AACCACCGGG TGAAGAACAC CCTGTCGACC ATCCAGTCGA TGGCGTCCCA GACGCTGCGC
AACGACCCCG ATCCGCTGTC GGCCTACGGA AAGTTCGAGG CGCGGCTGCT GGGCCTGTCT
CGGGTGCATG ACCTGCTGAC CGGCCAGCAT TGGCACGGGG CGGACCTGCG CGCCGTCGCG
ACGCGGGCGC TGCGGCCGTT CGTCGAGGAC GCGGCGGGCG GGGCGGCGGG CGGCGGCGTC
GAGATCGACG GACCGGACGT CTGGGCGCCG CCCTCGGCGG CCCTGGCCGT GGCCATGCTG
CTGCACGAAC TGGCCACCAA CGCCACCAAG TACGGGGCCT TATCGACGCC GGCGGGGCGC
GTGCGCCTGG CCTGGACCTT CGACGAGACC GAGCGCCTGG TGCGGCTGAC CTGGACGGAA
TCCGGCGGAC CGCCGGTCAA GGCGCCGGAA CGCCAGGGCT TTGGCTCGCG CCTGATCGCC
CGCGCCCTGC GCGACCTGCA GGGCGCGGCG GCCCTGCGGT TCGAGCCGGC GGGCGTGGTC
TGCGAGATGC ATCTGCGGCT GCCGACCAGC GAGGACCCGC TGGATCACGC GGCCCTGGCC
GGTTAG
 
Protein sequence
MTKPVRPIDP VGAPDPSPVL TALRQAPIGI AIFDRDMRYL FASGRYLSDQ DLPIDLPLVG 
RLHYDVFPGI PQVWRDIHAQ ALRDGIERNH PGERFVRPDG GIDWVRWSVA PWRADGGEIG
GLVLYTELVT ADVEARMALE AAEARYRAVF DQAAMGVARV ALDGRFLEVN DRYCQIVGHD
RDALLAGDFQ TITHADDLEK DMALVQALLA GERQTFAMEK RYVTAAGETA WVGLTVSMVR
TADGRLDHFV AIIQDIAERK ASEGQQLRHH QQLRLMINEL NHRVKNTLST IQSMASQTLR
NDPDPLSAYG KFEARLLGLS RVHDLLTGQH WHGADLRAVA TRALRPFVED AAGGAAGGGV
EIDGPDVWAP PSAALAVAML LHELATNATK YGALSTPAGR VRLAWTFDET ERLVRLTWTE
SGGPPVKAPE RQGFGSRLIA RALRDLQGAA ALRFEPAGVV CEMHLRLPTS EDPLDHAALA
G