Gene Caul_0166 details


Gene Information

Locus tag: Caul_0166
Symbol: hrcA
ID: 5897878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 185097
End bp: 186173
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 69%
IMG OID: 641560650
Product: heat-inducible transcription repressor
Protein accession: YP_001681801
Protein GI: 167644138
COG category: [K] Transcription
COG ID: [COG1420] Transcriptional regulator of heat shock gene
TIGRFAM ID: [TIGR00331] heat shock gene repressor HrcA
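
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the reported gene length is consistent with) and that the coding sequence ends in a single stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 185097
end_bp = 186173

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1077
print(protein_length)  # 358
```

Both values agree with the Gene Length and Protein Length fields reported for Caul_0166.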


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.714476
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.458775
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAGC TGTTTTCCGG GTTTTCCGCT CAGACGCCCA GTCTCACCGA CCTGGATGGG 
CGGGCTCGCG ACATTTTCCG GCGGGTGGTC GAATCCTATC TCGAGACCGG CGAACCGGTC
GGATCGCGGA CCATCTCGCG CGGCGGGGTG CAGCTGTCGC CGGCCTCGAT CCGCAACACC
ATGCAGGATC TGGCGCAGCT GGGCCTGCTG GACGCCCCCC ATTCCAGCGC CGGCCGCATC
CCCACCCATG CGGGCCTGCG GATGTTCGTC GACGGCCTGC TGGAGGTGGG CGACATCGGC
GAGGAGGAGC GGCGGACCAT CGAATCGCGA CTGTTCGCCC ACGGTCGCTC GTTCGAAGAG
GCGATGGGCG AAGCCAGCGC CATCCTGTCG GGCCTGGCCG GCGGGGCGGG CATCGTAGTC
ACCCCGGTCC GCGAAGGCGG GGTCAAGCAC GTGGAGTTCG TGGCCCTGGG CGCCGACCAG
GCCCTGGCGA TCATGGTGTT CGACGACGGC ACGGTTGAGA ACCGGTTGAT GAAGCGCTCG
GCCGGCGTCA CGCCGGCCTC CCTGCAGGAG GCCTCCAACT TCCTCAACGC CCGCCTGCGC
GGCCGCACCC TGAACGAGGC CAAGACCGAG ATGGCGGCCG AGCTGGACAC GGCCCGGCGC
GAACTGGACG CCACGGCGGC GCGCCTGGTC GAGGACGGCC TGGCGGCCTG GAGCGGCGGC
GACGACCCCG ACCGCGCCCT GATCGTCCGC GGTCGCGCCA ACCTGCTGGC CGACGCCAGC
GCCCGGGAAG ATCTCGAGCG CGTCCGGCGG CTGTTCGATG ACCTGGAGCA GAAGGGCCAA
CTGATCGGCC TGCTGGACGA TGTGCGATCC GCCGAGGGCG TGCGCATTTT CATCGGGGCC
GAAACGCGAC TCTTTTCGCT TTCGGGTTCC TCCCTGATCG CGGCGCCCTA TATGTCGGGC
CGACAAAAGG TGTTGGGAGC GATCGGCGTG ATCGGTCCCA CGCGTTTAAA CTATGCCCGG
GTGATCCCGC TGGTGGACTA TACCGCTCGC GTGCTTGGCC GGATGATGGA CGGATAG
 
Protein sequence
MTQLFSGFSA QTPSLTDLDG RARDIFRRVV ESYLETGEPV GSRTISRGGV QLSPASIRNT 
MQDLAQLGLL DAPHSSAGRI PTHAGLRMFV DGLLEVGDIG EEERRTIESR LFAHGRSFEE
AMGEASAILS GLAGGAGIVV TPVREGGVKH VEFVALGADQ ALAIMVFDDG TVENRLMKRS
AGVTPASLQE ASNFLNARLR GRTLNEAKTE MAAELDTARR ELDATAARLV EDGLAAWSGG
DDPDRALIVR GRANLLADAS AREDLERVRR LFDDLEQKGQ LIGLLDDVRS AEGVRIFIGA
ETRLFSLSGS SLIAAPYMSG RQKVLGAIGV IGPTRLNYAR VIPLVDYTAR VLGRMMDG
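
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons; that case is not handled here, and this gene begins with ATG anyway). The helper `translate` and the constant names are hypothetical, written only for this illustration.

```python
from itertools import product

# Standard codon assignments, enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above, copied with their spacing.
first_block = "ATGACACAGC TGTTTTCCGG GTTTTCCGCT CAGACGCCCA GTCTCACCGA CCTGGATGGG"
print(translate(first_block))  # MTQLFSGFSAQTPSLTDLDG
```

The result matches the first twenty residues of the protein sequence listed above.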