Gene Caul_5052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5052 
SymbolhslU 
ID5902514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5451571 
End bp5452866 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content68% 
IMG OID641565573 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_001686670 
Protein GI167649007 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.42386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.164562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAT TCTCCCCCCG CGAAATCGTC TCCGAACTCG ACCGCTACAT CGTCGGCCAC 
ACCGAGGCCA AGAAAGCCGT CGCCGTGGCC CTGCGCAACC GCTGGCGTCG CCGCCGCGTG
CCCGCCGACC TGCGCGATGA GGTGACGCCC AAGAACATCC TGCTGATCGG CCCCACCGGC
GTGGGCAAGA CCGAGATCGC CCGCCGCCTG GCCAAGCTGG CCCAGGCCCC GTTCCTGAAG
GTGGAAGCCA CCAAGTTCAC CGAGGTCGGC TATGTCGGCC GCGACGTCGA CCAGATCGTC
CGCGACCTGG TGGAGAGCGC CCTGTCCATG GTTCGCGACA AGCGGCGGGG CGGGGTCAAG
GCCAAGGCCG AGGCGGCCGC CGAGGAGCGC ATCCTCGACG CCCTGACCGG TCCCGGCTCG
ACGGCGGCGC GGGAAAGCTT CCGCAAGAAG CTGCGGGCCG GCGAGTTGGA CGACAAGGAG
GTCGAGCTGC AGCTGGCCGA CACCGGCGGC GGCGGCATGT TCGAGATCCC CGGCCAGCCG
GGCGCCTCGG TGTTGAACCT GTCGGAAATG ATGAAGTCGT TCGGCGGCGG TCGCACCAAG
ACCCACAAGA CCACCGTCGT CGGCGCGTGG GCCCCGCTGA TCGCCGAGGA AAGCGACAAG
CTGCTGGACC AGGAGGCCCT CACGCAAGAG GCCCTGGAGC TGGCCGAGAA CCACGGCATC
GTCTTCTTGG ACGAGATCGA CAAGGTCGCC AGCTCGAGCC AACGCGGCGG CGCCGACGTC
TCCCGAGAGG GCGTGCAACG AGATCTCCTG CCGCTGATCG AGGGCACGAC GGTCTCGACC
AAGTACGGAC CCGTGAAGAC CGACCACATC CTGTTCATCG CCTCGGGCGC CTTTCACGTG
GCCAAGCCGT CGGACCTGCT GCCCGAGTTG CAGGGTCGCC TGCCGATCCG CGTGGAACTG
AAGGGCCTGA CCCGCGACGA CCTGCGCCGC ATCCTGACCG AGCCGGAAGC CAACCTGATC
CGCCAGCACC AGGCCCTGCT GCTGACCGAG GGCGTCACCC TGGTGTTCAC CGAAGAGGCC
ATCGACGCCG TCGCCGACGC AGCCGTGGCG GTCAACGCCT CGGTCGAGAA CATCGGCGCC
CGCCGGCTGC AGACCATCCT GGAGAAGGTG GTCGAGGAGA TCAGCTTCAC GGCGGCCGAT
CGCTCCGGCG AGACCGTGAC GGTGGACGCC GATTACGTGC AAGCGCGGAT CGGAGCCCTG
GCGGCCAACG CGGATCTCAG CCGGTTCATT CTTTAG
 
Protein sequence
MTEFSPREIV SELDRYIVGH TEAKKAVAVA LRNRWRRRRV PADLRDEVTP KNILLIGPTG 
VGKTEIARRL AKLAQAPFLK VEATKFTEVG YVGRDVDQIV RDLVESALSM VRDKRRGGVK
AKAEAAAEER ILDALTGPGS TAARESFRKK LRAGELDDKE VELQLADTGG GGMFEIPGQP
GASVLNLSEM MKSFGGGRTK THKTTVVGAW APLIAEESDK LLDQEALTQE ALELAENHGI
VFLDEIDKVA SSSQRGGADV SREGVQRDLL PLIEGTTVST KYGPVKTDHI LFIASGAFHV
AKPSDLLPEL QGRLPIRVEL KGLTRDDLRR ILTEPEANLI RQHQALLLTE GVTLVFTEEA
IDAVADAAVA VNASVENIGA RRLQTILEKV VEEISFTAAD RSGETVTVDA DYVQARIGAL
AANADLSRFI L