Gene Acid345_0212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0212 
SymbolhslU 
ID4071665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp226406 
End bp227824 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content59% 
IMG OID637982213 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_589291 
Protein GI94967243 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.571703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAATTT ATCTTCCCGC AACAACCGAT GATGAACAGG TAATGCTCGA CGAGCTGACG 
CCGCGCGAGA TCGTGGCGGA ACTCGACAAG TACGTAGTGG GCCAGCACGA AGCCAAGCGC
GCGGTAGCGA TCGCGCTCCG CAATCGCATG CGCCGGCAGC GGCTGACGCC CGATCTCGCG
GAAGAGATCA TTCCGAAGAA CATCATCATG ATCGGGCCCA CCGGCGTGGG CAAAACCGAG
ATCGCACGAC GGTTGGCAAA GCTGGCGAAC TCGCCCTTCC TGAAGGTCGA GGCATCGAAG
TTCACCGAGG TCGGCTACGT AGGTCGCGAC GTGGAATCGA TGATCCGAGA CCTGGTCGAG
ATCGCGATTG ACATGATCCG CGAAGAGAAG CTCGACGATG TGGCCGACAA GGCCGAGATG
AACGCCGAAG AGCGTCTGCT CGATTTGCTG CTTCCGAGTT CACCGCAGCC GGCTGCTGCG
CACGAAGCGG GTGCCGGATT CACGCAAGGA CAGTTGGAGT TGCCGGGCGA CGGAGGCGGT
TCCCGCACGC GGGAAAAATT GCGTCAGCAG CTGCGTGAAG GCAAGCTCGA CGAGCGCACC
GTGGAACTCG ATGTCCGCGA GAAGAACTTC CCTGCCTTCG AGATCATCTC GAACCAGGGC
GTGGAAGAGA TGGACATCAA CATGAAGGAC ATGTTGCCCA ACATCTTCGG CTCGCGCACC
AAGAAGCGGA AGATGAAAGT CAACGAAGCC TTCGATTATC TGATCCAGGA AGAAGAGCAG
CGACTCATCG ATATGGAGCA GGTGCAGCGC GTAGCGGTGG AACGCGTGGA GCAATCGGGC
ATCATCTTCC TCGATGAGAT CGACAAGATC GCCGGCCGCG AAGGCGGCCA TGGGCCTGAT
GTTTCGCGCG AGGGCGTGCA GCGCGACATC CTGCCGATCG TTGAAGGCAC CACCTGCAAC
ACTCGCTACG GCATGGTGCG CACCGACCAC ATCCTGTTCA TCGCCGCGGG CGCGTTCCAC
GTATCGAAGC CCAGCGATTT GATTCCGGAA CTGCAGGGGC GCTTCCCGAT CCGCGTGGAG
TTGCAATCGC TGAGCGTTGC GGACTTCATC AAGATCCTGA CCGAGCCGAA GTCGTCGCTG
GTGAAGCAGT ACACAGCGCT GCTGGAGACC GAAGGCGTGA AGCTGGAGTT CACACGCGAT
GCGCTGGACG AAGTCGCGAA CTTCGCGGCG ATTGTGAATG AGGGCACCGA GAATATCGGT
GCACGACGTT TGCACACGAT CATGGAAAAA GTTCTGGACG AGATCAGCTT CTCCGCGCCG
GACCTGGAAA ACAAGAATGT AACCGTGGAC GCGGAGTACG TGCGCAATGC TCTGGTTCAC
ATCGTGAAGA ACCAGGATTT GTCGCGGTAC ATTCTGTAA
 
Protein sequence
MAIYLPATTD DEQVMLDELT PREIVAELDK YVVGQHEAKR AVAIALRNRM RRQRLTPDLA 
EEIIPKNIIM IGPTGVGKTE IARRLAKLAN SPFLKVEASK FTEVGYVGRD VESMIRDLVE
IAIDMIREEK LDDVADKAEM NAEERLLDLL LPSSPQPAAA HEAGAGFTQG QLELPGDGGG
SRTREKLRQQ LREGKLDERT VELDVREKNF PAFEIISNQG VEEMDINMKD MLPNIFGSRT
KKRKMKVNEA FDYLIQEEEQ RLIDMEQVQR VAVERVEQSG IIFLDEIDKI AGREGGHGPD
VSREGVQRDI LPIVEGTTCN TRYGMVRTDH ILFIAAGAFH VSKPSDLIPE LQGRFPIRVE
LQSLSVADFI KILTEPKSSL VKQYTALLET EGVKLEFTRD ALDEVANFAA IVNEGTENIG
ARRLHTIMEK VLDEISFSAP DLENKNVTVD AEYVRNALVH IVKNQDLSRY IL