Gene Clim_1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1118 
SymbolhslU 
ID6355760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1219015 
End bp1220487 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content55% 
IMG OID642668735 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_001943166 
Protein GI189346637 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTA CCAGCGACAC TGAAGCTGCT GCCAGGACAG AAGGAAGAAG TGCTATTGCC 
GCACATAATC TTACACCGAA CCAGATTGTC GAACTTCTCG ATAAATATAT CATCGGGCAG
AAAGACGCCA AGAAATCGGT AGCCATCGCT CTGCGCAACC GGTTGCGCCG TCAGCATGTA
GGCGACGATC TTCGCGAGGA GATCATGCCG AACAACATCA TCATGATAGG GCCTACCGGC
GTGGGTAAAA CCGAAATAGC CCGGAGGCTT GCCAAGCTTG CCAAAGCCCC GTTTGTAAAG
GTAGAGGCTT CAAAATTCAC CGAAGTCGGC TATGTGGGGC GCGATGTCGA ATCCATGATC
CGCGACCTGG TCGATCAGTC GGTAGCCATG GTGCGCAGCG AGAAATCCGA AGAGGTAAAA
GAAAAAGCCG CTCTTCTTGT CGAGGAGCGT CTTCTCGATA TACTCCTTCC TCCGGCTCCG
CCGTCACGAT CGCATGAGGA TCAGGACGAC GACCTGGACG AAAACCGGAA TGCCATGGCT
CCGGCGGACG AGAACGATAT TTCACAGGAG GTTAACCGCC GCAGCCGGGA AAAGATGCTT
GAACGGCTTC GCAAGGGAAA GCTCGAGGAC CGTCAGATCG AAATGGATAC GGCAAGCGAG
AACCCAGGGG GGATGATGCA AATATTCGGT CCTCTCGGCC AGATGGAGGA GATCGGAAGC
ATCATGCAGG ATCTCATGAG CGGTCTGCCG CGCAAGCGCA AAAAACGTCG GGTAACGATA
GCAGAAGCCC GCCGGATACT CGAACAGGAG GAGGTGCAGA AGCTTATCGA TATGGACGCC
GTGGTCAAGG ATGCCATCAA CAAGGTCGAA CAGTCCGGCA TTGTGTTCAT CGACGAGATC
GACAAGATAG CCGCTCCGTC GACTGGTTCG GGAGGCGGCA AAGGCCCCGA CGTCAGCCGT
GAAGGGGTGC AGCGCGACCT TCTGCCTATT GTCGAAGGAT CCAACGTCGC CACCAAATAC
GGCATCGTCA AAACCGACCA TGTGCTTTTC ATCGCATCAG GCGCTTTTCA CGTCTCCAAG
CCCTCCGACC TCATTCCCGA ACTGCAGGGC CGCTTTCCCA TCAGGGTCGA ACTCAAAAGC
CTTACCGAGG AGGATTTCTA CAAGATTCTC ACCCAGCCGA AGAACGCGCT CATCAAGCAG
TACAAGGCGC TGATCAGCAC CGAGGGGGTC GATCTGGACT TTACCGACGG AGCGATACTT
GAGATCGCCA GAATCGCGGC CAAGGTCAAC GAAAGCGTTG AGAATATCGG AGCACGCCGG
CTGCACACCA TCATGACCAA TCTGCTCGAA GAGCTGATGT TCAACATTCC CGAAAGCGTG
ACGGAAGAAA AGGTAGTGAT TGACGAAGCC ATGGTGCAGG ATAAGCTTTC CGCGGTCTCA
TCGGATCGTG ATCTGAGCCA GTATATTCTC TAA
 
Protein sequence
MTITSDTEAA ARTEGRSAIA AHNLTPNQIV ELLDKYIIGQ KDAKKSVAIA LRNRLRRQHV 
GDDLREEIMP NNIIMIGPTG VGKTEIARRL AKLAKAPFVK VEASKFTEVG YVGRDVESMI
RDLVDQSVAM VRSEKSEEVK EKAALLVEER LLDILLPPAP PSRSHEDQDD DLDENRNAMA
PADENDISQE VNRRSREKML ERLRKGKLED RQIEMDTASE NPGGMMQIFG PLGQMEEIGS
IMQDLMSGLP RKRKKRRVTI AEARRILEQE EVQKLIDMDA VVKDAINKVE QSGIVFIDEI
DKIAAPSTGS GGGKGPDVSR EGVQRDLLPI VEGSNVATKY GIVKTDHVLF IASGAFHVSK
PSDLIPELQG RFPIRVELKS LTEEDFYKIL TQPKNALIKQ YKALISTEGV DLDFTDGAIL
EIARIAAKVN ESVENIGARR LHTIMTNLLE ELMFNIPESV TEEKVVIDEA MVQDKLSAVS
SDRDLSQYIL