Gene Hhal_1400 details


Gene Information

Locus tag: Hhal_1400
Symbol:
ID: 4711122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1512168
End bp: 1514438
Gene Length: 2271 bp
Protein Length: 756 aa
Translation table: 11
GC content: 65%
IMG OID: 639855867
Product: ATP-dependent Clp protease, ATP-binding subunit clpA
Protein accession: YP_001002969
Protein GI: 121998182
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA
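
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Both assumptions are consistent with the values listed here but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(1512168, 1514438)
print(length_bp)                  # 2271
print(protein_length(length_bp))  # 756
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.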


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAAGTA AAGAGTTAGA GTTCACGCTC AATCTGGCTT TCAAGGACGC ACGGGAGAAG 
CGCCACGAGT TCCTCACCGT TGAGCACCTC TTGCTGGCCT TGACCGACAA CCCCGCTGCC
TCGCAGGTGC TCAAGGCCTG TGGCGCCGAT CTCGAGCGGC TGCGGGGGGA GTTACAAGCC
TTCCTGTCCG AGACCACGCC GCTGCTCCCG GTAAACGACA GCCGCGAGAC GCAGCCGACG
CTCGGGTTCC AGCGGGTCCT GCAGCGCGCC ATCCTCCACG TGCAGTCCTC CGGCAAGCGT
GAGGTCACCG GAGCCAACGT ACTGGTGGCC ATCTTCAGCG AGCAGGAGTC CCAGGCGGTC
TACTTTCTGC ACAAGCAGAA CATCTCTCGC CTGGACGTGG TGAACTACAT TTCCCACGGC
ATCTCCTCGG TCGGGGGTGA AGAAGACATG GGCAAGGAAG AGAGTGGCCC GGCGGATGAA
GAGGGCGCAG CGGAGCCGAC CCAGGGCGGG TCGCCGCTGG AGCAGTACGC CACCAACCTG
AATCAGCGGG CCCGCTCCGG GCAGATCGAC CCGCTGATCG GTCGTCGTTA CGAGATCGAG
CGGACGGTCC AGGTGCTCTG CCGGCGCCGC AAGAACAACC CGCTGTTCGT CGGCGAGGCC
GGTGTCGGCA AGACCGCCAT CGCCGAGGGG CTGGCCAAGC AGATCGTCGA GGGCGAGGTG
CCGGAGGTGC TCCGGGAGAG CACTATCTAC TCGCTGGATC TCGGTGCCCT GGTGGCCGGT
ACCAAGTACC GTGGCGACTT CGAGAAGCGC CTCAAGGCGT TGCTCCACCA GCTCAAAAAG
GACAGCGGCT CCATCCTGTT CATCGACGAG ATCCACACCA TCATCGGAGC TGGGTCGGCC
TCCGGTGGGG TGATGGACGC CTCGAACCTG ATCAAGCCGA TGCTCGCCTC CGGCGAGCTG
CGCTGCATCG GATCGACCAC GTATCAGGAG TACCGCGGTA TCTTCGAAAA GGATCGGGCC
CTGGCCCGGC GCTTCCAGAA GATCGATGTC AGCGAGCCGT CGGTGGAGGA TACCGTCCAG
ATCCTCAAGG GGCTGAAGAG CCGCTTCGAG GAGCACCACA ATGTCCGCTT CACGGAGCCA
GCCCTGCAAG CGGCGGCGGA GTTGTCGGCC AAGTACATCA ATGACCGGCG CCTGCCCGAC
AAGGCCATCG ACGTCATCGA CGAGGCCGGC GCCCGGCTGC GCCTGCGCCC GCGCTCGAAG
CGGCGCAAGA CGGTGGGCCT GCCGGATATC GAGTCGATCG TGGCGAAGAT CGCGCGGATC
CCCCCCAAGC GGGTCTCCAG CCAGGACATG AAGGTGCTCG AGAACCTCGA GGGCGAGCTC
AAGGGACTGA TCTTCGGTCA GGACGAGGCC ATCGAGAGCC TGGCGTCGAC CATCAAGATG
TCTCGGGCCG GGCTGGGTAC GCCGGATCGG CCGGTGGGCA CCTTCCTGTT TGCCGGTCCC
ACCGGCGTGG GCAAGACCGA GGTGACCCGA CAGTTGGCCG AGGTGACCGG TGTCCAGATG
ATCCGCTTCG ACATGTCCGA GTATATGGAG CGGCACACCG TCTCGCGGCT GATCGGTGCC
CCTCCGGGGT ACGTCGGCTA CGACCAGGGC GGGCTGCTCA CCGAGGAGGT CATCAAGCAC
CCGCACTCGG TACTGCTGCT CGACGAGATC GAGAAGGCCC ATCCGGACGT GTTCAACCTG
CTGCTGCAGG TGATGGATCA CGGCACGCTG ACGGACAACA ACGGCCGCGA GGCGGACTTC
CGCAACGTGG TGCTGGTGAT GACCACCAAC GCCGGGGCTG AGGAGATGAG CAAGCGCTCG
ATCGGCTTCA CCAACGAGAG CGACACCCAG GACAGCATGG AGGCGATCCG GCGGACCTTC
TCCCCGGAGT TCCGCAACCG CATCGATGCG GTGGTGCAGT TCCAGCCCCT GGGTCAGGAC
ACCGTGCAGC GGGTGGTCGA CAAGTTCATC CGCGAGCTGT CGGTGCAGCT GGCCGAGAAG
CGTGTCACCC TGGTCGTCGA CGGCGACGCG CGACGGTGGA TCGGGGAGAA GGGCTATGAC
CCGCAGATGG GTGCACGCCC GATGGCCCGA GTCATCCAGC AGCACATCAA GAAGCCGCTG
GCCGAGCAGC TGCTCTTCGG CGAGCTGACC GGTGGTGGCG AGGTCGAGGT CACGGTCGAG
GATGGCGAGC TGACCATCCA CGTCCGGGAG AGGGATACCG AGGGGGCGTA G
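
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, exactly as formatted in the record.
first_line = "ATGCTAAGTA AAGAGTTAGA GTTCACGCTC AATCTGGCTT TCAAGGACGC ACGGGAGAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying `clean_sequence` to the full sequence block and passing the result to `gc_fraction` would give a value comparable with the "GC content" field.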
 
Protein sequence
MLSKELEFTL NLAFKDAREK RHEFLTVEHL LLALTDNPAA SQVLKACGAD LERLRGELQA 
FLSETTPLLP VNDSRETQPT LGFQRVLQRA ILHVQSSGKR EVTGANVLVA IFSEQESQAV
YFLHKQNISR LDVVNYISHG ISSVGGEEDM GKEESGPADE EGAAEPTQGG SPLEQYATNL
NQRARSGQID PLIGRRYEIE RTVQVLCRRR KNNPLFVGEA GVGKTAIAEG LAKQIVEGEV
PEVLRESTIY SLDLGALVAG TKYRGDFEKR LKALLHQLKK DSGSILFIDE IHTIIGAGSA
SGGVMDASNL IKPMLASGEL RCIGSTTYQE YRGIFEKDRA LARRFQKIDV SEPSVEDTVQ
ILKGLKSRFE EHHNVRFTEP ALQAAAELSA KYINDRRLPD KAIDVIDEAG ARLRLRPRSK
RRKTVGLPDI ESIVAKIARI PPKRVSSQDM KVLENLEGEL KGLIFGQDEA IESLASTIKM
SRAGLGTPDR PVGTFLFAGP TGVGKTEVTR QLAEVTGVQM IRFDMSEYME RHTVSRLIGA
PPGYVGYDQG GLLTEEVIKH PHSVLLLDEI EKAHPDVFNL LLQVMDHGTL TDNNGREADF
RNVVLVMTTN AGAEEMSKRS IGFTNESDTQ DSMEAIRRTF SPEFRNRIDA VVQFQPLGQD
TVQRVVDKFI RELSVQLAEK RVTLVVDGDA RRWIGEKGYD PQMGARPMAR VIQQHIKKPL
AEQLLFGELT GGGEVEVTVE DGELTIHVRE RDTEGA
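
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch, not the pipeline that produced the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order paired with their one-letter amino acids ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCTAAGTA AAGAGTTAGA GTTCACGCTC"))  # MLSKELEFTL
```

The output matches the first block of the protein sequence listed above.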