Gene Hhal_0601 details

Gene Information

Locus tag: Hhal_0601
Symbol:
ID: 4709295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 678541
End bp: 681003
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 66%
IMG OID: 639855061
Product: ATP-dependent protease La
Protein accession: YP_001002189
Protein GI: 121997402
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La
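
The listed lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(678541, 681003)
print(length, protein_length(length))  # 2463 820
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields above.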


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.273158
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAGC CAACCGCTCA CAATCCCGAA TCCCCGCAGA CCGTCACCCA GGCGCCGGTT 
TTGCCGCTGC GCGACGTGGT GGTCTATCCG CACATGGTCA TCCCGCTGTT CGTTGGGCGG
GAACGCTCCA TCCACGCCCT CGAGGCGGCC ATGGAGCAGG ACAAGCGGAT CTTCCTGATC
GCCCAGCGCA GCGCCGAGGT CGATGACCCG GGTGTCGAAG AACTCTATGG CTACGGCACC
GTGGCCTCGA TCCTGCAGAT GCTCAAGCTC CCCGACGGTA CGGTGAAGGT GCTGGTCGAG
GGCGGGGAGC GGGCGCGCCT GGTCGAGCTG CTCGATAGCG GCGAATACCT CTCGGCCCAT
CTGGTTACGG TGCCCGAGCC GCAGCCCAGC GACGAGGACC GCGAGCTGGA GGTGGTGGCG
CGGTCCGCGA CCAACGTGTT TGAGCAGTAC GTCAAGCTCA ACAAGAAGAT CCCGCCGGAG
ATTCTCTCTT CGCTTTCCGG GATCGAGGAG CCCGGCCGAC TGGCTGACAC CATTGCTGCG
CACATGGCGC TGAAGGTCGA GGAGAAGCAG AAGGTCCTCG AGATGGAGGG CCCGCGCGAG
CGCCTCGAGC ACCTGATGGG CCTGATCGAG GGGGAGATCG ACATCCTCCA GATCGAGAAG
CGCATCCGCG GGCGCGTCAA GCAGCAGATG GAGAAATCCC AGCGGGAGTA CTACCTCAAT
GAGCAGATGA AGGCCATCCA GAAGGAGCTG GGTGAGCTGG AGGACGTGCC CAACGAGGTC
GAGGACCTCG AGAACAAGAT CGACCAGGCC GGGATGCCGC AGCAGGCCCT GGACAAGGCG
AAGCAGGAGC TGAACAAGCT CAAGATGATG TCGCCGATGT CCGCCGAGGC CACCGTGGTG
CGCAATTACC TCGACTGGCT GGTGAGCCTG CCCTGGAAGG ACAAGACGCG CGTGCGGCAC
GATCTCAAGC ACGCCGCCAA GGTCCTCGAT CAAGACCACT ACGGCTTGGA CAAGGTCAAG
GAGCGCATCC TCGAGTACCT GGCCGTGCAG CGACGGGTGC GCAAGCTCAA GGGGCCGATC
CTCTGTTTGG TTGGGCCGCC GGGTGTCGGC AAGACGTCGC TGGGGCAGTC CATCGCCCGG
GCCACCAACC GCAAGTTCAC CCGCATGTCC CTCGGTGGTG TGCGCGACGA GGCAGAGATC
CGTGGCCACC GCCGCACCTA CATCGGCTCG CTGCCGGGCA AGATTGTGCA GAACCTGAGC
AAGGCGGGTA AGCGCAACCC GCTGTTCCTG CTCGATGAGG TGGACAAGAT GGCCATGGAC
TTCCGGGGCG ACCCGTCCTC GGCCCTGTTG GAGGTCCTGG ATCCGGAGCA GAACAGCAGC
TTCAGCGACC ACTACCTTGA GGTCGACTTC GATCTCTCCG ACGTGATGTT CGTCTGCACG
GCGAACACCA TGAACATCCC GGCGCCGCTG CTCGACCGCA TGGAGGTCAT CCGCCTGCCG
GGCTACACCG AGGAGGAGAA GCTCGCCATC GCGCAGAGCT ATCTGCTGCC CAAGCAGATG
AAGGCCAACG GCATCCGCAA GGGCGAGCTG GACGTCAAGG AGAGTGCCAT CCGCGACGTC
ATCCGCTACT ACACCCGTGA GGCCGGCGTG CGAAATCTGG AGCGGGAGCT GGCGACGGTC
TGCCGCAAGG TGGTCAAGGG GCTGGTCGAG GGCGAGTCCA AGGGCCGCAA GCGCAGCGCC
GGCGTGCAGG TGACCCGCCG CAATATCGAC AAGTACCTCG GGGTGCGCCG CTACCGCTTC
GGGGTCGCCG AGACCGAGGA TCGCATCGGC CAGTCCACCG GTCTGGCCTG GACCGAGGTC
GGTGGCGAGT TGCTGACCAT CGAGGTGGCT GTGGTGCCCG GCAAGGGGCG GGCGACGCAG
ACCGGACAGC TCGGCGACGT CATGAAGGAG TCCATCGACG CTGCCACCAC CGTGGTGCGC
AGCCGGGCCC GCACCCTGGG ACTGGATCCG GAGTTCTACA CCAAGAACGA CTACCACATC
CACGTCCCCG AGGGGGCCAT CCCCAAGGAC GGTCCGTCGG CTGGCACTGG CATGTGTGTC
GCCCTGGTCT CGGCCTTGAC CGGCATCCCG GTGCGGGCGG GCGTGGGCAT GACCGGTGAG
ATCACCCTGC GCGGCGAGGT GTTGCCCATC GGTGGGCTCA AGGAGAAGCT TCTGGCCGCC
CTGCGGGGCG GCATCGACAC GGTGCTGATC CCGTCGGAGA ACGAAAAGGA TCTGGCGGAT
GTGCCCAAGG ACGTGAAGTC GAAGCTCGAC ATCCGGCCGG TGCGCTGGAT CGACGAGGTC
CTCGAGGTGG CCCTGACCCG TCAGCCTGAA CCGCTCCCGG CCCCGGAGGG TGAAGGCGAT
GCCGACGCGG CCACGCGCGT GGCAGTCGGC GAGGGGGAGG GTGACCCCAA GCGGCCACAC
TGA
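
The gene sequence is printed in space-separated blocks of ten bases, so it needs whitespace removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGCTGAGC CAACCGCTCA CAATCCCGAA TCCCCGCAGA CCGTCACCCA GGCGCCGGTT"
seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))
```

Passing the full gene sequence to the same functions gives a value that can be compared with the GC content field in the Gene Information section.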
 
Protein sequence
MAEPTAHNPE SPQTVTQAPV LPLRDVVVYP HMVIPLFVGR ERSIHALEAA MEQDKRIFLI 
AQRSAEVDDP GVEELYGYGT VASILQMLKL PDGTVKVLVE GGERARLVEL LDSGEYLSAH
LVTVPEPQPS DEDRELEVVA RSATNVFEQY VKLNKKIPPE ILSSLSGIEE PGRLADTIAA
HMALKVEEKQ KVLEMEGPRE RLEHLMGLIE GEIDILQIEK RIRGRVKQQM EKSQREYYLN
EQMKAIQKEL GELEDVPNEV EDLENKIDQA GMPQQALDKA KQELNKLKMM SPMSAEATVV
RNYLDWLVSL PWKDKTRVRH DLKHAAKVLD QDHYGLDKVK ERILEYLAVQ RRVRKLKGPI
LCLVGPPGVG KTSLGQSIAR ATNRKFTRMS LGGVRDEAEI RGHRRTYIGS LPGKIVQNLS
KAGKRNPLFL LDEVDKMAMD FRGDPSSALL EVLDPEQNSS FSDHYLEVDF DLSDVMFVCT
ANTMNIPAPL LDRMEVIRLP GYTEEEKLAI AQSYLLPKQM KANGIRKGEL DVKESAIRDV
IRYYTREAGV RNLERELATV CRKVVKGLVE GESKGRKRSA GVQVTRRNID KYLGVRRYRF
GVAETEDRIG QSTGLAWTEV GGELLTIEVA VVPGKGRATQ TGQLGDVMKE SIDAATTVVR
SRARTLGLDP EFYTKNDYHI HVPEGAIPKD GPSAGTGMCV ALVSALTGIP VRAGVGMTGE
ITLRGEVLPI GGLKEKLLAA LRGGIDTVLI PSENEKDLAD VPKDVKSKLD IRPVRWIDEV
LEVALTRQPE PLPAPEGEGD ADAATRVAVG EGEGDPKRPH
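
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The table is built by hand rather than taken from a library, and `translate` is an illustrative helper.

```python
# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence from the record above.
first_row = "ATGGCTGAGC CAACCGCTCA CAATCCCGAA TCCCCGCAGA CCGTCACCCA GGCGCCGGTT"
print(translate(first_row))  # MAEPTAHNPESPQTVTQAPV
```

The output matches the first twenty residues of the protein sequence listed above.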