Gene Hhal_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0121 
Symbol 
ID4710620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp137582 
End bp139702 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content67% 
IMG OID639854579 
Productcarboxyl-terminal protease 
Protein accessionYP_001001717 
Protein GI121996930 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCCCGTA GTCACCTGTT GTGGACCCCC GTACTGGCTC TCGGCCTGGC CTTTTCCTCC 
GTGGCACCAC TCCCCGGAGC GCCGGGTGCC GCCCCGGCGA CCGCGGGTGA GAATGGCGCA
GTACCCGGAC AGACCGAATT CGAGAAGGCG CGCGTCGTCG GCGACCTGCT CCAGCGCTAC
CACTACGGTG GCCCCGAGTC CGACGAGCAG CTCATGGAGC AGGCCACCGA GACGTACCTC
AAGCAGCTCG ACCACGGCCG CTTCTTTCTT CTGAAGGAGG ATGTCGAGGC GTTCCGCGAG
CGCATGAGCG AGCTCGACCC CGGCCAGGGC GACGCAATCC TGGAGGCCGC CTACGACCTC
CACGCCCGCT ATCGCGACCG GGTCGCCGAG CAGACGGAAT TCGCTCTGGC CCTTCTCGAG
GAAGGGTTCG GCTTCGACGG CGAAGGCCGC TTCGAACAAG ACCGCAGTGA AGCCGAATGG
GCCGCCGACC GTGAGGCCCT CGACGAGCTC TGGCGGCAGC GCGTGACCCA CGACGCACTG
ACCCTGGAGC TGGCCGAGCG CAGCACCGAG GAGATCCGCG ACAACCTCGA ACGGCGGTAC
ACCACGCTGC GCGACCGGGC CGTGGACGCG GAAAACAAGG ACATCATGGA TCAGTTCCTC
AGTGCCTGGG CGGCGGCCTA CGACCCGCAC AGCACCTTCC TCTCGCCGCA GCGCTCCGAG
GAGTTCGACA TGCAGATGTC GCTGCAGCTC GAGGGGATCG GGGCCAAGCT GACCATGGAT
CAGGACTTCA CCGAGATCGT CGAACTCATC CCCGGCGGAC CTGCCGAGCA GTCCGGTGAA
TTGCGCGAGG GCGAGCGCAT CATCGGCGTC GCCGACGGCG ATGACGGGGA GATGAAAGAC
GTCGTCGGGT GGCGGCTGGA CGAGATCGTG GACATGATTC GCGGCCCGAA GGAGTCCGTG
GTCCGACTCA ACGTGCTGCC GCCCGCCGGC GCCAGCGAGA GTTCACCGCG GGAGGTACGC
CTGGTCCGCA ACAAGGTCGA CCTGGAAGAC CAGGCCGCCC GTAAGGAGGT CATCGAGAAG
ACCAACGCCG AGGGTGAGCA GAAGCGTATC GGCGTGATCA CGATCCCCAA GTTCTACCGC
GACTTTGAGG CGGCCCACTC CGGGCAGGAC GACTTCCGCA GCACCACGCG GGACGTCGAG
CGCCTGCTCG GCGAACTGCT CGAGGACGGG GTCGACGGAC TGCTGATTGA CCTGCGCGGC
AACTCCGGCG GGGCCCTGCG CGAGGCCACC GCCCTGACCG CCCTGTTCAC CGGCGGCGGG
CCGGCGGTGC AGGTCCGCGA CTCCCGCGGC CACCCGGAGC AAGTCGGCGA GTCCAGCGGC
GATCCGGCTT ACGACGGCCC GTTGGGGGTG CTGGTGGATC GACGCAGCGC ATCGGCCTCC
GAGATCTTCG CGGCCGCGAT CAAGGATTAC GGGCGCGGGA TCGTGCTCGG CGATCAGACC
TTCGGCAAGG GCACGGTGCA GCAGATGATC GGCCTGGACA ATTACGCCAT CCCCGGAGAG
GAGCGTTCCG GTCAGCTCAA GCTGACCCTC GCCCAGTTCT ACCGGGTGAC CGGAGAGAGC
ACCCAGCTCG AGGGCGTAAA ACCGGATATC CACCTGCCGT CTGAGTTCAG CCACGAGGAG
TTCGGCGAAC GGGCCACCCG GAATCCGCTG CCGGCCACCC AGATCGACGG GCTCGACATC
ACCGTTCAGT ACGAGCTGGA GACCATCATC GACGAACTGG CCCGCCGGCA CGAGGCACGG
ATGGAGGAGA CCGAGACCTT CCGGGCCCTG GAGCGAAAAT TGGAGGCCCA ACGGGAGATC
CGCGAGGACA CCACGGTCGC CCTGAGCAAA ACGACCCGGC AGGAGGAGCA GAAGGCCCGC
GAGGAACGGC TGCTGGAGCT GCACAACGAT CGGCGCCGAG CCCACGGCAA GGATCCGGTG
GAGAGCTACG CCGACGTCGA CGCCGATGAC CTACCGGACG CCCTGCTCGA TGCCAGTGCG
GCGATCATTG CCGACTTCGC ACAGCTCCTG CGGGAGGCCG GCGACGAGGT ACTCACCGCC
GAGGCTCGCA AGGAAGGCTG A
 
Protein sequence
MARSHLLWTP VLALGLAFSS VAPLPGAPGA APATAGENGA VPGQTEFEKA RVVGDLLQRY 
HYGGPESDEQ LMEQATETYL KQLDHGRFFL LKEDVEAFRE RMSELDPGQG DAILEAAYDL
HARYRDRVAE QTEFALALLE EGFGFDGEGR FEQDRSEAEW AADREALDEL WRQRVTHDAL
TLELAERSTE EIRDNLERRY TTLRDRAVDA ENKDIMDQFL SAWAAAYDPH STFLSPQRSE
EFDMQMSLQL EGIGAKLTMD QDFTEIVELI PGGPAEQSGE LREGERIIGV ADGDDGEMKD
VVGWRLDEIV DMIRGPKESV VRLNVLPPAG ASESSPREVR LVRNKVDLED QAARKEVIEK
TNAEGEQKRI GVITIPKFYR DFEAAHSGQD DFRSTTRDVE RLLGELLEDG VDGLLIDLRG
NSGGALREAT ALTALFTGGG PAVQVRDSRG HPEQVGESSG DPAYDGPLGV LVDRRSASAS
EIFAAAIKDY GRGIVLGDQT FGKGTVQQMI GLDNYAIPGE ERSGQLKLTL AQFYRVTGES
TQLEGVKPDI HLPSEFSHEE FGERATRNPL PATQIDGLDI TVQYELETII DELARRHEAR
MEETETFRAL ERKLEAQREI REDTTVALSK TTRQEEQKAR EERLLELHND RRRAHGKDPV
ESYADVDADD LPDALLDASA AIIADFAQLL REAGDEVLTA EARKEG