Gene Hhal_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2202 
Symbol 
ID4709550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2416478 
End bp2417938 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content71% 
IMG OID639856677 
Productpeptidase M48, Ste24p 
Protein accessionYP_001003768 
Protein GI121998981 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000223537 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGATC GCCTCCGCCG AACAGCCGCG GCCCTACTGA TCGCCGCGCT GACGCTCACC 
GCCCCGGCCC CGTCGCCGGT CCAGGCGGAG AGCGCCCAGC TGCCGCAGCT TGGCGTACCG
GGGGCGGACG CCCTGCCCGT GCACAAGGAG CGCGAACTGG GGGCCAAAAT CATGCGCCAG
GTCCGCCAGC ACCTACCCCT GCACGAGGAC CCAGAAACCA ATGAGTATCT CCAGAACCTC
GGGCACCGCC TGGCGGCGCA CAGCAACGAG CCGGGATTCG GCTACAGTTT CTTCCTGGTG
GAAGACGACC AGATCAACGC CTTCGCCCTG CCCGGCGGAT ACATCGGCCT CCACACCGGG
CTGATCCGCG AAACCCGGAC GGAGAGCGAA CTCGCCGGCG TGCTCGCCCA CGAGATCGCC
CACGTCACCC AGCGCCACAT CGCTCGGCAG TACGCTCAGT CGCAGCAACT CAACCTGCAG
ACCGCCGCCG CCGTCCTGGC CGCCATCCTG ATCGGCTCGC AGAGCCCGCA GGCCGGCAGC
GCCGCCGCCA TGGCGGGCAT CGCCGCGCCC ATCCAGCAGC AGCTGAGCCA CTCCCGCACC
CACGAGCAGG AGGCGGACCG GGTCGGCCTC CACAACCTGG TGGCCGCCGG GCTCGACCCC
TACGGCATGC CCGGGTTTTT CGAGCGCCTG GCCGACGCCT CCCGTTTTGC CGAGGATCCG
CCGGAGTACC TGAGCACCCA CCCGCTCACC GAGCGCCGCC TGAACGAGGC CCAACGCCTG
GCCGAACGCC TGGAGGGCGG CACCGTCTAC GAGAGCGGCC ATCACGCCTT CATTCGCGCC
CGGCAGCAGG TCCTCACTAA TAGGGAGCGC AGCACCAGCG CAGTGGCGTT TATGCGGGAT
CAGTTGCGGC GCAGCACCGA CGACCCCACC GAGCGTGCGG CCGCCCTCTA CGGCCTCGCC
CTGGCGCTCT CTTGGGAAGA GGGGCAGCAC GCCCAGGCCC TGGCCCTGCT CAACACCCTG
AGCGCCATCG AGGGGGAGCG CCTCTACGTC CTCCTCGGGC GCGGCGAGAT CTTGCGCGCC
ATCGGCGACA CCGAGGAGGC ACTGGCCACC TACCGGGAGG CCCGCTCCCT GTACCCGGGT
AGCTGGGCCG CCACCTACCG ACTGGCCGAG ACCCTGCTGG CCGACGACGA CGCCAAGGAG
GCGCGCCGGG TGCTGGCCCG CGCGACCCGC GGGTCGTCCG GATCGCCTCA GCTCCTGCGG
CTGCTGGCCG ACGCCGCCCA CGCCGCCGGC CGCGAGGCCG AGGGTTACAT CGCGCTGGCC
GAACACTACC GGGGCCGCGG CGAACACCGA CTGGCCGTGG CGCAACTGAA CAATGCCATC
CGCCACGCCG GCGAAGACCG CTACCAGCGC GCCCGCGCCG AGGCACTCAA GGCGCGCTGG
ACGCAGCACG CCGCGGACTA G
 
Protein sequence
MLDRLRRTAA ALLIAALTLT APAPSPVQAE SAQLPQLGVP GADALPVHKE RELGAKIMRQ 
VRQHLPLHED PETNEYLQNL GHRLAAHSNE PGFGYSFFLV EDDQINAFAL PGGYIGLHTG
LIRETRTESE LAGVLAHEIA HVTQRHIARQ YAQSQQLNLQ TAAAVLAAIL IGSQSPQAGS
AAAMAGIAAP IQQQLSHSRT HEQEADRVGL HNLVAAGLDP YGMPGFFERL ADASRFAEDP
PEYLSTHPLT ERRLNEAQRL AERLEGGTVY ESGHHAFIRA RQQVLTNRER STSAVAFMRD
QLRRSTDDPT ERAAALYGLA LALSWEEGQH AQALALLNTL SAIEGERLYV LLGRGEILRA
IGDTEEALAT YREARSLYPG SWAATYRLAE TLLADDDAKE ARRVLARATR GSSGSPQLLR
LLADAAHAAG REAEGYIALA EHYRGRGEHR LAVAQLNNAI RHAGEDRYQR ARAEALKARW
TQHAAD