Gene Hhal_0821 details


Gene Information

Locus tag: Hhal_0821
Symbol:
ID: 4709097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 902573
End bp: 903871
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 73%
IMG OID: 639855280
Product: peptidase M48, Ste24p
Protein accession: YP_001002399
Protein GI: 121997612
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
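
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # All codons except the terminal stop codon encode one residue each.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(902573, 903871)
    print(length)                     # 1299
    print(protein_length_aa(length))  # 432
```

Both results agree with the Gene Length (1299 bp) and Protein Length (432 aa) values listed in the record.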


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.371859
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGGCA GACGGGCGTT TCAATGGCTG ATCCCGGCGG TGCTATTGGC GCTGAGCGTG 
GCTAGCTGCG CCACCAACCC GGTCACCGGC GAGCGGGAAC TGCGGCTGAT CTCCGAGGGC
GAGGAGGTCG CCATGGGCGA GCAGCACTAC GAGCCCACCC TGCAGAGCAT GGGCGGACGC
TACAACGCCG ACCCGGACCT GGTCGCCTAC GTCGATGAGG TCGGCCAGCG GGTGGCCGCC
GAGAGCCACC GCCCGGGGCT GCCCTACGAG TTCGTGGTGC TCAACGACGG CACCCCCAAC
GCCTGGGCCC TGCCCGGCGG TAAGATCGCC ATCAACCGTG GCCTGCTCAC CGAGATGGAG
AATGAGGCCG AGCTGGCCGC GGTGCTTGGC CACGAGATCG TCCACTCCGC CGCCCGCCAC
GGCGCCCAGC GCGTCGAGCG CGGGATGATG ATGCAGGCCG GGGTGGCCAC CGTCGGCCTG
GCCACCCAGG ACCACCAGCT CTCCGGACTG CTGGTGGCCG GGGCCAGCGT CGGTGTGGGC
TTGATCAGTC AGCGCTACTC GCGGCAGGCG GAGCTAGAGG CGGACGACTA CGGCACCCGC
TACATGGCCC AGGCCGGCTA CGACCCCGAG GCCGCCGTCA CCCTGCAGGA GAAGTTCGTG
CGCCTGGCCG GGGGCGGGGA GTCGAGCTGG CTCGAGGGGC TGTTCGCCAG CCACCCGCCG
TCCCGGGAGC GCGTGCGCGC CAACCGCGAG ACCGCCCAGA CCCTGCGCGA GGAGCTCGGC
GGCGAAGACT GGACCCTGGG CGAGGAACGC TACGCCCGGC ACATGCGGGT CCTGGAGGAG
AACCGGGAGG CCTACGCCCA GCTGGATGAG GCGCAGCAGG CGCTACGCGC CAAGGAGCCC
GAGCGGGCCC TGGAGCTGGC CGACGCGGCC ATCGACGCCT ATCCCGAGGA GGCCGCCTTC
CACGCCGTCC GCGGCCAGGC CCTGGCCCGC ATGGGCGAGG AGGCATCGGC CATCGCCGCC
CTGGATGCCG CCATTGAGCG CAACGACGGC TACTTCAGCT ACCACCTCGA CCGCGGCCTG
CTGCACCGGG CCCGCGGCGA CGACGAGCGC GCCCGCACGG ACCTGGAGCG CTCGGCCAGC
CTGCTGCCCA CCGCGCCCGC CCACCTGGCC CTGGGCCAGC TGGCCGAGGC CGACGGCGCC
CGGGCGGACG CCATCGGCCA CTACGAGAAG GCGGCCAGTG CGGAAGGCTT CTTCGGGGAG
CGGGCGCGGG AGGCGCTCAG CCGTCTGCAG GACGGCTGA
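
The GC content field in the record could be recomputed from the gene sequence above. The following is a minimal sketch in Python, assuming the sequence is pasted as printed, with spaces between 10-base groups and line breaks. The function name `gc_fraction` is hypothetical, and how the database rounds the percentage is not stated in the record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    # Remove the spaces and line breaks used to group bases in the listing.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First 10-base group of the gene sequence: 6 of its 10 bases are G or C.
    print(gc_fraction("ATGGACGGCA"))  # 0.6
```

Passing the full gene sequence to the same function gives the fraction for the whole gene, which can be compared with the reported value.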
 
Protein sequence
MDGRRAFQWL IPAVLLALSV ASCATNPVTG ERELRLISEG EEVAMGEQHY EPTLQSMGGR 
YNADPDLVAY VDEVGQRVAA ESHRPGLPYE FVVLNDGTPN AWALPGGKIA INRGLLTEME
NEAELAAVLG HEIVHSAARH GAQRVERGMM MQAGVATVGL ATQDHQLSGL LVAGASVGVG
LISQRYSRQA ELEADDYGTR YMAQAGYDPE AAVTLQEKFV RLAGGGESSW LEGLFASHPP
SRERVRANRE TAQTLREELG GEDWTLGEER YARHMRVLEE NREAYAQLDE AQQALRAKEP
ERALELADAA IDAYPEEAAF HAVRGQALAR MGEEASAIAA LDAAIERNDG YFSYHLDRGL
LHRARGDDER ARTDLERSAS LLPTAPAHLA LGQLAEADGA RADAIGHYEK AASAEGFFGE
RAREALSRLQ DG
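
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch in Python with hypothetical names. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted), and it does not apply any special handling to alternative start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in the listing.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    first_line = ("ATGGACGGCA GACGGGCGTT TCAATGGCTG "
                  "ATCCCGGCGG TGCTATTGGC GCTGAGCGTG")
    print(translate(first_line))   # MDGRRAFQWLIPAVLLALSV
    print(translate("GACGGCTGA"))  # DG
```

The first call translates the first line of the gene sequence and yields the first 20 residues of the protein sequence. The second translates the last nine bases and yields the final two residues, with the TGA stop codon left untranslated.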