Gene Hhal_0718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0718 
Symbol 
ID4711059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp802972 
End bp804246 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content70% 
IMG OID639855181 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_001002302 
Protein GI121997515 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTAG CACACGCGAT GCAGCATCGA CGCGCGCCGC GGATGGTGGT GGCGGCGCTG 
GCGACCGCTC TGATCGGCCT GGTGGCCACC CCGGCGCTGG CCGACGACCT GGAGCACCTC
CAGCCGGATG AGCGCAACAC GGTGGAGATC TTCCAGCGCT ACGGCCCCTC GGTGGTGGCC
ATCGAGGTCG AGGTCCGCGG CGAGCGGGTC GACCCCTTCG ACCGCATCCC CGAAGGGATG
CTCCCGCGGG AATTCCGCGA GTTCTTCGAG CGCCGGCAGC AGCCCCGCGA GGACTCGCCC
CGCCGCCAGG GCGCCGGGAG TGGCTTTCTG GTCGATGATG CGGGGCATAT CGTCACCAAC
TACCACGTCA TCCGCAACGC CCTGGAGGAG GAGAGCGTGG ATCTGCGCGA GGGCGCTTCC
CTGAAGCTCA GCTTCGCCGA GCACGAAGCG GTGCCGGCCC GAGTGGTGGG CGCCAACGCG
CTCTACGACC TGGCCCTGCT CAAGCCCGAG GATCCGGACA GCATCCCGGA CGGCGCCGAG
CCGCTGCCGC TGGCCGACTC GGATCAGACC CTGGTGGGCC AGAAGACCAT CGCCATCGGC
AACCCCTTCG GGCTGAGCTC TACGGTGACC ACGGGGATTG TCTCCGGTGT CGGCCGGGAT
CTGCCGGGGA TCGGGCAGAT CGAGATCCCC ATGATCCAGA CCGATGCGGC GATCAATCCG
GGCAACTCCG GCGGGCCGCT GCTCAACTCC GCCGGAGAGG TGATCGGTGT GAACACCGCC
ATCGTGCCGG GCGGCGGCGG GCTGACCGGC CGGGGCGGCT CGGTGGGCGT GGGCTTTGCC
GTGCCCAGCA ATCTGCTCCA GGAGAGCCTG GACGAGATGG AGGAGGGCGG TCTGACGGAT
CTGACCTCCC GGGCCCGTCT GGGGGTGATG GTGGCCGGTC TGCAGGGCTA TCCGGAGGGG
GTCCGCGAGC GGCTGAATCT CCCGGAGCGC GGGGTGATGG TGGTGGATGT CGAGTCGGGC
AGTCCGGCCG AGGAGGCGGG GCTGCAGGGG GCGTCGTTCG AGGTGAGTGT CGAAGGGCGT
GCCATGCCGG CCGACGGGGA TGTCATCACC CACGTGAACG GTGAGGCGGT GTCGGAGCCG
CGGGAGTTGC AGCGGTTGGT CTTCGCCCGT CGGGCGGGGG ATGCGGTGAC GCTGACGGTC
CTGCGCGACG GTGAGGAGCG GAAGTTCGAG GTGGAGCTCC GTGAGGTGCC GCGGGAGCAG
CAGCGGCGCC GGTAG
 
Protein sequence
MTVAHAMQHR RAPRMVVAAL ATALIGLVAT PALADDLEHL QPDERNTVEI FQRYGPSVVA 
IEVEVRGERV DPFDRIPEGM LPREFREFFE RRQQPREDSP RRQGAGSGFL VDDAGHIVTN
YHVIRNALEE ESVDLREGAS LKLSFAEHEA VPARVVGANA LYDLALLKPE DPDSIPDGAE
PLPLADSDQT LVGQKTIAIG NPFGLSSTVT TGIVSGVGRD LPGIGQIEIP MIQTDAAINP
GNSGGPLLNS AGEVIGVNTA IVPGGGGLTG RGGSVGVGFA VPSNLLQESL DEMEEGGLTD
LTSRARLGVM VAGLQGYPEG VRERLNLPER GVMVVDVESG SPAEEAGLQG ASFEVSVEGR
AMPADGDVIT HVNGEAVSEP RELQRLVFAR RAGDAVTLTV LRDGEERKFE VELREVPREQ
QRRR