Gene Hhal_2368 details

Gene Information

Locus tag: Hhal_2368
Symbol:
ID: 4709083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2601254
End bp: 2603200
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 71%
IMG OID: 639856843
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001003933
Protein GI: 121999146
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
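
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon (the gene sequence in this record ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2601254, 2603200)
print(length_bp)                  # 1947
print(protein_length(length_bp))  # 648
```

Both results agree with the Gene Length and Protein Length fields listed in the record.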


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCGGC CGAGCCGGTT GTCCCTGGCC CTTGGCGGCG CATTGCTGCT GGCCGGCCCC 
GGCCCGGTGA GTGCCCCCGA GGGCGCGCCG CCGAACTCCC CGTCGTCGTC CGCTGAATCA
CCCCAAGAGC TGCTGGTACG GTTCGCGCCG GATGTGGCGG AGGGCGTCCG CACTGCCACC
CACCGCGCCT ACGGCGGCCA CACCAAGCGT CGCCATGAGC GGGGTCGGTT CGAGGTCGTG
CCGCTACCGC CTTACGCCGA CCGCGAGGAG GCCCTGCGCA AATACCGCGA CGATCCCCAC
GTGGAGCACG CCGAGCCCAA CCGGTACGTG GAGCGCGTCG CGGCGCCTGC CGACGCCGCG
AGCGATGCCA ACGGGTGGTG GCAGGAGCGC ATCCGCCTCA CCGAGCTGGA GACAGCAGAA
CGCAAGGCCG GGGACACCAT CGTCGGCATC CTGGATACCG GCATCCAGTG CGACCACCCG
GCCCTGGCCG ACAACACCTG GGACGACGGC GACGGTCAGT GCGGGAAGAA CTTCATCGAT
CCGGACACCC CCCCGGACGA CGACTCGGAT CGGGGACACG GGACCCACGT GGCCGGCATC
ATCGGCGCGA ACAGCGACGA GATGACCGGC GTCGCCCGAT CCGTCCAACT CCAGGCCCTG
AAGTTCCTGG GGTCACTGGA CGACGGCACC CTCGCCGATG CCATCGAGGC CATCGACTAC
GCCATCGAGC AGGGGACGGA CGTCCTCAAT GCCAGCTACG CCTACACGGC GAGCCGTACC
GACGACGGCC CCCTGCCTAC CAGTTGCGCG GATCTCGCCG ATACCATGGA GGGGGCTTCG
CGCCTGCATT GCGAGGCCGT TGCCGACGCC GGCGAGGCCG GGATCCTCTT CGTGGCAGCG
GCACACAACT CCGGTAACGA CAACGACACC GGCACGGTCG CGCTGCCGGC AGGCTACCCG
CTGGATAACG TGATCGCGGT GGCCGCCAGC AGGGAGACCG CGGCGGGCGA GCCGAGCGAC
CAACTGGCCG ATTTCTCGAA CTTCGGGCGG CAAACGGTCC ACCTGGCAGC GCCCGGCGTG
GGCATCCACA GCACCGTGGC CGGGGATGAC TACGACGAGC TTTCGGGCAC CTCCATGGCC
ACCCCGATGG TCGCCGGGGT TGCGGCGCTG CTGCTCGATC AGGCCGGGTC GGAGGCCTCG
CACCTCACGA TCCGTGAGCG GCTACTCGGG TCCGTGGCCT GCGACCGCGA CGCCGACCCG
AGCCTGCCAC GCTGCGATGG GAACCCGCTC GGCGACAGCG CCGGCCAACA GCTGGCCGCC
AAGACCCTGT CCGGGGGCCG GCTCGACGCC GCCGCAGCCC TGGGCGCTGA TCCGGATACG
GTGCCCCCCG TTCCGCCGAG CCACGTCAGC ATCGAGACCG CCTCCGGCAT CCCGGAGCTG
CGTTGGCTGT CCTCCAGCCC CACCGCCCGG GGCTACCGCA TCGAGCGTTT CGACACCGCC
GAGGGGACTT TCCGGCATGT GGCCACCGTC GAGCCGGACC GGGATCGGTA CCGGGATCGC
AGCGCCCCGA GCGATACCCT GCTCGGCTAC CGCATCCGCG CCCTGGGCAG CGACGGGCAA
CCGAACTCGC GCTGGACGGA GGCCGGAACC GTCGAGACCG ACCAGGCCCT GAACCGAGTG
ACGGAGCAGT TCGCAGGCTT CAGCAGTGCC GAGCGGGATG AGCGGTGTTT CATCGCCACG
GCCGCCTACG GCTCCGAGCA GGCGCACCAG GTCGAGGCGC TGCGCGATTT CCGGGACGAC
TACCTGATGC CCCACCCGCC CGGCCGGGCG CTGGTGGCGG CGTACTACGC GGTCAGCCCG
CCCATCGCCG AGTGGGTCGC CGCCGAGGAA CACCGCCAGC GCTGGGTCCG GCGGCTCCTG
GCCCTGTTGC CGACTCAAAA GGAGTAG
 
Protein sequence
MNRPSRLSLA LGGALLLAGP GPVSAPEGAP PNSPSSSAES PQELLVRFAP DVAEGVRTAT 
HRAYGGHTKR RHERGRFEVV PLPPYADREE ALRKYRDDPH VEHAEPNRYV ERVAAPADAA
SDANGWWQER IRLTELETAE RKAGDTIVGI LDTGIQCDHP ALADNTWDDG DGQCGKNFID
PDTPPDDDSD RGHGTHVAGI IGANSDEMTG VARSVQLQAL KFLGSLDDGT LADAIEAIDY
AIEQGTDVLN ASYAYTASRT DDGPLPTSCA DLADTMEGAS RLHCEAVADA GEAGILFVAA
AHNSGNDNDT GTVALPAGYP LDNVIAVAAS RETAAGEPSD QLADFSNFGR QTVHLAAPGV
GIHSTVAGDD YDELSGTSMA TPMVAGVAAL LLDQAGSEAS HLTIRERLLG SVACDRDADP
SLPRCDGNPL GDSAGQQLAA KTLSGGRLDA AAALGADPDT VPPVPPSHVS IETASGIPEL
RWLSSSPTAR GYRIERFDTA EGTFRHVATV EPDRDRYRDR SAPSDTLLGY RIRALGSDGQ
PNSRWTEAGT VETDQALNRV TEQFAGFSSA ERDERCFIAT AAYGSEQAHQ VEALRDFRDD
YLMPHPPGRA LVAAYYAVSP PIAEWVAAEE HRQRWVRRLL ALLPTQKE
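
Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch of that cleanup and of a GC-percentage calculation for the nucleotide sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "ATGAACCGGC CGAGCCGGTT"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 65.0
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.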