Gene Hhal_2110 details


Gene Information

Locus tag: Hhal_2110
Symbol:
ID: 4710050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2314790
End bp: 2316043
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 69%
IMG OID: 639856584
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_001003676
Protein GI: 121998889
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DegQ family
            [TIGR02038] periplasmic serine peptidase DegS
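
The coordinate and length fields in this record are tied together by simple arithmetic. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates (the listed values are consistent with that convention) and a coding sequence that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length_bp(2314790, 2316043)
print(length, protein_length_aa(length))  # 1254 417
```

Both results agree with the Gene Length and Protein Length fields listed in the record.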


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0560997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGCTAG CAGGACGATC CGGTTGGCGG CCATTGGGCC TGGAGGGCAC CGAGTGGATT 
CGGTTTCTCC TCGGATACAC CGGGCTGGGC GTGGTGTTGG CCCTGGTCAT CGTCTGGATC
AACCCTGACC TGCTGGGCCC CCTGACGCCC CGGGTCGAGA TCACCGAGAG TGAAGATTGC
CAGACCGTCG CCCCGGGGCG ACAAAACGAC GCAGCCGATG CGCCGCCTCG ACGGGAACCG
GTCTCCTACG CCGACGCGGT GGAACGGGCC GCGCCGGCGG TGGTGAACAT CTTCACGGTC
AAGCAGGTCA CCGAACAGCT CACCCCGCCC GGATTCGACG ATCCCCTATT CCGCCGCTTC
TTCGGCGACC CTCCGACCCG CGAGCGCCAA CGCACCGAGA CCAGCCTCGG CTCGGGGGTC
ATCGTCGCCG AGGAAGGCTA TGTGGTCACC AACCACCACG TCATCGACGA CGCCGACCAG
ATCCAGGTAC TGCTGGCCGA CGGTCGCCAG AGGGCGGCCA CGGTGGTGGG GCGCGACCCG
GAGACGGACC TGGCAGTGCT GCGCATCGAG GCCGAACGGC TGCCAGTGAT CACCTTCGCC
CGGGACGAGC GGGTGCGGGT GGGCGACGTG GTGCTGGCCA TCGGCAACCC GTTCGGGGTC
GGACAGACGG TCACCCAGGG GATCATCAGC GCCACCGGGC GCGACCAGCT GGGGCTGTCG
ACCTTCGAGA ACTTCCTGCA GACCGATGCC GCCATCAACC CGGGCAATTC CGGTGGCGCC
CTGATCGACG CCGAGGGGCG GCTGGTGGGG ATCAATACCG CCATCTTCAG CGGCACCGGC
GGCTCACAGG GGATCGGCTT CGCCATCCCG GCGGGCATTG CCCAGGCGGT GATGTCCGAC
CTGATCCAGT ACGGACGCGT GGTGCGCGGC TGGCTGGGGG TACAGGCACA GCGGCTGACG
CCGGCCCTCG CCGAGTCCTT CGGCCACCCC CCGGATACCG AGGGCGTAGC GGTCACCCAC
ATCCTGCCCC GCGGACCGGC GGACCAGGCC GGCCTGGAGG CCGGCGACAT CATCGTCGAG
CTGGGCGGTC AGCGCATCCG CGACGTCCAG GATCTGCTGC AGGTGGCCAG CGCGGCGGCA
CCGGGCACCG AGATGGAGAT CTCCGGCTAC CGCGATCAGG AGCCTTTCTC AACCTCCGTC
ACCCTCGGTG AGCGGCCGGA CATGCAGCAG CCCAGCCACC CCGGGCGGCG CTAA
 
Protein sequence
MRLAGRSGWR PLGLEGTEWI RFLLGYTGLG VVLALVIVWI NPDLLGPLTP RVEITESEDC 
QTVAPGRQND AADAPPRREP VSYADAVERA APAVVNIFTV KQVTEQLTPP GFDDPLFRRF
FGDPPTRERQ RTETSLGSGV IVAEEGYVVT NHHVIDDADQ IQVLLADGRQ RAATVVGRDP
ETDLAVLRIE AERLPVITFA RDERVRVGDV VLAIGNPFGV GQTVTQGIIS ATGRDQLGLS
TFENFLQTDA AINPGNSGGA LIDAEGRLVG INTAIFSGTG GSQGIGFAIP AGIAQAVMSD
LIQYGRVVRG WLGVQAQRLT PALAESFGHP PDTEGVAVTH ILPRGPADQA GLEAGDIIVE
LGGQRIRDVQ DLLQVASAAA PGTEMEISGY RDQEPFSTSV TLGERPDMQQ PSHPGRR
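
The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, under stated assumptions: it uses the standard codon-to-residue assignments (translation table 11 shares these with the standard code and differs only in permitted start codons), it expects only the letters A, C, G and T, and the helper names `clean` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; translation table 11 uses the
# same codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for the 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence; compare with the start of the protein.
print(translate("ATGAGGCTAG CAGGACGATC CGGTTGGCGG"))  # MRLAGRSGWR
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) extends the same check to the whole record.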