Gene Information |
Locus tag | Dret_0837 |
Symbol | |
ID | 8418655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 993291 |
End bp | 994721 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 645037405 |
Product | protease Do |
Protein accession | YP_003197706 |
Protein GI | 258404964 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
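The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a gene length that includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(993291, 994721)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Both results match the Gene Length (1431 bp) and Protein Length (476 aa) fields of this record.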
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.134567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.708087 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAAA TTGCTTCGAC AATCGTGTGC GGGCTGGCCC TGCTGGTGTT AAGCGGCAGT ACGGCCCTGG CTCAACTCCC GGAATTCACC GAATTGGCCA AGTCGGCCGG CAAGGCTGTG GTCAATATCA GTACGGTCAA AACAGTCGAC CAATCCCAGG GGGTCGAGGA GTTTTTTAAT CGTTTCCACC GTCGTGGTGG CCCTTTTGAG GATTTTTTCG ATCAATTTGA ACGCTTCTTT GGGCCCCAGC AGATGCCCAA ACGCCAGCAG CGGTCGCTGG GCTCGGGGTT TATCATGTCC CGGGACGGCT ATATCGTGAC CAACAACCAT GTTGTTGAGC AGGCGGACAA AATCACCGTC AATCTTCAGG GAGGGGAGAC CTCCTACCAG GCCGATATTG TTGGTCGGGA TCCTGAAACC GATCTGGCGC TTTTAAAGAT CGAGGTCGAT CGCGAGTTGC CAGTTCTCGA ATTCGGAGAT TCCGGAGAGA TGGAAATCGG TGACTGGGTT ATGGCCATCG GCAATCCTTT TGGCCTCGAC CACAGCGTGA CCGCAGGCAT CATCAGCGCC AAAGGACGAG TCATCGGTGC CGGTCCGTAT GATGATTTCT TGCAGACTGA TGCTTCGATC AACCCCGGCA ATAGCGGCGG CCCGCTCCTG AACACCGACG GTAAGGTCAT CGGCATCAAT ACCGCGATCA TTGCCAGCGG CCAGGGCATC GGCTTTGCCA TACCGTCTGA TATGGCCAAA CAGGTTATTG CGCAACTCAA GAAATACCAG AAGGTCAAGC GTGGTTGGTT GGGTGTGACC ATCCAGGACG TGGACGAAAA CATGGCCAAA GCTCTTGGTC TTGACGCGCC CAAAGGCGCC CTGATTGCTG GCGTCCGGGC CGGTGATCCG GCCGATGAGG CAGGTCTTAA GGCAGGTGAC GTGGTCGTCT CCCTCAATGG CGAGCCGGTG GAGGATGCCG ACGGATTGAC TCGTCGTATC GGGCGCATGG AGCCAGATAC AAAAGCGAAT ATGACGATCT GGCGCCAGGG AAAGGTCAAG AAAATCGCCG TCGTGCTTGG CGAGCGGGAC ACCGCCCAGG AAGAAGCTCG AGCCGAGCAA CCCGATTCTG AGCAAACCAG CGGCAGACTC GGCATCGTCG TCCGGCCGGT TCGCGATGAA GAGGCCCGAG CCCTGGGCAT GGATGAAGCC AGGGGGCTTT TGATCCAGGA TGTCGAACAG GCTTCCCTCG CCGCAGAGGC TGGGTTGCGC CCCGGAGACG TCATCCTGGC TGCTAATGGG CAAGAGGTAG AAACCGTTCG GGGATTGTCG CAGATCCTGA ATGAAGACGC CGCTGAGAAA GGGGCTGTTC TTTTCCTCGT CAATCGCAAG GGACAGAACC TTTTTGTAAG CATTCCCCTG ACTGACGGGG ATGGCCAATA A
|
Protein sequence | MRKIASTIVC GLALLVLSGS TALAQLPEFT ELAKSAGKAV VNISTVKTVD QSQGVEEFFN RFHRRGGPFE DFFDQFERFF GPQQMPKRQQ RSLGSGFIMS RDGYIVTNNH VVEQADKITV NLQGGETSYQ ADIVGRDPET DLALLKIEVD RELPVLEFGD SGEMEIGDWV MAIGNPFGLD HSVTAGIISA KGRVIGAGPY DDFLQTDASI NPGNSGGPLL NTDGKVIGIN TAIIASGQGI GFAIPSDMAK QVIAQLKKYQ KVKRGWLGVT IQDVDENMAK ALGLDAPKGA LIAGVRAGDP ADEAGLKAGD VVVSLNGEPV EDADGLTRRI GRMEPDTKAN MTIWRQGKVK KIAVVLGERD TAQEEARAEQ PDSEQTSGRL GIVVRPVRDE EARALGMDEA RGLLIQDVEQ ASLAAEAGLR PGDVILAANG QEVETVRGLS QILNEDAAEK GAVLFLVNRK GQNLFVSIPL TDGDGQ
|
| |
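The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below is a hedged illustration, not the database's own tooling: it strips the block spacing, computes GC content, and translates a coding sequence. It assumes that translation table 11 assigns codons to amino acids in the same way as the standard code (the tables differ in permitted start codons, which this sketch does not model). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base over "TCAG".
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGCGCAAAA TTGCTTCGAC AATCGTGTGC"))  # MRKIASTIVC
```

The ten residues produced from the first 30 bases agree with the first block of the protein sequence in this record. The same functions can be applied to the full gene sequence to compare against the Protein sequence and GC content fields.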