Gene Dret_0837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0837 
Symbol 
ID8418655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp993291 
End bp994721 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content57% 
IMG OID645037405 
Productprotease Do 
Protein accessionYP_003197706 
Protein GI258404964 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.134567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.708087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAA TTGCTTCGAC AATCGTGTGC GGGCTGGCCC TGCTGGTGTT AAGCGGCAGT 
ACGGCCCTGG CTCAACTCCC GGAATTCACC GAATTGGCCA AGTCGGCCGG CAAGGCTGTG
GTCAATATCA GTACGGTCAA AACAGTCGAC CAATCCCAGG GGGTCGAGGA GTTTTTTAAT
CGTTTCCACC GTCGTGGTGG CCCTTTTGAG GATTTTTTCG ATCAATTTGA ACGCTTCTTT
GGGCCCCAGC AGATGCCCAA ACGCCAGCAG CGGTCGCTGG GCTCGGGGTT TATCATGTCC
CGGGACGGCT ATATCGTGAC CAACAACCAT GTTGTTGAGC AGGCGGACAA AATCACCGTC
AATCTTCAGG GAGGGGAGAC CTCCTACCAG GCCGATATTG TTGGTCGGGA TCCTGAAACC
GATCTGGCGC TTTTAAAGAT CGAGGTCGAT CGCGAGTTGC CAGTTCTCGA ATTCGGAGAT
TCCGGAGAGA TGGAAATCGG TGACTGGGTT ATGGCCATCG GCAATCCTTT TGGCCTCGAC
CACAGCGTGA CCGCAGGCAT CATCAGCGCC AAAGGACGAG TCATCGGTGC CGGTCCGTAT
GATGATTTCT TGCAGACTGA TGCTTCGATC AACCCCGGCA ATAGCGGCGG CCCGCTCCTG
AACACCGACG GTAAGGTCAT CGGCATCAAT ACCGCGATCA TTGCCAGCGG CCAGGGCATC
GGCTTTGCCA TACCGTCTGA TATGGCCAAA CAGGTTATTG CGCAACTCAA GAAATACCAG
AAGGTCAAGC GTGGTTGGTT GGGTGTGACC ATCCAGGACG TGGACGAAAA CATGGCCAAA
GCTCTTGGTC TTGACGCGCC CAAAGGCGCC CTGATTGCTG GCGTCCGGGC CGGTGATCCG
GCCGATGAGG CAGGTCTTAA GGCAGGTGAC GTGGTCGTCT CCCTCAATGG CGAGCCGGTG
GAGGATGCCG ACGGATTGAC TCGTCGTATC GGGCGCATGG AGCCAGATAC AAAAGCGAAT
ATGACGATCT GGCGCCAGGG AAAGGTCAAG AAAATCGCCG TCGTGCTTGG CGAGCGGGAC
ACCGCCCAGG AAGAAGCTCG AGCCGAGCAA CCCGATTCTG AGCAAACCAG CGGCAGACTC
GGCATCGTCG TCCGGCCGGT TCGCGATGAA GAGGCCCGAG CCCTGGGCAT GGATGAAGCC
AGGGGGCTTT TGATCCAGGA TGTCGAACAG GCTTCCCTCG CCGCAGAGGC TGGGTTGCGC
CCCGGAGACG TCATCCTGGC TGCTAATGGG CAAGAGGTAG AAACCGTTCG GGGATTGTCG
CAGATCCTGA ATGAAGACGC CGCTGAGAAA GGGGCTGTTC TTTTCCTCGT CAATCGCAAG
GGACAGAACC TTTTTGTAAG CATTCCCCTG ACTGACGGGG ATGGCCAATA A
 
Protein sequence
MRKIASTIVC GLALLVLSGS TALAQLPEFT ELAKSAGKAV VNISTVKTVD QSQGVEEFFN 
RFHRRGGPFE DFFDQFERFF GPQQMPKRQQ RSLGSGFIMS RDGYIVTNNH VVEQADKITV
NLQGGETSYQ ADIVGRDPET DLALLKIEVD RELPVLEFGD SGEMEIGDWV MAIGNPFGLD
HSVTAGIISA KGRVIGAGPY DDFLQTDASI NPGNSGGPLL NTDGKVIGIN TAIIASGQGI
GFAIPSDMAK QVIAQLKKYQ KVKRGWLGVT IQDVDENMAK ALGLDAPKGA LIAGVRAGDP
ADEAGLKAGD VVVSLNGEPV EDADGLTRRI GRMEPDTKAN MTIWRQGKVK KIAVVLGERD
TAQEEARAEQ PDSEQTSGRL GIVVRPVRDE EARALGMDEA RGLLIQDVEQ ASLAAEAGLR
PGDVILAANG QEVETVRGLS QILNEDAAEK GAVLFLVNRK GQNLFVSIPL TDGDGQ