Gene Dret_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0844 
Symbol 
ID8418663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1001183 
End bp1002595 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content59% 
IMG OID645037413 
Productpeptidase M48 Ste24p 
Protein accessionYP_003197713 
Protein GI258404971 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000290245 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0785618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTGC TGCGTTGGGC ACTGATCGCT GTCGCCCTGA CAGGCCTGCT GTTTTCCGGC 
CCTGTTCAAG GGGCTCTTCT CGGGGACTTC ACTATTCAAG ACGAAGTTGA ATTGGGTCGT
GAAGTCAACC AATTCGTTCG TTCGCACTTT GATCTCCTCA ACGATCCGGT TGTTGCCGAG
TATGTCCGCT CGGTCACTGA GCGTATCGAG AGCCATCTTC CTCCCCAGCC TTTTCCCATC
AGCGTCACTG TTGTCAACGA TGAGAGCCTG AACGCCTTTG CCGCTCCAGC CGGGTACATG
TTTGTTCACA CCGGGTTGCT TTTGCATCTG GAATCCGAGG CCCAATTGGC CGGGGTTATC
GCCCACGAAT TGGCCCACGT GACCCAGCGC CATATCGCCC GAAACATTGA GCGGAGCCAG
TTGATCAATC TGGGGACTCT GGCCGGTATG CTGGCCGGCG TCTTTCTCGG AGGCGGCGGA
GAGGGGAGTG AAGCCCTGGC CATGGGCTCC CTTGCCGGGG GACAGGCCGC AGCGCTGAAA
TATTCGCGCG AGGACGAGCG TGAGGCCGAT CAGATAGGGG TCCACTATCT GCAAAACGCG
GGCTACCCTG TCCAGGGCAT GGTCGAGGCC TTTGAAGTCA TTCGCAAACG GAAATGGTTC
TCAGGCCATT CTTTGCCGAC CTACCTGAGC ACGCACCCCG GAGTGGGAGA ACGGATTGCC
GCCTTACAGG GCCGTGTTGA GGGAGAATCT TCTGGATCAT CGTCGTTGCA AAGCGGCACG
TCCCGTTTTG AGCGCATTCA AATGTTGGTC CGGGCCAGAT ACACCGACCC CAGCAGTGCC
CTTGTTGCTT TCAACGACAC CCAATCGGAA CAAGACACGT GTATGCTGGC CCTGGGCCGG
GCCATCGCCC TGGAACGCTT GCAACGGGTG GAGGAGGCCG GTCAGGCGTA TGAGCGAGCC
CTGGATTGTG CGCCGGACGA TCCTTTAGTC CAGCGGGAAG CCGGTCACTT TATGTATCTC
CAGGGCAAAC TGGACCGAGC CCGACGTTTG CTCAAGGCCG CATTGGACCA GCGCCACACC
GATACCCGGG CGATGTTCTG GTATGCCCAG ACATTGGCCC AGGCCGGAGA TCCAGAGCAC
AGTATCCGGA TTGGAGAAGA GGTCGTGCGT CGTGAGCCGC GTAATGCGCG GGCCCATGCT
TTTCTCGGCC GTCTTCAGGG GCAACGGGGG AAGCTGTTTG AAGCCCATCT CCATTTGGCC
TACGCGGCGC TCTATGGACA CGGCGCCTCG CAACTCCCGT TTCATATTCA GAAAACCAAA
GAGATGGCCC ATTCCGAAAA ACAGCGGCAG CGCCTCTCTT CCCTGCAGGA GGCTGTGGCG
GAGTCCAAGA AGTTGGAAAA GCTCACGCCT TGA
 
Protein sequence
MHVLRWALIA VALTGLLFSG PVQGALLGDF TIQDEVELGR EVNQFVRSHF DLLNDPVVAE 
YVRSVTERIE SHLPPQPFPI SVTVVNDESL NAFAAPAGYM FVHTGLLLHL ESEAQLAGVI
AHELAHVTQR HIARNIERSQ LINLGTLAGM LAGVFLGGGG EGSEALAMGS LAGGQAAALK
YSREDEREAD QIGVHYLQNA GYPVQGMVEA FEVIRKRKWF SGHSLPTYLS THPGVGERIA
ALQGRVEGES SGSSSLQSGT SRFERIQMLV RARYTDPSSA LVAFNDTQSE QDTCMLALGR
AIALERLQRV EEAGQAYERA LDCAPDDPLV QREAGHFMYL QGKLDRARRL LKAALDQRHT
DTRAMFWYAQ TLAQAGDPEH SIRIGEEVVR REPRNARAHA FLGRLQGQRG KLFEAHLHLA
YAALYGHGAS QLPFHIQKTK EMAHSEKQRQ RLSSLQEAVA ESKKLEKLTP