Gene Dret_0716 details


Gene Information

Locus tag: Dret_0716
Symbol:
ID: 8418529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 850833
End bp: 852059
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 64%
IMG OID: 645037280
Product: peptidase U32
Protein accession: YP_003197586
Protein GI: 258404844
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
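
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 850833
end_bp = 852059
reported_gene_length = 1227     # bp
reported_protein_length = 408   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1227 408
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.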


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0755954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.409296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCGAG GAGGATTGCC GGAACTCCTC GCTCCGGCAG GGGATTGGGA ACGGCTTCGC 
ACCGCCCTGG TGTACGGCGC CGACGCCGTC TATCTCGGTG GCAGCGGTCT GGATCTGCGC
GCGCAAAGCA AAGGATTCTC TCCGGGGGAA CTCCCCGCCG CTGTGGCTTT CGCCGCCAGA
CACAGCGCCA AGGTCTATTT TTGCTTGAAC ATCCTGGCCC GGCAACACCA TCTGTCCCAG
ATAGAAGCCA CCCTGGAGCA GCTGGCCGCC ACCCCGATAC ACGGCCTCAT CGTGGCCGAC
CCCGGCGTGA TCGCCCTGGC CCGGCGCCTG GCCCCGGGTA TCCCCTTGCA TCTGAGCACC
CAGGCCAATA CCTCGAACGC CGCCAGCATC GCCTTCTGGC GCGATTGCGG TGTCCAGCGG
GTCAACTTGG CGCGCGAACT CTCCGGCCCG GAAATGCGCC GCATCCGCAG GGAGGTGCCG
GACATGGAAT TGGAATCCTT TGTCCACGGC GCCCAGTGCA TGGCCATTTC CGGACGCTGC
CTGCTCAGCG ACCACCTCAA CGGGCGCTCC GCCAATCTGG GGGCCTGTAC CCACCCCTGT
CGTTTCGGCT ACCGGCGCCA CATTCTGGAG GAAGGAGTCC GTCAGGGCCA GCCGTGTTGG
GAGATTGAAC AGGACAGAGA TTTTACGCAC ATCCTGGCCG CCGAGGACCT GTGTCTGGTG
CCCTACCTGG CCTGGTTCGT CCACCAGGGG TGGCAGAGTC TCAAAATCGA AGGGCGAATC
AAGACCTGTT CCTATGTCGG CCAGGTCGTG GACGTCTACC GCACCGCTCT GGACGATATC
GCCGCCCGCC GTTTTCGGCG AGACACCTAC CTGCGGGAAT TGGAGCCAAG CGCCACCAGG
AATCTGGGAA CAGGATTTTT CCTGCCCCAT GCCCGCGGAC TGACACTTCG CGCCGCCCCC
AGCTACCGCA CCCCCATTGT GGCTCGTATC GAACGCGAAC TTGCGCCGGG AACCTGGGAA
ATCAGCGCCC GGCACCGCTT TGCCGTGACT GACGATATCG AAATCGTGGC TCCCGGGCTG
CAGCGCCCTT GCCTTGGCTC ATTCGGACTG GAAAAAGAGG ACGGAAACCG GATCGAGACC
ATCCACTCCG GGGTCCGCGG CCGACTGCGA AGCGACAATC CCGCCTTGCG CCCCGACCTG
CTGTTACGAG CCCGTCTGCA GGCCTAG
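
The GC content field reported for this gene can in principle be recomputed from the gene sequence above. The sketch below is one way to do so, assuming GC content is defined as the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper name, and the database's own rounding rule is not known.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that are only used for layout.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Short demonstration on the first line of the gene sequence.
first_line = "GTGAGCCGAG GAGGATTGCC GGAACTCCTC GCTCCGGCAG GGGATTGGGA ACGGCTTCGC"
print(round(gc_content(first_line), 1))
```

Passing the complete gene sequence instead of `first_line` gives the value to compare with the reported GC content.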
 
Protein sequence
MSRGGLPELL APAGDWERLR TALVYGADAV YLGGSGLDLR AQSKGFSPGE LPAAVAFAAR 
HSAKVYFCLN ILARQHHLSQ IEATLEQLAA TPIHGLIVAD PGVIALARRL APGIPLHLST
QANTSNAASI AFWRDCGVQR VNLARELSGP EMRRIRREVP DMELESFVHG AQCMAISGRC
LLSDHLNGRS ANLGACTHPC RFGYRRHILE EGVRQGQPCW EIEQDRDFTH ILAAEDLCLV
PYLAWFVHQG WQSLKIEGRI KTCSYVGQVV DVYRTALDDI AARRFRRDTY LRELEPSATR
NLGTGFFLPH ARGLTLRAAP SYRTPIVARI ERELAPGTWE ISARHRFAVT DDIEIVAPGL
QRPCLGSFGL EKEDGNRIET IHSGVRGRLR SDNPALRPDL LLRARLQA
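
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, which permits GTG as an initiation codon read as methionine. The following is a minimal sketch of such a translation, not the database's own pipeline; `translate_cds` and the constant names are illustrative, and the codon table is built from the standard NCBI codon ordering.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()  # remove layout whitespace
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as methionine
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("GTGAGCCGAG GAGGATTGCC GGAACTCCTC"))  # prints: MSRGGLPELL
```

The ten residues produced match the start of the protein sequence listed above.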