Gene Dret_0327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0327 
Symbol 
ID8418131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp408030 
End bp410000 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content61% 
IMG OID645036892 
Productpeptidase U32 
Protein accessionYP_003197207 
Protein GI258404465 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGATA TATCCGCCTT CAAGCCCGAA ATTCTGGCCC CGGCCGGCAA CAAGGAAAGC 
TTCCTGGCCG CTGTCGATGC CGGAGCAGAC GCCATATATT GCGGCCTGAA ACATTTTTCC
GCCCGCATGG AAGCCGACAA TTTTGCTCTG CGCGACCTGA GCCGCCTGCA GGGCCTGGCT
CAGAAACACG GCACCCGAAC CTACGTCGCC CTGAACACCC TTATCAAACC CAATGAGTTG
GACCAGGCCG GACGGCTTCT GGACCAATTG ACCCGCTACG TGCACCCCGA CGCGCTCATC
GTCCAGGACC CGGCCGCCCT CAAGCTAGCC CGACAGGCCG GCTTCACCGG CGAACTGCAC
TTCTCAACCC TGGCCAATAT CGGGCTCGCT TCAGCCCTGC CCAGTGTTTT GGCCCTGGGC
GCAGACCGCG TTGTTTTGCC CCGGGAATTG ACCGTCGACG AAATCAAAAC CTTGGACACT
GCCTGTCCCG ACGCCCTCGA TCTGGAAGTC TTTGTCCACG GTGCCCTGTG TTACGCGGTC
TCGGGCCGGT GCTACTGGTC GAGTTATCTG GGCGGGAAAA GCGGTCTGCG CGGGCGGTGC
GTCCAACCCT GCCGAAGGAT CTATACCGCT GGCGGACCGC CCCAGCGGCT TTTCTCCTGC
CAGGACTTGG GTCTGGACAT ATTGACCAAG GCCCTTTTGG AGGCCCCGCG GGTCAAAGCC
TGGAAAATAG AGGGGCGTAA GAAGGGCCCC CATTATGTGA CCTATACGGT CCGGGCCTAT
GCCCTGCTCC GGGACAACCC CCTGGATCCG CAGGCCAAGA AGGAAGCGGT CGACCTTCTG
GACCAGGCCT TGTCGCGGCC GACCACCCAC TACAGTTTTC TGCCCCAGCG CCCCCACCCT
CCGGTCACCC CGGACCAATC GACAGCCTCG GGAGGATATC TCGGCCGCTT GCAGGCCAAG
AACACCAAAA TCCAGCTCCG TCCTCACAAA GCGCTCTTGC CCCAAGACCT GCTTCGGATC
GGATATGAGG ACGAACCGGG ACACCGGATT TACAAGGTCA GCAAATACAT CCCCAAAGGG
GGGCTGTTGA CTATCCAAGG ACGCAACACA GAGAAACAAA AAGGGATGAG CGTCTTTTTG
ATCGACCGGC GCGAACCCGA ACTGGTGCAG CGCCTGAAGA AAATGGATCA GGAATTTCAG
GCCGAGCCCG CTCCTGAAAT CACCCCTTCC TCCTTCACCC CGACGCTGCC CGGAACCCGG
ACTCAGGCTG GCCGACCCAG CCTGGTCCGC GTCTATCGTA ACCTGCCGCC CGGCAAGACC
GGCAAGACGC CAGGACTGTG GGTCCAACCC AAACCGCCGC GGGGGATCAG CTCCACACGC
TACAGCCATA TCTGGTGGTG GCTGCCTCCA GTGCTCTGGC CCGAAGAGGA GCGACTGTGG
CAAGAGACGA TCGACCGTTT GCTGGCCCGC GGCGCGAAGC GCTTTGTTTT GAACGCCCCC
TGGCAGGTCG GATTCTTTAA CGATTCCGAG GATCTGATGC TCTGGGCCGG ACCGTTTTGC
AACCTGGCCA ATGCCCTGGC CCTGGAAAGC ATGGTCGAAC TCGGATTCAG CGGGGCTTTC
GCCAGCCCTG AACTCGCCGG ACCAGATCTC CTTGCCCTGC CCGGGCAGAG TCCGCTCCCA
ATGGGCTTGG TCCTCGGCGG AGCGTGGCCG TTTTGTCTCT CCCGGGTGCA GCCCCCCCAT
CTGGAGCCCG GCCAACGCGT GACCAGTCCG AAAAAAGAGA TATCCTGGAC CAAACGGTAT
GGGCCGACGA CTTGGCATTA CCCCAATTGG GACCTGGACC TCTACAAACA CCAGCGCGAG
TTGGAGCAGG CCGGATACAC CCTTTTTGCC CATCTCTATG AGCCGCGCCC CAAATCCGCG
CCCCGGCCCC ACCGGACCTC AACCTTCAAT TGGGATCTGC AACTCTTGTA G
 
Protein sequence
MPDISAFKPE ILAPAGNKES FLAAVDAGAD AIYCGLKHFS ARMEADNFAL RDLSRLQGLA 
QKHGTRTYVA LNTLIKPNEL DQAGRLLDQL TRYVHPDALI VQDPAALKLA RQAGFTGELH
FSTLANIGLA SALPSVLALG ADRVVLPREL TVDEIKTLDT ACPDALDLEV FVHGALCYAV
SGRCYWSSYL GGKSGLRGRC VQPCRRIYTA GGPPQRLFSC QDLGLDILTK ALLEAPRVKA
WKIEGRKKGP HYVTYTVRAY ALLRDNPLDP QAKKEAVDLL DQALSRPTTH YSFLPQRPHP
PVTPDQSTAS GGYLGRLQAK NTKIQLRPHK ALLPQDLLRI GYEDEPGHRI YKVSKYIPKG
GLLTIQGRNT EKQKGMSVFL IDRREPELVQ RLKKMDQEFQ AEPAPEITPS SFTPTLPGTR
TQAGRPSLVR VYRNLPPGKT GKTPGLWVQP KPPRGISSTR YSHIWWWLPP VLWPEEERLW
QETIDRLLAR GAKRFVLNAP WQVGFFNDSE DLMLWAGPFC NLANALALES MVELGFSGAF
ASPELAGPDL LALPGQSPLP MGLVLGGAWP FCLSRVQPPH LEPGQRVTSP KKEISWTKRY
GPTTWHYPNW DLDLYKHQRE LEQAGYTLFA HLYEPRPKSA PRPHRTSTFN WDLQLL