Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0327 |
Symbol | |
ID | 8418131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 408030 |
End bp | 410000 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645036892 |
Product | peptidase U32 |
Protein accession | YP_003197207 |
Protein GI | 258404465 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGATA TATCCGCCTT CAAGCCCGAA ATTCTGGCCC CGGCCGGCAA CAAGGAAAGC TTCCTGGCCG CTGTCGATGC CGGAGCAGAC GCCATATATT GCGGCCTGAA ACATTTTTCC GCCCGCATGG AAGCCGACAA TTTTGCTCTG CGCGACCTGA GCCGCCTGCA GGGCCTGGCT CAGAAACACG GCACCCGAAC CTACGTCGCC CTGAACACCC TTATCAAACC CAATGAGTTG GACCAGGCCG GACGGCTTCT GGACCAATTG ACCCGCTACG TGCACCCCGA CGCGCTCATC GTCCAGGACC CGGCCGCCCT CAAGCTAGCC CGACAGGCCG GCTTCACCGG CGAACTGCAC TTCTCAACCC TGGCCAATAT CGGGCTCGCT TCAGCCCTGC CCAGTGTTTT GGCCCTGGGC GCAGACCGCG TTGTTTTGCC CCGGGAATTG ACCGTCGACG AAATCAAAAC CTTGGACACT GCCTGTCCCG ACGCCCTCGA TCTGGAAGTC TTTGTCCACG GTGCCCTGTG TTACGCGGTC TCGGGCCGGT GCTACTGGTC GAGTTATCTG GGCGGGAAAA GCGGTCTGCG CGGGCGGTGC GTCCAACCCT GCCGAAGGAT CTATACCGCT GGCGGACCGC CCCAGCGGCT TTTCTCCTGC CAGGACTTGG GTCTGGACAT ATTGACCAAG GCCCTTTTGG AGGCCCCGCG GGTCAAAGCC TGGAAAATAG AGGGGCGTAA GAAGGGCCCC CATTATGTGA CCTATACGGT CCGGGCCTAT GCCCTGCTCC GGGACAACCC CCTGGATCCG CAGGCCAAGA AGGAAGCGGT CGACCTTCTG GACCAGGCCT TGTCGCGGCC GACCACCCAC TACAGTTTTC TGCCCCAGCG CCCCCACCCT CCGGTCACCC CGGACCAATC GACAGCCTCG GGAGGATATC TCGGCCGCTT GCAGGCCAAG AACACCAAAA TCCAGCTCCG TCCTCACAAA GCGCTCTTGC CCCAAGACCT GCTTCGGATC GGATATGAGG ACGAACCGGG ACACCGGATT TACAAGGTCA GCAAATACAT CCCCAAAGGG GGGCTGTTGA CTATCCAAGG ACGCAACACA GAGAAACAAA AAGGGATGAG CGTCTTTTTG ATCGACCGGC GCGAACCCGA ACTGGTGCAG CGCCTGAAGA AAATGGATCA GGAATTTCAG GCCGAGCCCG CTCCTGAAAT CACCCCTTCC TCCTTCACCC CGACGCTGCC CGGAACCCGG ACTCAGGCTG GCCGACCCAG CCTGGTCCGC GTCTATCGTA ACCTGCCGCC CGGCAAGACC GGCAAGACGC CAGGACTGTG GGTCCAACCC AAACCGCCGC GGGGGATCAG CTCCACACGC TACAGCCATA TCTGGTGGTG GCTGCCTCCA GTGCTCTGGC CCGAAGAGGA GCGACTGTGG CAAGAGACGA TCGACCGTTT GCTGGCCCGC GGCGCGAAGC GCTTTGTTTT GAACGCCCCC TGGCAGGTCG GATTCTTTAA CGATTCCGAG GATCTGATGC TCTGGGCCGG ACCGTTTTGC AACCTGGCCA ATGCCCTGGC CCTGGAAAGC ATGGTCGAAC TCGGATTCAG CGGGGCTTTC GCCAGCCCTG AACTCGCCGG ACCAGATCTC CTTGCCCTGC CCGGGCAGAG TCCGCTCCCA ATGGGCTTGG TCCTCGGCGG AGCGTGGCCG TTTTGTCTCT CCCGGGTGCA GCCCCCCCAT CTGGAGCCCG GCCAACGCGT GACCAGTCCG AAAAAAGAGA TATCCTGGAC CAAACGGTAT GGGCCGACGA CTTGGCATTA CCCCAATTGG GACCTGGACC TCTACAAACA CCAGCGCGAG TTGGAGCAGG CCGGATACAC CCTTTTTGCC CATCTCTATG AGCCGCGCCC CAAATCCGCG CCCCGGCCCC ACCGGACCTC AACCTTCAAT TGGGATCTGC AACTCTTGTA G
|
Protein sequence | MPDISAFKPE ILAPAGNKES FLAAVDAGAD AIYCGLKHFS ARMEADNFAL RDLSRLQGLA QKHGTRTYVA LNTLIKPNEL DQAGRLLDQL TRYVHPDALI VQDPAALKLA RQAGFTGELH FSTLANIGLA SALPSVLALG ADRVVLPREL TVDEIKTLDT ACPDALDLEV FVHGALCYAV SGRCYWSSYL GGKSGLRGRC VQPCRRIYTA GGPPQRLFSC QDLGLDILTK ALLEAPRVKA WKIEGRKKGP HYVTYTVRAY ALLRDNPLDP QAKKEAVDLL DQALSRPTTH YSFLPQRPHP PVTPDQSTAS GGYLGRLQAK NTKIQLRPHK ALLPQDLLRI GYEDEPGHRI YKVSKYIPKG GLLTIQGRNT EKQKGMSVFL IDRREPELVQ RLKKMDQEFQ AEPAPEITPS SFTPTLPGTR TQAGRPSLVR VYRNLPPGKT GKTPGLWVQP KPPRGISSTR YSHIWWWLPP VLWPEEERLW QETIDRLLAR GAKRFVLNAP WQVGFFNDSE DLMLWAGPFC NLANALALES MVELGFSGAF ASPELAGPDL LALPGQSPLP MGLVLGGAWP FCLSRVQPPH LEPGQRVTSP KKEISWTKRY GPTTWHYPNW DLDLYKHQRE LEQAGYTLFA HLYEPRPKSA PRPHRTSTFN WDLQLL
|
| |