Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_1909 |
Symbol | |
ID | 8419752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 2192636 |
End bp | 2193721 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645038495 |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_003198771 |
Protein GI | 258406029 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00508883 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.334757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTTC TGGGCATTGA GACCTCTTGC GACGAAACCG GCCTGGCCGT GGTTGAGGAG GGGCAGCTGA TCGCCCAGGA ACTGGCCACG CAAGGGGCGA TGCACTCGGT TTTCGGGGGG GTGGTTCCGG AACTGGCCTC CCGGGAACAC CTCCGGGTCC TGGACCCGTT ATGGCAGTCC TTGTTGCAGC GCACCGGCCT GGCCCCGGAG GATTTCGACG TTGTGGCCGT GGCTCGCGGC CCCGGACTTT TGGGCAGTCT GCTGGTCGGT CTCGGCTTTG CCAAAGGGCT GGCCCTGGCG ACTGGGGCGC GGCTGATTGG GGTCAACCAT CTGCTGGGAC ACCTCCTGGC CCCGGGACTG GAGCGGGAGT TGTGTTTCCC CAGTCTCGGG GTGCTTGTTT CGGGCGGGCA TTCGCAATTG TATGCCCTCA ACAGCCCGAC AGAGGCCACG ATGCTCGGCT CGACCCTGGA TGACGCGGCG GGTGAAGCCT TTGACAAGCT GGCGAAACTC CTTAATCTTC CGTATCCCGG CGGCAAGCAC ATCGATGAGT TGGGAGCGCT TGGTGTCCCG GATCCGGAAC TTTTGCCCAT CCCCTACGTG GACAACCGCA ATCTGGATTT CAGTTTCAGC GGGTTGAAGA CGGCGGCGGC CCAGCACATC CAGCGCCACC CGGACCTGCG GCTGCCCGCT ATGCCCGACG TGGACGAGGT GACCGAAATC GGGCGGCAGC GGCCCGAGCT GAGCCGGCTG TGTGCTTCGT TCAACCACGC CGTGGCCCGC GCCCTGTGCA TCAAGGCCAA GCGGGCCCTG GAGCGGGGCC CTCAGGCCCG GCAACTGATC GTGGCCGGAG GGGTGGCGGC CAACAGCGCA GTGCGACGGG AGATGGCTTC CCTGGCCCAG GAGATGGGTG TTGAACTTGT CTTGCCGTCG CTGTCCTTGT GTACGGATAA TGCGGCGATG GTGGCCTATA CCGGAGCCTT GTTCGCTGAG GCCGGACTGG GGCATGATCT CGATCTCGAA GCGGTCCCGC GGGGGCGGCA ACTGCCTTGG GATTATTGTC GATTCCCGAA CGCCGAAGCG AGTTGA
|
Protein sequence | MRVLGIETSC DETGLAVVEE GQLIAQELAT QGAMHSVFGG VVPELASREH LRVLDPLWQS LLQRTGLAPE DFDVVAVARG PGLLGSLLVG LGFAKGLALA TGARLIGVNH LLGHLLAPGL ERELCFPSLG VLVSGGHSQL YALNSPTEAT MLGSTLDDAA GEAFDKLAKL LNLPYPGGKH IDELGALGVP DPELLPIPYV DNRNLDFSFS GLKTAAAQHI QRHPDLRLPA MPDVDEVTEI GRQRPELSRL CASFNHAVAR ALCIKAKRAL ERGPQARQLI VAGGVAANSA VRREMASLAQ EMGVELVLPS LSLCTDNAAM VAYTGALFAE AGLGHDLDLE AVPRGRQLPW DYCRFPNAEA S
|
| |