Gene Information |
Locus tag | Dret_2349 |
Symbol | |
ID | 8420209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 2678808 |
End bp | 2681639 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645038951 |
Product | PDZ/DHR/GLGF domain protein |
Protein accession | YP_003199210 |
Protein GI | 258406468 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
Plasmid Coverage Information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGTGCGG GTATATTTGG ATTCTGGGGT CTGGTCCTGG TTGCGGGGAT GCTGCTTTTT ACAGCCGCAG GGTGCAAAAC CAGCGGGCCG GCAGCGCTCG GATCTTCGGC GGCAGATGCT TCGCCCTCGG TCGCCAGTGG TCCGTCCCTC TCCCAAACGC TTGCTTCGGG GTGGGACCAT CTTCTCGATG AATCCTTTAC AGACAACGCC CGCTCCTGGC CGGAGCAGGA TACTGATCGT GTCCAAAGCA GGGTCGCCGG AGGGGTGTAC ACCGTGCAAT TGGATGACCG TCTTGCCCAT CTGCAAGCGT TGACCGATGT TCGGCTCAAC GGACTTGCCG ATTTCAGGGT TGCGGCGACG TTCAATTACA AAAGCAGGGA CAAGAGTTAT GTCGGGCTCC TCTTCGGTGC CGCGGATGTG CGGCATATGT TCCGTTTCCG TCTGGAGGCC AGCGGGCGGG TGTCACTGAG CCGTGTGGCG GCGGGCGAGT ATACAACACT GGTGAAAAAA CAGGTCCCGG ACCTCCTGAA CGCCCCGGGA GGCGAGTACC ATCTGGCGGT GGTCCAGGAA GGCGACACTT GGCGGTGTGA AGTTAATGGG CGCGAAGTAT TCCGTCTCCC GGCCGAACCA GTTTATGGAG GCCGTGTCGG ACTGTATGCC TACGGCAAGC AGCGGTTGCG GGTTCACCGT CTGCAGGTGG CACGGGATGG CACGGGCCTG CAGGCAGTGC GGACCTGGAC CGGGCTGGGC GATCGCTTCG GATTCGAGCA ATACGCGTTC AGTGAAGACG GCCGGCGTTT GGCCACGTGG TCTGCTGTCG GCCCCCGGGG GCTGCTCGCC TTGTGGGATG TGACCAGTGT GGCCCGCCCT CTGGTGGGCT GGCGATTCGT GGACCATCCG GTGCGGCGAC GCAAAGGCGC GGAACCGGTC CATTTTTTCA ACACCAATGG GCAAAAGCGC TTCGCGGTCT CGGATGACGG CACCCTCGGA GCGGTGACTT TCTGGTTTGA AGGCGACACC TCCCGCTTGG TGCTGCAGGT CTTTCGCTGG GATGATCCCG CCACGCCCGT ATTCCATATC CAAAAACGAG CCCCCGGCCG GGTGGTTCTG CCCCATGGCG TGGCCCTGCG GCCGGACAAC GACATGGTCG CCTTGAACAC CGCGGTCCAG GGCACCCAGG GGCAGGTCTT TGCCAGCGGA GAGGTGGTTT TTTTCCCTCT GTATGACGGC GGCACGCCCT CCAAATTCAC ACCGCCGGCC GATGGGGCTG TTTTTATGCA GCATCCGCGC TGGAGCGCCG GGGAAGGGTT TTTCGCCGAG GTGCTCTACC GGCCCCAGGG GAAGGCGGGT GAGATGTACT GGGTCGTCTA TCCCTTTGGC GGCGGGGGCG ATACCGGTCC GCAGGCCTTG CGGGCCAAGG GCGAGACCAC AATCGCTTAT CGCCGCGCCC GGAGTCTGGA TGTCACCGCG GATGAGCGTC TGCTGAGCGT ATTGGAGCAA GAGGGGGGCG TGCGCTGGTA TCGCACCCAT GATCTCACCA CGCCCTGGGT GCGTGTGGAC CTGGACGACC AGGCCTATCT GGGCGTTTTC GGCGGCCGGG ACGAATGGGG GATGCTCACC ACCAGTCTGT ATTTTCAGCG CTTTGCGATC CAGGACGACC GTGTCGTGCC CCTGGGGCGT GAATGGTGTC CGTTTCTGGC CCAGGATATG ATTTATTCTC CGGACCGTGG AGGATGGCTC GTCGCCGGGG CGAAGCAGGT ACGATTGTAT CCCGATTACG CTCAGGCGGA GCATGATGCC 
GCCCTGGCTC TGCATGAAGC CGAGGAATTA CTCCAGGTGG GATTTGCTGA ACAAGCCGAG GCCAAGCTGC AACAGGCCTT GGACCTGGAT TACACCTTCC AGGCCAAGGA TACTGAAGAG CTGTATCTGA CCCTGTTGCC GCGGATGGCG GCCAGCGCCC GGGCCCGGCT GCCGGGACGA CTGGCCCTGG AGCAATACCA GCGCGGCAGA CAGGCGCCGA AAATCCCGGT GCTGGGCCTG CAGGTCCGAA GCCAGGATGG TGGAGTCGTG GTCCAGAACA TCCATGACGG CACTCCAGCT GTCGCTTCCG GCCTGCGGAC CGGGGACCAC ATCCTGCGTT TCCAGGATCG GGCGGAGACG GATGTGGCGA GCTTGGTGGA AGCGGTACGG GCCTGCACTC CGGGCCAGCG GGTCAATCTT CAGGTCCGTC GCGGGGATAC GCTACAGACG GTCTCCCTGG ACACCCTGGC CCGCTGGAAG GAGGACGCGG CTCTGGAGGC CACGGTGTAC GGGTTGTTCA ATTATGGGCT GTTTGCCGCC GAAGCGGGGC AACCGGCCCT GGTCAGTGCT GCGGCCGACG CATTCGACTC CCTGCTCCGG ACACACCCCG GGGCCGTCAA ACCTGAAAAG CTCAACGGCT TTGCCGCGAT ACTGCGCGCG CTCGCTCTGG CCGGGCAGGG GCAGCGTGAT GAGGCGCTGG CAGCCTTACT CGGGGTCTCT CTGGATGCGC AGCAGCACAA GTATATCCTC AAGCGCACGG CGGCTTTTGC TCCATTGTAC GGGGAACGGG ACAAGCTGGC GTATGTGCTG GATGTCGATG CCAACGAGAT TCCGCAGACT ATAGCGCAAG CTGCCAAGCC CCAGCCGTAC CCGGATTTGC AGGGACGGCT GGTTCGACCG CCCTCCGCAC CGGAATTGCA GGCACCGCTC CACACCCCGG CAACAAGCGG CAGTTCAGCG ACTCCGGATA CCCCAGTAAC GCAACCGCAA TCCGGTCCGT CCTCTTCAGG CGGGGCCGTG ATTTTGGAAT AG
Protein sequence | MRAGIFGFWG LVLVAGMLLF TAAGCKTSGP AALGSSAADA SPSVASGPSL SQTLASGWDH LLDESFTDNA RSWPEQDTDR VQSRVAGGVY TVQLDDRLAH LQALTDVRLN GLADFRVAAT FNYKSRDKSY VGLLFGAADV RHMFRFRLEA SGRVSLSRVA AGEYTTLVKK QVPDLLNAPG GEYHLAVVQE GDTWRCEVNG REVFRLPAEP VYGGRVGLYA YGKQRLRVHR LQVARDGTGL QAVRTWTGLG DRFGFEQYAF SEDGRRLATW SAVGPRGLLA LWDVTSVARP LVGWRFVDHP VRRRKGAEPV HFFNTNGQKR FAVSDDGTLG AVTFWFEGDT SRLVLQVFRW DDPATPVFHI QKRAPGRVVL PHGVALRPDN DMVALNTAVQ GTQGQVFASG EVVFFPLYDG GTPSKFTPPA DGAVFMQHPR WSAGEGFFAE VLYRPQGKAG EMYWVVYPFG GGGDTGPQAL RAKGETTIAY RRARSLDVTA DERLLSVLEQ EGGVRWYRTH DLTTPWVRVD LDDQAYLGVF GGRDEWGMLT TSLYFQRFAI QDDRVVPLGR EWCPFLAQDM IYSPDRGGWL VAGAKQVRLY PDYAQAEHDA ALALHEAEEL LQVGFAEQAE AKLQQALDLD YTFQAKDTEE LYLTLLPRMA ASARARLPGR LALEQYQRGR QAPKIPVLGL QVRSQDGGVV VQNIHDGTPA VASGLRTGDH ILRFQDRAET DVASLVEAVR ACTPGQRVNL QVRRGDTLQT VSLDTLARWK EDAALEATVY GLFNYGLFAA EAGQPALVSA AADAFDSLLR THPGAVKPEK LNGFAAILRA LALAGQGQRD EALAALLGVS LDAQQHKYIL KRTAAFAPLY GERDKLAYVL DVDANEIPQT IAQAAKPQPY PDLQGRLVRP PSAPELQAPL HTPATSGSSA TPDTPVTQPQ SGPSSSGGAV ILE