Gene Dret_2349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2349 
Symbol 
ID8420209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2678808 
End bp2681639 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content63% 
IMG OID645038951 
ProductPDZ/DHR/GLGF domain protein 
Protein accessionYP_003199210 
Protein GI258406468 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGCGG GTATATTTGG ATTCTGGGGT CTGGTCCTGG TTGCGGGGAT GCTGCTTTTT 
ACAGCCGCAG GGTGCAAAAC CAGCGGGCCG GCAGCGCTCG GATCTTCGGC GGCAGATGCT
TCGCCCTCGG TCGCCAGTGG TCCGTCCCTC TCCCAAACGC TTGCTTCGGG GTGGGACCAT
CTTCTCGATG AATCCTTTAC AGACAACGCC CGCTCCTGGC CGGAGCAGGA TACTGATCGT
GTCCAAAGCA GGGTCGCCGG AGGGGTGTAC ACCGTGCAAT TGGATGACCG TCTTGCCCAT
CTGCAAGCGT TGACCGATGT TCGGCTCAAC GGACTTGCCG ATTTCAGGGT TGCGGCGACG
TTCAATTACA AAAGCAGGGA CAAGAGTTAT GTCGGGCTCC TCTTCGGTGC CGCGGATGTG
CGGCATATGT TCCGTTTCCG TCTGGAGGCC AGCGGGCGGG TGTCACTGAG CCGTGTGGCG
GCGGGCGAGT ATACAACACT GGTGAAAAAA CAGGTCCCGG ACCTCCTGAA CGCCCCGGGA
GGCGAGTACC ATCTGGCGGT GGTCCAGGAA GGCGACACTT GGCGGTGTGA AGTTAATGGG
CGCGAAGTAT TCCGTCTCCC GGCCGAACCA GTTTATGGAG GCCGTGTCGG ACTGTATGCC
TACGGCAAGC AGCGGTTGCG GGTTCACCGT CTGCAGGTGG CACGGGATGG CACGGGCCTG
CAGGCAGTGC GGACCTGGAC CGGGCTGGGC GATCGCTTCG GATTCGAGCA ATACGCGTTC
AGTGAAGACG GCCGGCGTTT GGCCACGTGG TCTGCTGTCG GCCCCCGGGG GCTGCTCGCC
TTGTGGGATG TGACCAGTGT GGCCCGCCCT CTGGTGGGCT GGCGATTCGT GGACCATCCG
GTGCGGCGAC GCAAAGGCGC GGAACCGGTC CATTTTTTCA ACACCAATGG GCAAAAGCGC
TTCGCGGTCT CGGATGACGG CACCCTCGGA GCGGTGACTT TCTGGTTTGA AGGCGACACC
TCCCGCTTGG TGCTGCAGGT CTTTCGCTGG GATGATCCCG CCACGCCCGT ATTCCATATC
CAAAAACGAG CCCCCGGCCG GGTGGTTCTG CCCCATGGCG TGGCCCTGCG GCCGGACAAC
GACATGGTCG CCTTGAACAC CGCGGTCCAG GGCACCCAGG GGCAGGTCTT TGCCAGCGGA
GAGGTGGTTT TTTTCCCTCT GTATGACGGC GGCACGCCCT CCAAATTCAC ACCGCCGGCC
GATGGGGCTG TTTTTATGCA GCATCCGCGC TGGAGCGCCG GGGAAGGGTT TTTCGCCGAG
GTGCTCTACC GGCCCCAGGG GAAGGCGGGT GAGATGTACT GGGTCGTCTA TCCCTTTGGC
GGCGGGGGCG ATACCGGTCC GCAGGCCTTG CGGGCCAAGG GCGAGACCAC AATCGCTTAT
CGCCGCGCCC GGAGTCTGGA TGTCACCGCG GATGAGCGTC TGCTGAGCGT ATTGGAGCAA
GAGGGGGGCG TGCGCTGGTA TCGCACCCAT GATCTCACCA CGCCCTGGGT GCGTGTGGAC
CTGGACGACC AGGCCTATCT GGGCGTTTTC GGCGGCCGGG ACGAATGGGG GATGCTCACC
ACCAGTCTGT ATTTTCAGCG CTTTGCGATC CAGGACGACC GTGTCGTGCC CCTGGGGCGT
GAATGGTGTC CGTTTCTGGC CCAGGATATG ATTTATTCTC CGGACCGTGG AGGATGGCTC
GTCGCCGGGG CGAAGCAGGT ACGATTGTAT CCCGATTACG CTCAGGCGGA GCATGATGCC
GCCCTGGCTC TGCATGAAGC CGAGGAATTA CTCCAGGTGG GATTTGCTGA ACAAGCCGAG
GCCAAGCTGC AACAGGCCTT GGACCTGGAT TACACCTTCC AGGCCAAGGA TACTGAAGAG
CTGTATCTGA CCCTGTTGCC GCGGATGGCG GCCAGCGCCC GGGCCCGGCT GCCGGGACGA
CTGGCCCTGG AGCAATACCA GCGCGGCAGA CAGGCGCCGA AAATCCCGGT GCTGGGCCTG
CAGGTCCGAA GCCAGGATGG TGGAGTCGTG GTCCAGAACA TCCATGACGG CACTCCAGCT
GTCGCTTCCG GCCTGCGGAC CGGGGACCAC ATCCTGCGTT TCCAGGATCG GGCGGAGACG
GATGTGGCGA GCTTGGTGGA AGCGGTACGG GCCTGCACTC CGGGCCAGCG GGTCAATCTT
CAGGTCCGTC GCGGGGATAC GCTACAGACG GTCTCCCTGG ACACCCTGGC CCGCTGGAAG
GAGGACGCGG CTCTGGAGGC CACGGTGTAC GGGTTGTTCA ATTATGGGCT GTTTGCCGCC
GAAGCGGGGC AACCGGCCCT GGTCAGTGCT GCGGCCGACG CATTCGACTC CCTGCTCCGG
ACACACCCCG GGGCCGTCAA ACCTGAAAAG CTCAACGGCT TTGCCGCGAT ACTGCGCGCG
CTCGCTCTGG CCGGGCAGGG GCAGCGTGAT GAGGCGCTGG CAGCCTTACT CGGGGTCTCT
CTGGATGCGC AGCAGCACAA GTATATCCTC AAGCGCACGG CGGCTTTTGC TCCATTGTAC
GGGGAACGGG ACAAGCTGGC GTATGTGCTG GATGTCGATG CCAACGAGAT TCCGCAGACT
ATAGCGCAAG CTGCCAAGCC CCAGCCGTAC CCGGATTTGC AGGGACGGCT GGTTCGACCG
CCCTCCGCAC CGGAATTGCA GGCACCGCTC CACACCCCGG CAACAAGCGG CAGTTCAGCG
ACTCCGGATA CCCCAGTAAC GCAACCGCAA TCCGGTCCGT CCTCTTCAGG CGGGGCCGTG
ATTTTGGAAT AG
 
Protein sequence
MRAGIFGFWG LVLVAGMLLF TAAGCKTSGP AALGSSAADA SPSVASGPSL SQTLASGWDH 
LLDESFTDNA RSWPEQDTDR VQSRVAGGVY TVQLDDRLAH LQALTDVRLN GLADFRVAAT
FNYKSRDKSY VGLLFGAADV RHMFRFRLEA SGRVSLSRVA AGEYTTLVKK QVPDLLNAPG
GEYHLAVVQE GDTWRCEVNG REVFRLPAEP VYGGRVGLYA YGKQRLRVHR LQVARDGTGL
QAVRTWTGLG DRFGFEQYAF SEDGRRLATW SAVGPRGLLA LWDVTSVARP LVGWRFVDHP
VRRRKGAEPV HFFNTNGQKR FAVSDDGTLG AVTFWFEGDT SRLVLQVFRW DDPATPVFHI
QKRAPGRVVL PHGVALRPDN DMVALNTAVQ GTQGQVFASG EVVFFPLYDG GTPSKFTPPA
DGAVFMQHPR WSAGEGFFAE VLYRPQGKAG EMYWVVYPFG GGGDTGPQAL RAKGETTIAY
RRARSLDVTA DERLLSVLEQ EGGVRWYRTH DLTTPWVRVD LDDQAYLGVF GGRDEWGMLT
TSLYFQRFAI QDDRVVPLGR EWCPFLAQDM IYSPDRGGWL VAGAKQVRLY PDYAQAEHDA
ALALHEAEEL LQVGFAEQAE AKLQQALDLD YTFQAKDTEE LYLTLLPRMA ASARARLPGR
LALEQYQRGR QAPKIPVLGL QVRSQDGGVV VQNIHDGTPA VASGLRTGDH ILRFQDRAET
DVASLVEAVR ACTPGQRVNL QVRRGDTLQT VSLDTLARWK EDAALEATVY GLFNYGLFAA
EAGQPALVSA AADAFDSLLR THPGAVKPEK LNGFAAILRA LALAGQGQRD EALAALLGVS
LDAQQHKYIL KRTAAFAPLY GERDKLAYVL DVDANEIPQT IAQAAKPQPY PDLQGRLVRP
PSAPELQAPL HTPATSGSSA TPDTPVTQPQ SGPSSSGGAV ILE