Gene Dret_0163 details

Gene Information

Locus tag: Dret_0163
Symbol:
ID: 8417967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 205967
End bp: 207106
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 61%
IMG OID: 645036728
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_003197043
Protein GI: 258404301
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
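
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed gene length agrees with that reading) and that the CDS ends in a single stop codon. The function names are illustrative only and do not come from the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 205967
END_BP = 207106


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residues encoded by a CDS of n_bp bases, excluding one stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1140 379
```

Both values match the Gene Length and Protein Length fields of the record.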


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0101171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.219232
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAAGA GGTTTGCCCG GAAGGTCGGG TTGCTCGGTA TGGTGGTTGG GTTGCTGATG 
GCGACGGTGA CGGCCTCTGC GGCTGCGTCA CCCAGCCTAG AGGCTATGGT CGGCCAAATG
CTCATGATCG GCTTTCGAGG CACGACTTTT GAGGCCAAAA GTCCCCTCGG CGAGGCGATT
ACAGAGGGCA ATCTGGGCGG TGTGGTCCTT TATGGCCGAG ATGTGGCCCT GAACAAACCG
ATGCGCAATA TCCGCTCCGC CGCCCAACTG CAGGCGTTGA CCGCTGACCT TCAGGACCAC
GCCCGGATTC CGCTTTTCAT TGCCGTTGAT GAGGAGGGCG GTCAGGTCAG CCGCCTCGCC
CCTCGGTTTG GCTTTCCCGA GACTGTCACG GCGGCGACGT GGGGCCGACG CAACGATCCA
GCCTGGACCA GGCGCGGGGC CAGGGCTATC GGGCAGCGGC TCCGGAACCT TGGGTGCACC
ATAAATCTCG CGCCGGTGGT TGATCTGAAC ACGAACCCCG ACAATCCGGC TATCGGCAAG
CTGGAACGGA GTTTCGGGGC CAATCCGGAC ACCGTCACGC GCCAGGCGGC TGCGTTTATC
CACGGGCTGC ACGACGCCGG CATTCTGGCC TGCATCAAGC ATTTTCCAGG TCACGGCAGC
GCGTATAACG ATTCCCATCT GGGGTTGACC GATATCAGCA CGACCTGGTC GCCCAAGGAA
TTGGAACCCT ATAAACGGCT CGTCGACCGG GGACTGGCAG ACGCTGTGAT GACCGCCCAT
GTCTTTCATG CCGGATTGGA TCCGAAGGTG CCGGCGACGT TGTCGGCCGA GATCATTCCC
GATATTCTGC GCCGGGAGAT CGGGTATGAG GGGGTGGTGA TCAGCGACGA TCTGCAGATG
GGGGCTATTC GCCAGTCATT TTCGCTGCGG CAGACGGTGC GCCGGTGTCT GGAAGCGGAT
GTGGATATTT TCCTGTTCGG CAATAACCTG GAGTATGAGC CGTTTGTCTG GCGCCGTGTC
CAGCGGATCG TGCGGGACCT TGTCGATCAG AACATTGTCT CTCGCTCCCG CATCGAGCGC
TCCTACGAAC GAATCCAAAG GCTTAAAGAG CGGATGGACC TGTTTCAAGG GAGATCGTGA
 
Protein sequence
MGKRFARKVG LLGMVVGLLM ATVTASAAAS PSLEAMVGQM LMIGFRGTTF EAKSPLGEAI 
TEGNLGGVVL YGRDVALNKP MRNIRSAAQL QALTADLQDH ARIPLFIAVD EEGGQVSRLA
PRFGFPETVT AATWGRRNDP AWTRRGARAI GQRLRNLGCT INLAPVVDLN TNPDNPAIGK
LERSFGANPD TVTRQAAAFI HGLHDAGILA CIKHFPGHGS AYNDSHLGLT DISTTWSPKE
LEPYKRLVDR GLADAVMTAH VFHAGLDPKV PATLSAEIIP DILRREIGYE GVVISDDLQM
GAIRQSFSLR QTVRRCLEAD VDIFLFGNNL EYEPFVWRRV QRIVRDLVDQ NIVSRSRIER
SYERIQRLKE RMDLFQGRS
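
The relationship between the two sequences can be spot-checked with a minimal translation sketch. It uses the amino-acid assignments of NCBI translation table 11 (the table named in the record) and translates the first and last 60-base lines of the gene sequence. Every line of the gene sequence holds 60 bases, a multiple of 3, so each line starts in frame. The helper names `clean` and `translate` are illustrative, not part of any library, and the sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 11, listed in TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGGAAAGA GGTTTGCCCG GAAGGTCGGG TTGCTCGGTA TGGTGGTTGG GTTGCTGATG"
last_line = "TCCTACGAAC GAATCCAAAG GCTTAAAGAG CGGATGGACC TGTTTCAAGG GAGATCGTGA"

print(translate(first_line))  # MGKRFARKVGLLGMVVGLLM
print(translate(last_line))   # SYERIQRLKERMDLFQGRS
```

The two outputs match the first 20 and last 19 residues of the protein sequence, and the final codon TGA is the stop codon that is not counted in the 379 aa protein length.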