Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0163 |
Symbol | |
ID | 8417967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 205967 |
End bp | 207106 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645036728 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003197043 |
Protein GI | 258404301 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0101171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.219232 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAGA GGTTTGCCCG GAAGGTCGGG TTGCTCGGTA TGGTGGTTGG GTTGCTGATG GCGACGGTGA CGGCCTCTGC GGCTGCGTCA CCCAGCCTAG AGGCTATGGT CGGCCAAATG CTCATGATCG GCTTTCGAGG CACGACTTTT GAGGCCAAAA GTCCCCTCGG CGAGGCGATT ACAGAGGGCA ATCTGGGCGG TGTGGTCCTT TATGGCCGAG ATGTGGCCCT GAACAAACCG ATGCGCAATA TCCGCTCCGC CGCCCAACTG CAGGCGTTGA CCGCTGACCT TCAGGACCAC GCCCGGATTC CGCTTTTCAT TGCCGTTGAT GAGGAGGGCG GTCAGGTCAG CCGCCTCGCC CCTCGGTTTG GCTTTCCCGA GACTGTCACG GCGGCGACGT GGGGCCGACG CAACGATCCA GCCTGGACCA GGCGCGGGGC CAGGGCTATC GGGCAGCGGC TCCGGAACCT TGGGTGCACC ATAAATCTCG CGCCGGTGGT TGATCTGAAC ACGAACCCCG ACAATCCGGC TATCGGCAAG CTGGAACGGA GTTTCGGGGC CAATCCGGAC ACCGTCACGC GCCAGGCGGC TGCGTTTATC CACGGGCTGC ACGACGCCGG CATTCTGGCC TGCATCAAGC ATTTTCCAGG TCACGGCAGC GCGTATAACG ATTCCCATCT GGGGTTGACC GATATCAGCA CGACCTGGTC GCCCAAGGAA TTGGAACCCT ATAAACGGCT CGTCGACCGG GGACTGGCAG ACGCTGTGAT GACCGCCCAT GTCTTTCATG CCGGATTGGA TCCGAAGGTG CCGGCGACGT TGTCGGCCGA GATCATTCCC GATATTCTGC GCCGGGAGAT CGGGTATGAG GGGGTGGTGA TCAGCGACGA TCTGCAGATG GGGGCTATTC GCCAGTCATT TTCGCTGCGG CAGACGGTGC GCCGGTGTCT GGAAGCGGAT GTGGATATTT TCCTGTTCGG CAATAACCTG GAGTATGAGC CGTTTGTCTG GCGCCGTGTC CAGCGGATCG TGCGGGACCT TGTCGATCAG AACATTGTCT CTCGCTCCCG CATCGAGCGC TCCTACGAAC GAATCCAAAG GCTTAAAGAG CGGATGGACC TGTTTCAAGG GAGATCGTGA
|
Protein sequence | MGKRFARKVG LLGMVVGLLM ATVTASAAAS PSLEAMVGQM LMIGFRGTTF EAKSPLGEAI TEGNLGGVVL YGRDVALNKP MRNIRSAAQL QALTADLQDH ARIPLFIAVD EEGGQVSRLA PRFGFPETVT AATWGRRNDP AWTRRGARAI GQRLRNLGCT INLAPVVDLN TNPDNPAIGK LERSFGANPD TVTRQAAAFI HGLHDAGILA CIKHFPGHGS AYNDSHLGLT DISTTWSPKE LEPYKRLVDR GLADAVMTAH VFHAGLDPKV PATLSAEIIP DILRREIGYE GVVISDDLQM GAIRQSFSLR QTVRRCLEAD VDIFLFGNNL EYEPFVWRRV QRIVRDLVDQ NIVSRSRIER SYERIQRLKE RMDLFQGRS
|
| |