Gene Information |
Locus tag | Dhaf_1089 |
Symbol | |
ID | 7258058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfitobacterium hafniense DCB-2 |
Kingdom | Bacteria |
Replicon accession | NC_011830 |
Strand | + |
Start bp | 1181250 |
End bp | 1182530 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643561004 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_002457585 |
Protein GI | 219667150 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
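
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1181250, 1182530)
print(length, protein_length_aa(length))  # prints: 1281 426
```

Under those assumptions the computed values agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields in this record.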
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0000284177 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAAAA GAACGATAGC AATATTCGTA ATCATTATTG CCGCTTTGGT TTTAGGATTT CTCTTTTTAT CCCAAAGGTT GAATCCCCAG GAGGGTAACC TTTCCATACC AGGTCAACAC GATCAGCAGG GGAACAATAA TCCGGCTGAA CCCTCAGAAA AGCCCCCCGA GATTGACCCC CTGCAGGAAC GGATCCAAGC TATGACCTTG GAAGAGAAGG TCGGGCAGCT GGTGATGGTG GGAGTGGACG GGTATGAAAT AAACGCTAAT GCTCAGCAGT TGATTGAAAA TTATCATGTG GGCGGCTTTG TTCTCTTAAA GAAAAATGTC AGGGATAGCG GGCAGATGTT AAACCTGATC AACACCTTGA AAGAGACCAA TGGAGTCAAT AGAATTCCCT TATTCCTGGC CCTTGATGAA GAGGGGGGCA GGATATCCAG GATGCCTGCT GAATTCAAGA AGATGCCTTC CAGTCAGCAG GTCGGAGCCC AGAATAGTGG TGCTTTAGGG AAGAAGATGG GAGAAATCCT GGGCCGGGAA GTCAAGGGAT TTGGGATGAA TGTGAATTTC GCTCCGGTTC TCGATATTTT CAGCAACCCC AAGAATAAAG TGATCGGTGA TCGTGCTTTC GGCAGCAACC CCGAGCTTGT CAGCAAAGTG GGAATTCAGA CTATGAGGGG AATCCAGGAG CAGGGCATTA TCTCTGTGGT TAAACATTTT CCCGGTCATG GGGATACTTC GGTGGATTCC CATGTGGGAT TGCCCCGCGT TGATTATGAT CTGGAACGAT TAAGGAATTT TGAGCTGAGG CCATTTGCAG AAGCCATTGC CAATGATGTG GATGCCATTA TGCTGGCCCA CATCCTGCTG CCGAAGCTTG ACCCGGATTA TCCGGCATCC TTTTCAGAAG TTCTTATCCG CGATATCCTG CGCAAAGAGA TGGACTATAA CGGGGTCGTG ATTACGGATG ATATGACTAT GGGGGCTATT GTGGAGAATT ATAATATCGG TGAGGCCGCG GTGAAATCCA TCCTGGCCGG CAGCGATATT GTCCTGGTCT GCCATGATTT CGCGAAAGAA GAGGCTGTTC TCAAGGAGAT CCTTCATGCT GCAGAGACAG GGAAAATTCC CGTGGACCGG ATCGATGAAA GTGTTTATCG TGTCTTAAAG CTGAAAGAGA AGTATGCTCT GGCCGACCGG CAGAAAGAAT CGGTGGATGT ACAAGGTATC AATGCCGAAA TCGAGCAGTT TTATAAGGAT TACCCGGCTT TAAAAGGGTA G
|
Protein sequence | MGKRTIAIFV IIIAALVLGF LFLSQRLNPQ EGNLSIPGQH DQQGNNNPAE PSEKPPEIDP LQERIQAMTL EEKVGQLVMV GVDGYEINAN AQQLIENYHV GGFVLLKKNV RDSGQMLNLI NTLKETNGVN RIPLFLALDE EGGRISRMPA EFKKMPSSQQ VGAQNSGALG KKMGEILGRE VKGFGMNVNF APVLDIFSNP KNKVIGDRAF GSNPELVSKV GIQTMRGIQE QGIISVVKHF PGHGDTSVDS HVGLPRVDYD LERLRNFELR PFAEAIANDV DAIMLAHILL PKLDPDYPAS FSEVLIRDIL RKEMDYNGVV ITDDMTMGAI VENYNIGEAA VKSILAGSDI VLVCHDFAKE EAVLKEILHA AETGKIPVDR IDESVYRVLK LKEKYALADR QKESVDVQGI NAEIEQFYKD YPALKG
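
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first 30 bases of the gene. It is a minimal example, not the pipeline that produced this record. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code; table 11 differs in its permitted start codons, which this sketch does not model. The helper name `translate` is hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGGCAAAA GAACGATAGC AATATTCGTA"))  # prints: MGKRTIAIFV
```

The result matches the first ten residues of the protein sequence listed above.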