Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_4206 |
Symbol | |
ID | 8227808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 5086472 |
End bp | 5089528 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644932053 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003088574 |
Protein GI | 255037953 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATT GGTTGTTCGG CCGGTTTCGT GTTGTAGTGA TTGTTGTGGC GCTGGTGGTG CTGACCTCCG TGGCTTCCGT TGCGTGGAAA ATGATGTCGG TTAAGACCAC GGCCGTCAAC AAACCCGCTG TTGCCGCACC CGAGCCCAAG CATAAGGCGC CCAAAAAGCG CCGTGACCCG GTGTTGGTTC TTCCGGAAAA TGAATGGGTG GACAGTACGC TCAAATCCCT GAGCATCGAG CAGAAGATCG GTCAGCTGTT CATGGTAGCT ACTTTTTCCA ACCGCGACGA ATCGCATTAC CGTTACATCG ACAAGCTGAT TAACGATTAC CAGATCGGCG GACTGATATT CTTCCAGGGC GGCCCCGTAA GCCAGGCGCA GCTCACCAAC CGCTATCAGG CGCATTCCAA CATTCCGCTT TTCATCGGCA TCGACGGTGA ATGGGGCCTG GGCATGCGGC TCGACAGCAC GATTTCCTTC CCGAAACAAA TGGTGCTCGG CGCGATCCAG GACGACCGGC TGATTTACCG CATGGGCGAC GACATTGCGC GCCAATGCAA ACGCCTCGGC ATTCATATCA ACTTCGCGCC GGTGTCGGAT GTGAACAGCA ACCCGGCCAA TCCGGTAATC GGTATACGTT CGTTTGGTGA GGACAAAGAG AATGTCGCGC GGAAGTCGAT CGCCTACATG AAAGGCTTAC AGCACAATCG CATTATCGCT ACTGCCAAGC ATTTCCCCGG CCATGGTGAT ACCGATGCCG ATTCGCATTT CACACTCCCG GTGCTTAATC ATTCGCTGGA ACAGCTGAAT TCCGACTTGT ACCCTTACCG CGAAATGATC GCCGACAGCC TGATGGGCGT CCTGAATGCG CATTTGTACA TTCCCGCATT GGACAACACG CCCAACCAGG CCAGCTCGCT GTCTGATAAA GTGGTGAATG GCTTGCTCCG CAAAGACCTG GGCTTCCGCG GCCTCGTGTT TACCGACGCC ATGAATATGC GCGGCGTGCT CAAAAACGGC AAGCCGGCCG ACGTGAACCT GAAAGCGCTG GTAGCAGGCA ACGACGTGCT GCTTTACCCG GAAAGCATCG CCGAAACCGT TTCGCGCATC AAGGATGCCA TCAATTCGAA GCTGATCAGT GAAAAGATCA TCGACGACAA GGTGAAACGC ATTCTGCAAG CCAAATACTG GGCCGGGCTG AACCAATATA AGCCGATAGA CATCAACAAT CTTTACAACG ACCTCAACAG CGAGAAGAGC AAGGAACTGA ACCGCGAGCT CTCTGAGGCT TCCGTAACCG TTGTTAAAAA CGACAATAAC CTGATCCCGA TCGGCTCGGT GATGGACAAC AACATAGCGT CCGTGAGCAT GGGCGAGGGC AGCGGTGTGG CATTCCAAAA GATGCTCTCC ACCTACAAAC CGATGCGGAG CTACTCGTTT TACGACGCGC CGTCGTCGCA GGACCAGATT ACCGAAATGC TCGGTTACCT GCAACCTTAC AATACAGTGA TCGTCGACGT CCACGGCATT AGCTCCAAGC CCGGCCGGAA TTATGGCATT ACCACGGGCA TGGTGGAGTT TGTTAACCAG CTCAAACAGC AGAATAAAAA AGTGATCCTC TGCCTGTTCG GAACGCCTTA TAGCATTCAG TTTTTCCCTG ACACCGATGT GCTCATTTGC GCCAACCAGG ACGGCAAAGA CCAGCAGGAG ATCGTGCCGC AGATCATTTT CGGCGCGCTG GGCTCGAAGG GCCGCCTGCC GGTGTCGGTA CTGGCCCACA AAAGTGGCGG CGGTGTAAGC ACCACGTCCA TCAATCGCAT TGCATTCGGC ACGCCGGAGA GTGTGGGAAT GGATGGTATT TCTCTTAAAA AAATCGATGA AATCGCCACT GCGGCCGTGA ATGACCATGT TTTCCCCGGC TGCGAAGTGC TGGTGGCCAG ACAGGGTAAG ATCATTTACG AAAAACAATT CGGTGGGTTA AGCTATCGCA CCAACGAACG CGTAACGCCC GAGACGATCT ACGACCTGGC GTCGCTTACC AAGGTTTCGG CTACATTGCA GGCTGTGATG CTCCTTTATG ACCGCAAGCA GATCAGTTTG GATGAAAAGG CATCCAAATA CCTGCCTGAA CTGGCGGGTA CCAATAAGCA AAACTTCACC GTCCGTGATT TGCTGCTGCA TCGTTCGGGA CTGGTTTCGT TCTACCCGTC GCTGTGGGAC CGTACCAAAA CCAGCGCAGG CGGTCTTCTG CCGGAATATT ACAGTTCGAA ACAAGACACG GCCTACTACC TGCAGGTAGC CCCGAAACTC TTCGCCAAAG GTGCATTGCG CGATTCGGTA TGGAAGTGGG TGGTGGAATC GCCCATGAAC AACCGCCGCG ACCGCGCCGG CAGCTACGGT TACCTGTACA GTGACCTCGG CTTTTTGACA CTCCAAAAAA TTGTCGAAAG AGTGGCCGGC CAGTCGCTCG ACAATTTTGT GGCCGCCAAC ATTTACGAGC CGCTGGGGCT GCCCTACCTG GGTTTCAACC CGCTGCGTCG TTTTCCTGAA AAGCAGATTG CGCCTACCGA GCAGGATTAC CGGTTCCGCG GGCAGCTGCT ACAAGGAACA GTCCATGATC AGATGGCCGC CATTGTTGGC GGCGTATCGG GCCACGCCGG CCTTTTTGGC ACCGCACGCG ACCTGGCGGT TTTACTTCAA ATGAACCTCT GGAAGGGCAA CTACGCAGGA AAACGCTATT ATGAGCAAGC CACGGTGCCA TTCTTCTCGC GGATGTACGA CGAATCCCAT CACCGCGGAC TGGGCTGGGA CAAGGCCCCT GCGGACGGCA ACAGCTCATT CGTATCACCG CTGGCGTCCG TCAACTCATT CGGCCACACG GGTTTCACCG GTACAATGGT GTGGGTAGAT CCCGAGGAAG ACCTGGTATT CATATTCCTT TCCAACCGCG TGAACCCCGA CCCGGAGAAT ACGGCCATTA CGACCCATCG TACACGAAGA AAAATCCAGG ATGTAGTGTA TGGTTCTTTG ATTCAACGGA AAAGCCAGCT TCCTTAA
|
Protein sequence | MKNWLFGRFR VVVIVVALVV LTSVASVAWK MMSVKTTAVN KPAVAAPEPK HKAPKKRRDP VLVLPENEWV DSTLKSLSIE QKIGQLFMVA TFSNRDESHY RYIDKLINDY QIGGLIFFQG GPVSQAQLTN RYQAHSNIPL FIGIDGEWGL GMRLDSTISF PKQMVLGAIQ DDRLIYRMGD DIARQCKRLG IHINFAPVSD VNSNPANPVI GIRSFGEDKE NVARKSIAYM KGLQHNRIIA TAKHFPGHGD TDADSHFTLP VLNHSLEQLN SDLYPYREMI ADSLMGVLNA HLYIPALDNT PNQASSLSDK VVNGLLRKDL GFRGLVFTDA MNMRGVLKNG KPADVNLKAL VAGNDVLLYP ESIAETVSRI KDAINSKLIS EKIIDDKVKR ILQAKYWAGL NQYKPIDINN LYNDLNSEKS KELNRELSEA SVTVVKNDNN LIPIGSVMDN NIASVSMGEG SGVAFQKMLS TYKPMRSYSF YDAPSSQDQI TEMLGYLQPY NTVIVDVHGI SSKPGRNYGI TTGMVEFVNQ LKQQNKKVIL CLFGTPYSIQ FFPDTDVLIC ANQDGKDQQE IVPQIIFGAL GSKGRLPVSV LAHKSGGGVS TTSINRIAFG TPESVGMDGI SLKKIDEIAT AAVNDHVFPG CEVLVARQGK IIYEKQFGGL SYRTNERVTP ETIYDLASLT KVSATLQAVM LLYDRKQISL DEKASKYLPE LAGTNKQNFT VRDLLLHRSG LVSFYPSLWD RTKTSAGGLL PEYYSSKQDT AYYLQVAPKL FAKGALRDSV WKWVVESPMN NRRDRAGSYG YLYSDLGFLT LQKIVERVAG QSLDNFVAAN IYEPLGLPYL GFNPLRRFPE KQIAPTEQDY RFRGQLLQGT VHDQMAAIVG GVSGHAGLFG TARDLAVLLQ MNLWKGNYAG KRYYEQATVP FFSRMYDESH HRGLGWDKAP ADGNSSFVSP LASVNSFGHT GFTGTMVWVD PEEDLVFIFL SNRVNPDPEN TAITTHRTRR KIQDVVYGSL IQRKSQLP
|
| |