Gene Information |
Locus tag | Dfer_1725 |
Symbol | |
ID | 8225296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 2104564 |
End bp | 2106840 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644929579 |
Product | Glycoside hydrolase, family 20, catalytic core |
Protein accession | YP_003086131 |
Protein GI | 255035510 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
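The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2104564
end_bp = 2106840

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2277 758
```

Under these assumptions the result matches the listed Gene Length (2277 bp) and Protein Length (758 aa).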
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCGGA GCATTTTTCT GCTCAGTTTC TGCTGCCTCA TTATCTGGCC CGGAGCGCAC GCGCAAGTCC GGAGTTCGCA AAAATCGGTA GCCGCGGAAA GCCAGGCCGG CGCGGGGCTC CTGCCACTTC CCCAGCAAAT GCAGCTCACG GCCAAGCGGT TTTCCATGGG CACAAACTGG CATATCGCGG CCGATCCGGC GCTCGCCGGC GAGCAGGCGG TGATGAGCTT GCAGCAGGGA TTGAAAGAGG TAAATCTGGC GCTTAATGTG ACCGGAGCAA AGGGCAAGTC TCCCGCCATT CAGCTTATCG TGCAAAACGG CTCGGTCGAG ATCGGCGCTA GTGTGGATAC CAACCGCGCC GCGCTTACCC GCCAGGCTTA CCGGCTGAGC TTGAAACCCG GGCAGGTGAC CATCACGGCC AATGCGCAGC AGGGCTTGTA TTATGGTGTC CAAACTTTTT TACAGTTGGT CAAAGCCGGC CCGCAACTGC CTGAAGGCGA GATTACCGAC TGGCCCAATG TGGAGGTGAG GATGATTTAT TGGGATGATG CGCACCATCT TGAAAAGCTG ACTGCATTGA AACGGATCAT CAGGCAGGCA TCTACGTACA AAATCAATGC GTTTTCCATC AAACTCGAAG GGCATTTCGA ATACAAATCC GCCCCGGCGA TTGTAGAACC TTATGCGCTG ACAGCCAAAG AATATCAGGA ACTCACAGAC TTTGCCAAAG CACACTTTAT CGACCTCGTG CCGTTCCTCG ACGCCCCGGC GCACGTATCG TTCATCCTGA AACACCCCGA ATTCCGGAAA CTGCGGCTGA TCGACGACAT CAATTACCAG TTCTCGGTCA CAAATCCGGC CACGTTCAAG CTCCTGGACG CCATGTACAG CGAGTTGATC AATGCGAGCA AAGGCAGCAA ATACATTCTC CTTTCCAACG ACGAAGCCTA CTACACCGGC AAAGCACCAT CCGAAAAGGC GATGGCCGAC AGTCTCGGAG GCAACGGGCG TTTGCTGGCA TGGTTTATGA AAAAACTGGC CGACAGGCTC CACGAGCAGG GTCGCACGGT GCTTTTCTGG GGCGAATTCC CTTTGCGCAA AGAGGACATT ACCGCGCTGC CTTCACATAT GGTTAATGGG GTTTATAACA AATCCATCGC GGCCGACTAT AAAAAGCATG GCATCCGGCA GTTCGTATAC ACCGCCACGC AGGGCGCCGA GCCCATTTTT CCCAACTACT ACCCGAAACA CACTAAAACC GCCATCATGG CCGACGGCAG CGACCGCTCC TCGGGGCGGG TGGCGGATAT GTTCCGGGAA ATCGGGACGG CATTCTCGGG CAATCTGTCA TCGTTTATGG GCGTGATCAT CGCAGGCTGG GCCGATGCGG GGCTGCATCC CGAAACGTTC TGGCTGGGCT ATGCCACGGG CACGGCGGCC GGTTGGAATA ATAAAAATCT CACCCCTGCC AATGCTTCCG CACGGTTTAT GAATTCATTT TACGGGCCCA GGCAAAAGGA TATGGACAGC GTTTACCGCG TCCTGAGCGA GCAAGCCGAG TTCCATACCG AGAGCTGGGA TTGGATTCCA TCGCCCTGGC GCGCGCCCAT CAAAGGAAAT TCCGACAGCC TGTTCCTACA ACCGGAAAAA GACCAGACCT TGCCCCTATT GCCCTTACCG GCATCCAGCA CGCTGGCCGT TAGCAACAAA ACTTACCCGG TGAATGCCAA ACGCTTTGCC ACGGCGCAGT CTATGCTGCC GCGCAATGCC TATGCGCTCC AACTGCTCCG GGAGAATAAA 
TCGCTTGCCA AAACCCAGCT TTACAACCTG GAAGTACTCG AATCGGTGGC GGCGATTTGC GGACAGAACC TCCGCATGCT CGTCGCATTG AACCGGATTT CGGATTTAGT GCAGGAAGCT TCCGCCGCGA CCGCCGATCC GAAGCTGGCC GTCAGCAAAC TGGATGCGGC GATGGAAATG GTGGGCAAGC TGAAAAACCA GCGCGACAGC ACGTTTGCAT TTGTCGAAAG CATTTGGTTC AAAGAATGGC AGCCGCTGGT GGAAAAAGCC AATGGGAGGA CATTTTTACA CCAGGTCGAC GACATCAAAG ACCACCAGCC CGTGCGCACG ATCGACCTCA GCTACCTCAT TTACCGTGAG CTTCACTACC CGCTCGATCA ATGGTGGAAC GATGTGCTGA AAGTGCGGAA CGACTTCGCA CAGCAGCATA ACCTCCCGCT GGTGAACAAG CAGCTCGAAT GGGCGCGGTA CAGGTGA
|
Protein sequence | MYRSIFLLSF CCLIIWPGAH AQVRSSQKSV AAESQAGAGL LPLPQQMQLT AKRFSMGTNW HIAADPALAG EQAVMSLQQG LKEVNLALNV TGAKGKSPAI QLIVQNGSVE IGASVDTNRA ALTRQAYRLS LKPGQVTITA NAQQGLYYGV QTFLQLVKAG PQLPEGEITD WPNVEVRMIY WDDAHHLEKL TALKRIIRQA STYKINAFSI KLEGHFEYKS APAIVEPYAL TAKEYQELTD FAKAHFIDLV PFLDAPAHVS FILKHPEFRK LRLIDDINYQ FSVTNPATFK LLDAMYSELI NASKGSKYIL LSNDEAYYTG KAPSEKAMAD SLGGNGRLLA WFMKKLADRL HEQGRTVLFW GEFPLRKEDI TALPSHMVNG VYNKSIAADY KKHGIRQFVY TATQGAEPIF PNYYPKHTKT AIMADGSDRS SGRVADMFRE IGTAFSGNLS SFMGVIIAGW ADAGLHPETF WLGYATGTAA GWNNKNLTPA NASARFMNSF YGPRQKDMDS VYRVLSEQAE FHTESWDWIP SPWRAPIKGN SDSLFLQPEK DQTLPLLPLP ASSTLAVSNK TYPVNAKRFA TAQSMLPRNA YALQLLRENK SLAKTQLYNL EVLESVAAIC GQNLRMLVAL NRISDLVQEA SAATADPKLA VSKLDAAMEM VGKLKNQRDS TFAFVESIWF KEWQPLVEKA NGRTFLHQVD DIKDHQPVRT IDLSYLIYRE LHYPLDQWWN DVLKVRNDFA QQHNLPLVNK QLEWARYR
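The gene and protein sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how the two relate, the code below strips that spacing and translates DNA codon by codon. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons); `CODON_TABLE` and `translate` are hypothetical names introduced here for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGTACCGGA GCATTTTTCT GCTCAGTTTC"
print(translate(first_blocks))  # MYRSIFLLSF
```

The output matches the first block of the protein sequence. Passing the full gene sequence (both lines joined) to the same function should reproduce the listed protein, ending at the terminal TGA stop codon.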