Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_1521 |
Symbol | |
ID | 8225092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 1816855 |
End bp | 1820160 |
Gene Length | 3306 bp |
Protein Length | 1101 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644929379 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003085931 |
Protein GI | 255035310 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.285241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TTATACTACT TGCGTGGGCG GTGCCTGCGC TTGCCTGGGG GCAGTCACCC GCTTTGCAGC AAGGCTTCCA AACGCCTCCC GATGCGGCTA AACCGCGTGT TTGGTGGCAT TGGATGAACG GCAACATTAC CAAAGAAGGG ATCACCAAAG ACCTCGAATG GATGAAGCGC GTAGGCATTG GCGGCTTCCA GAATTTCGAT GCGAGCCTTT TTACACCCAA TGTGACGCCC AAAAAGCTGG TGTTCATGAC GCCCGACTGG AAGGACGCTT TCAAACATAC CACCGACCTC GCCCAAAAGC TCGGCCTCGA AATGGCGATC GCAGGCTCAC CCGGTTGGAG CGTTACCGGC GGGCCGTGGG TACCGGCCGC AGATGCGATG AAAAAGTACG TTTGGACCGA AACGCATGTG CCGGGCGGGC AAACTTTTAC AGGCAAACTC CCCCCGCCCG CACCGGTGGC CGGGAAGTTT CAGAATGTGC CATTGCCTGC CGAAGGCGGG ATGTCAGGAC CGTCGGGCGA AGTGCCGGAT TATTATGCCG ATGCGCTCGT GATCGCCTAC AAGCTGCCTT CTGCCGACAA GCGCATGAGC ACGCTCAATC CCAAAGTAAC GTCGAGCGGA GGCTCTTTTT CGGTAGCCGA ACTGACCGAC GGCGATTTGG GCAAAACCTC CCTCCTGCCG CCGATGGAAG TTGGCCAGGA TATGTGGGTA CAATATGAAT TCGATACACC GCAGACATTC AAGGCATTGA CGATTGTAGG CGCCAGCTCG GGCGGCGCAT TGGCCGAGTT CAACGGAGCG CCCAACAACC GGACGCTGCG GGTAAGCGAC GACGGCGTCA CTTTCCGCGA CGTTGCGCCC ATCAAAGGCA GCATTGTTCC TCAAAACACA CTGGCATTTA CTCCCGTAAC GGCCAGATTC TACCGGATCG CGTTCAAAAC GCTGGCGCCG CCGTTCAATC CTTTCGTAGC GATGATGGGC GGAGGCGGCG GGCAGCCGGC CGCGCCCGAG GGCGTTCATG TGGCCGAAAT CGTGTTTCAT AATACCGACC GGATGGATCA GTTCGAAGAA AAGGCTGGTT TCAGCCCCTG GCGCGAGAAT ACTTCCTCGC TGATCCAGCC CAATGCGGAC GCGGTACCCC TCACCGACGT GATCGACCTG ACTTCCAAAA TGACCGCCGA CGGCAGCCTG AACTGGACAC CGCCTGCCGG CAACTGGGTG GTGGTACGGC TCGGGTACTC GCTCACCGGG CGCAAAAATC ATCCCGCCTC GCCGGAAGCA ACCGGTTTAG AAGTGGATAA GCTCGATAAA GCAGCCGTGA CCCGCTATAT TAATACCTAT CTCGATATGT ATAAGGACGC TACCGGCGGG CAAATGGGCG CGAAAGGACT GCAATTCATG GTGCTCGACA GCTACGAGGC AGGCCATATG ACCTGGACGA AAGAAATGCC GCAGGAGTTC AAAAAGCGCC GAGGCTACGA CATCACGCCC TGGATTCCCG CGCTCACAGG CATCATCGTG AAAAGCGCCG CTGAAAGCGA CCGCTTTTTG TGGGATTTCC GAAAAACCAT CGGCGAACTG ATCATCGAAA ACCACTACGA GGTGATCGGC GACGCATTGA AAGCGCGCGG AATGAAGCGC TATACCGAAT CGCACGAGGG CGGCCGCATT TACCTGGCCG ACGGCATGGA TGTGAAGCGC AAGGCCGACA TTCCGATGGC TGCAATGTGG ACGCCCGGCA GCCTGGCCGG GGGCGCCGAC GAGGAAGTGC GGAGCGAAGC CGACATCCGC GAATCGGCCT CGGTGGCGCA CATTTACGGA CAAAACATCG TCGCGGCGGA ATCGATGACG TCCGTTGGCA ATGCATTCAC CTGGTATCCC GAAAAGCTGA AACGCACGGC GGATCTCGAA ATGGCGTCGG GACTGAACCG GTTTGTGATC CACACCTCCG TTCACCAGCC ACTGGACGAC AAAAAACCCG GCTTTTCGCT CGGGCCGTTC GGGCAGTATT TCACGCGGCA GGAAACCTGG GCAGAACAGG CAAAGGCGTG GATGGATTAC CTGGGGAGGA GCTGTTTCAT GCTGCAACAA GGCAAGCCGG TGGTGGATGT GCTGTATTAT TATGGTGAAA ACTCAAATAT CACGCAGATC TCCACGCAAA AACTTCCCCC GATACCTGCC GGCTATGCAT TCGATTTCGC GAATTCGAGT GTGATCAAAG ATATGCTGAA AATAGAAAAA GGCTTGATCG CGACGCCATC GGGCCAGCAG TATCGTTTGC TGGTGCTGGA CTCTACCGCC AGGGATATGA CATTGCCGGT GCTGCAAAAG ATTGGCGAAC TGGTGGATAA CGGAATGAAA GTGGCGGGCG TGAAACCGGA GCGTTCCCCC AGCCTGGGTG ACGATCCGTC GGCATTTACG GCGCTCGTCA ACAGGATTTG GAATAATCCG AATGTTTCCT CGCAGCCGCT GGAAACCGTT TTGAGCAGCA TTGCGCCCAA AGATGTGGCC ATATCCGGCG AGAAAGCGAA GATCCTTTAC GTGCACCGCC AAACACCCGA CGCCGACATT TACTGGCTCG ACAACCGCAG TAACGAGGCC AACCAGGCCA GTATCAGCTT CCGCGTTACC GGCAAAATTC CCGTACTATG GAACCCAGAA ACCGGCAAAA CCAGTAAAGT CTTTTACCAG ATCGCCGACG GCCGGACCAC CATTCCTTTG AAATTCGATT CCTGGCAGGC CTATTTTATC GTTTTCAGTG GCAAGGCTCC CAACAATTTG CATTCCGAGC CCGAATGGCG CGAATCGGAC GCCACTGCTG TGACGGGCAC CTGGACCGTG CGCTTTCAAC CGGGCCTGGG CGCTCCTGCG CAGGCGCAAA TGAATGAACT CGCCTCGCTA TCGGAGCATG CGGATGCCGG GATCAAATAC TTTTCGGGAA CTGCCACTTA CGAAAACACG CTCAATGTGT CGGCAATTAA TAAAAAAGCG CGCTACTGGC TCCACCTCGG CGATGTGAAA AATCTCGCCG AAGTGATCGT GAATGGCAAA AATGCAGGCA TTATCTGGAA AAAGCCGTTC CGTATCGACA TTACCGACGC GGTGAAGCGG GGCGCTAATA GCATTCAGGT GCGTGTTACC AATACCTGGG TAAACCGCCT GATCGGCGAC GCACAACCCG GCACCGCCAC CAAAATCACT TATACGACGA TGCCGTTTTA CAAAGCCGAT TCACCTTTAC AGCCCAGCGG CTTGCTGGGC CCGGTGAAAC TGGTGGCGGC AACGCCCGCA AAATAG
|
Protein sequence | MKKLILLAWA VPALAWGQSP ALQQGFQTPP DAAKPRVWWH WMNGNITKEG ITKDLEWMKR VGIGGFQNFD ASLFTPNVTP KKLVFMTPDW KDAFKHTTDL AQKLGLEMAI AGSPGWSVTG GPWVPAADAM KKYVWTETHV PGGQTFTGKL PPPAPVAGKF QNVPLPAEGG MSGPSGEVPD YYADALVIAY KLPSADKRMS TLNPKVTSSG GSFSVAELTD GDLGKTSLLP PMEVGQDMWV QYEFDTPQTF KALTIVGASS GGALAEFNGA PNNRTLRVSD DGVTFRDVAP IKGSIVPQNT LAFTPVTARF YRIAFKTLAP PFNPFVAMMG GGGGQPAAPE GVHVAEIVFH NTDRMDQFEE KAGFSPWREN TSSLIQPNAD AVPLTDVIDL TSKMTADGSL NWTPPAGNWV VVRLGYSLTG RKNHPASPEA TGLEVDKLDK AAVTRYINTY LDMYKDATGG QMGAKGLQFM VLDSYEAGHM TWTKEMPQEF KKRRGYDITP WIPALTGIIV KSAAESDRFL WDFRKTIGEL IIENHYEVIG DALKARGMKR YTESHEGGRI YLADGMDVKR KADIPMAAMW TPGSLAGGAD EEVRSEADIR ESASVAHIYG QNIVAAESMT SVGNAFTWYP EKLKRTADLE MASGLNRFVI HTSVHQPLDD KKPGFSLGPF GQYFTRQETW AEQAKAWMDY LGRSCFMLQQ GKPVVDVLYY YGENSNITQI STQKLPPIPA GYAFDFANSS VIKDMLKIEK GLIATPSGQQ YRLLVLDSTA RDMTLPVLQK IGELVDNGMK VAGVKPERSP SLGDDPSAFT ALVNRIWNNP NVSSQPLETV LSSIAPKDVA ISGEKAKILY VHRQTPDADI YWLDNRSNEA NQASISFRVT GKIPVLWNPE TGKTSKVFYQ IADGRTTIPL KFDSWQAYFI VFSGKAPNNL HSEPEWRESD ATAVTGTWTV RFQPGLGAPA QAQMNELASL SEHADAGIKY FSGTATYENT LNVSAINKKA RYWLHLGDVK NLAEVIVNGK NAGIIWKKPF RIDITDAVKR GANSIQVRVT NTWVNRLIGD AQPGTATKIT YTTMPFYKAD SPLQPSGLLG PVKLVAATPA K
|
| |