Gene Dfer_2290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_2290 
Symbol 
ID8225862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp2823919 
End bp2827089 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content53% 
IMG OID644930125 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003086676 
Protein GI255036055 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAT TCTATCTCGC AGTCTGCCTG GCGTTTTCGC TTACCTGTTC GGCGCAGATT 
GTACCGCAAT CTCCGCCATT AAACGTGCCG ATCAGCAGCT TGCGAACTTC CTTCCGCACG
GGCCCGGACT CGCTTCCGCT GGCTGTTTAC TGGTACTGGC TTTCGGATCA TATTTCTAAA
GAAGGAGTTG TTAAGGATCT GGAATCGATG AAAAAAGTGG GGATCAACCG CGCATTCATC
GGGCATATCA ATGTGGGCGC TCCCTATGGT GAGCACAAAC TATTTTCGGA TGCCTGGTGG
GAAATCCTGC ATACTGCCTT GAAAAAAGCA GGCGAACTGA ACATTGAAAT TGGCCTTTTC
AACTCGCCGG GGTGGAGCCA GTCCGGCGGG CCGTGGATAA AGCCGGAGGC CAGCATGCGG
TATCTTGCGT CTACCGAGGT ACCGGTTTCC GGACCCAAAA AGATGCAGGG CCCGCTGCCC
GAGCTGGGGC CAGACGCGCA GGACGTAAAA GTGCTGGCTT ATCCCGTGCA GACCAAGCCA
GAAGCGTATT CCCAAATACT CGCCAAGAAA GACAAAGCGG AGGCATCGGT TGAAATGCCA
GTTTCGGGCA ATCAACCCAT GCGCAGCCTG ACCATTCAGG TGGACAAGCC CATCAAAACT
TCCGCCGTAC TCTACTATAA GCAAGGAAAT GAATACCGGG AACTCTTGCG CATTGAAGTA
GATCGCAGTA ATACCGCACT GAATGTGGGC TTTGCGCCCC TGGCCCCGGT GGTGATTTCA
TTGCCCGAGG TGAAGGCTTC TGCATTCAGG CTGGTGGTGG CTCCTGCGGG AACTGCCCGC
ATAGCCGTTT CGCTATCGTC GGAGCCGCAG GTGGAACGGT ACCCGGAAAA AACATTGGCC
AAAATGTTTC AAACACCACT TCCGCTTTGG GCCGACTATC TCTGGCGGCA GCAGCCGCAG
GTGACGGATC CCGGCACCAT TGTTCAGGCC AAAGCTGTCC GAGACCTGAC TTCCTTTTAC
AAAAATGGGC AGCTAACCTG GGATGTTCCG GCTGGGGAGT GGAAGATAGT CAGGCTGGCG
ATGCAAACCA CCAGCGTGAC CAATTCACCG GCAACTCCGG AAGGCACCGG GCTGGAAGTG
GATAAAATGA GCAAGCGCCA CGTTGCCACC CACTTCGACG CCTATCTCGG CGAAATCCTG
CGCAGGATAC CTCCGCAGGA CCGGCGGACA TTCAAAGTCG TGGTGCAGGA CAGCTATGAA
ACAGGCGGCC AAAACTGGAC GGACGATATG GAAGCCCGTT TTGTCAAAAC ATATGGGTAT
AACCCTGTGC CGTTTCTGCC GGCACTGAGC GGCAAGGTGG TGGAAAGTCA GGACAAATCG
GACCGCTTTC TCTGGGACTT GCGCCGCCTG ATTGCCGACC GCGTCGCTTA CGATTATGTG
GGTGGTTTGC GCGAGCAAAG TCACAAGCAC GGGCTTACCA CCTGGCTCGA AAACTACGGG
CACTGGGGAT TTCCCGGGGA GTTCTTGCAA TACGGCGGGC AGTCCGACGA GGTAGGCGGG
GAGTTCTGGA GCGAAGGTTC ATTGGGTGAT ATCGAGAATA AAGCTGCGTC CTCCGCTGCC
CACATTTATG GTAAGCAAAA AGTATGGGCC GAATCGTTTA CCGCTGGCGG GAAGGCCTTT
GCACGCTATC CTTATGTGAT GAAAGCCAGG GGGGACAGGT TCTTTACGGA AGGGATCAAC
AGCACCCTTC TTCACGTATT TATCCACCAG CCTTATGAAG ATCGCTGGCC GGGGATGAAC
ACGGAGTTTG GGAATGAATT TAACCGTAAG AACACCTGGT TTGATCAGAT GGATGTATTT
GCCGGTTATC TGAAAAGAAC CAATCTGCTG CTTCAACAGG GGCGATTTGT GGCGGACGTG
GCCTATTTTA TCGGTGAAGA CACGCCCAAA ATGACAGGCA TTTGTGATCC AGCGCTGCCG
AAAGGCTATT CGTTTGACTA CATCAACGCG GAAGTCCTGT TGCAGGAGGC ATCTGTGAAA
GATGGGAAAC TCACACTGGC AAGCGGAATG CAATACAAAG TGTTGGTACT GCCGCGACTG
ACAACCATGC GTCCGGAGGT TTTGCGTAAA ATCTCAGCAT TAGTGAATCA GGGACTTGCG
ATCCTGGGGC CGGCCCCGCA AAGCTCACCG AGCCTGGCAA ATTATCCGCA GGCGGATCAG
GAGGTGAAGC GAATGGCGCA AGCCCTGTGG AAAGCCACCA CGCCAACCGG ATTTGGAAAG
GTGGGGAAAG GATATGTATT TGGCGATCAG AATACGCTCG AAAACATTCT GGCTACATTA
TCGGTTATAC CCGACTTACA CATTGCCGCA GATTCTGCTT CCGTGCTGTT CTGCCATCGG
GCTCTTCCGG ATGGAAATAT TTATTTTTTA TCAAACCAGC AGCCCCAAAA AGTGAAGTTC
CAGGCCACAT TCCGGCAAAA CCAGGGGCGA CCTCAACTTT GGGACCCCTT AACGGGTGAA
GTGCGATCGC TCCCGGAGCA CAGCCGCACA GCCCAGTCGA CAACCCTTCC GCTGGAATTA
GAGGCTTACG GAAGTACGTT CATCGTATTT TCCCGCACAA ATGAGGTTGA ATCAAAGTCA
AAAAAACAAC CTGAAAACTT CCCCGCGGGC AAGCGTTTGT TGGCTTTGGA AAGGCCCTGG
CAGGTCCATT TCAAAGGTAT CAACGCCCCG GCGCAGGCTA TCACGTTTGA GCACCTTATA
GATTGGAAAG ATTCGAAGGA TTCCACCATC AAATACTTTT CAGGTACCGC TTCATACAGT
ACTACATTCA GCATGGATTC CCTGCCGCAG CAAGCGCAGT ACATCGATCT GGGCCATGTG
ATGGTGATGG CAAAAGTTTA TGTAAATGAC AAGTATGCCG GCGGCGTGTG GACAAAACCT
TACCGCTTAA ACATAACTGA CTTTCTTCAA AAAGGCGAAA ATACCCTTCG GGTGGAGGTG
GTCAACAACT GGATGAACCG GTTGATCGGC GATCAGCATT TACCTGAGTC GCACCGAAAA
ACATGGACAA GGGAAAATCC CTGGAAAGCT TCTTCGCTGC TGCAACCTTC GGGGTTACTA
GGGCCGGTAA CGATTTCTTC TTTTGATTAT CATGTTATTA AGAGTGAATA G
 
Protein sequence
MNKFYLAVCL AFSLTCSAQI VPQSPPLNVP ISSLRTSFRT GPDSLPLAVY WYWLSDHISK 
EGVVKDLESM KKVGINRAFI GHINVGAPYG EHKLFSDAWW EILHTALKKA GELNIEIGLF
NSPGWSQSGG PWIKPEASMR YLASTEVPVS GPKKMQGPLP ELGPDAQDVK VLAYPVQTKP
EAYSQILAKK DKAEASVEMP VSGNQPMRSL TIQVDKPIKT SAVLYYKQGN EYRELLRIEV
DRSNTALNVG FAPLAPVVIS LPEVKASAFR LVVAPAGTAR IAVSLSSEPQ VERYPEKTLA
KMFQTPLPLW ADYLWRQQPQ VTDPGTIVQA KAVRDLTSFY KNGQLTWDVP AGEWKIVRLA
MQTTSVTNSP ATPEGTGLEV DKMSKRHVAT HFDAYLGEIL RRIPPQDRRT FKVVVQDSYE
TGGQNWTDDM EARFVKTYGY NPVPFLPALS GKVVESQDKS DRFLWDLRRL IADRVAYDYV
GGLREQSHKH GLTTWLENYG HWGFPGEFLQ YGGQSDEVGG EFWSEGSLGD IENKAASSAA
HIYGKQKVWA ESFTAGGKAF ARYPYVMKAR GDRFFTEGIN STLLHVFIHQ PYEDRWPGMN
TEFGNEFNRK NTWFDQMDVF AGYLKRTNLL LQQGRFVADV AYFIGEDTPK MTGICDPALP
KGYSFDYINA EVLLQEASVK DGKLTLASGM QYKVLVLPRL TTMRPEVLRK ISALVNQGLA
ILGPAPQSSP SLANYPQADQ EVKRMAQALW KATTPTGFGK VGKGYVFGDQ NTLENILATL
SVIPDLHIAA DSASVLFCHR ALPDGNIYFL SNQQPQKVKF QATFRQNQGR PQLWDPLTGE
VRSLPEHSRT AQSTTLPLEL EAYGSTFIVF SRTNEVESKS KKQPENFPAG KRLLALERPW
QVHFKGINAP AQAITFEHLI DWKDSKDSTI KYFSGTASYS TTFSMDSLPQ QAQYIDLGHV
MVMAKVYVND KYAGGVWTKP YRLNITDFLQ KGENTLRVEV VNNWMNRLIG DQHLPESHRK
TWTRENPWKA SSLLQPSGLL GPVTISSFDY HVIKSE