Gene Dfer_4206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_4206 
Symbol 
ID8227808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp5086472 
End bp5089528 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content54% 
IMG OID644932053 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003088574 
Protein GI255037953 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATT GGTTGTTCGG CCGGTTTCGT GTTGTAGTGA TTGTTGTGGC GCTGGTGGTG 
CTGACCTCCG TGGCTTCCGT TGCGTGGAAA ATGATGTCGG TTAAGACCAC GGCCGTCAAC
AAACCCGCTG TTGCCGCACC CGAGCCCAAG CATAAGGCGC CCAAAAAGCG CCGTGACCCG
GTGTTGGTTC TTCCGGAAAA TGAATGGGTG GACAGTACGC TCAAATCCCT GAGCATCGAG
CAGAAGATCG GTCAGCTGTT CATGGTAGCT ACTTTTTCCA ACCGCGACGA ATCGCATTAC
CGTTACATCG ACAAGCTGAT TAACGATTAC CAGATCGGCG GACTGATATT CTTCCAGGGC
GGCCCCGTAA GCCAGGCGCA GCTCACCAAC CGCTATCAGG CGCATTCCAA CATTCCGCTT
TTCATCGGCA TCGACGGTGA ATGGGGCCTG GGCATGCGGC TCGACAGCAC GATTTCCTTC
CCGAAACAAA TGGTGCTCGG CGCGATCCAG GACGACCGGC TGATTTACCG CATGGGCGAC
GACATTGCGC GCCAATGCAA ACGCCTCGGC ATTCATATCA ACTTCGCGCC GGTGTCGGAT
GTGAACAGCA ACCCGGCCAA TCCGGTAATC GGTATACGTT CGTTTGGTGA GGACAAAGAG
AATGTCGCGC GGAAGTCGAT CGCCTACATG AAAGGCTTAC AGCACAATCG CATTATCGCT
ACTGCCAAGC ATTTCCCCGG CCATGGTGAT ACCGATGCCG ATTCGCATTT CACACTCCCG
GTGCTTAATC ATTCGCTGGA ACAGCTGAAT TCCGACTTGT ACCCTTACCG CGAAATGATC
GCCGACAGCC TGATGGGCGT CCTGAATGCG CATTTGTACA TTCCCGCATT GGACAACACG
CCCAACCAGG CCAGCTCGCT GTCTGATAAA GTGGTGAATG GCTTGCTCCG CAAAGACCTG
GGCTTCCGCG GCCTCGTGTT TACCGACGCC ATGAATATGC GCGGCGTGCT CAAAAACGGC
AAGCCGGCCG ACGTGAACCT GAAAGCGCTG GTAGCAGGCA ACGACGTGCT GCTTTACCCG
GAAAGCATCG CCGAAACCGT TTCGCGCATC AAGGATGCCA TCAATTCGAA GCTGATCAGT
GAAAAGATCA TCGACGACAA GGTGAAACGC ATTCTGCAAG CCAAATACTG GGCCGGGCTG
AACCAATATA AGCCGATAGA CATCAACAAT CTTTACAACG ACCTCAACAG CGAGAAGAGC
AAGGAACTGA ACCGCGAGCT CTCTGAGGCT TCCGTAACCG TTGTTAAAAA CGACAATAAC
CTGATCCCGA TCGGCTCGGT GATGGACAAC AACATAGCGT CCGTGAGCAT GGGCGAGGGC
AGCGGTGTGG CATTCCAAAA GATGCTCTCC ACCTACAAAC CGATGCGGAG CTACTCGTTT
TACGACGCGC CGTCGTCGCA GGACCAGATT ACCGAAATGC TCGGTTACCT GCAACCTTAC
AATACAGTGA TCGTCGACGT CCACGGCATT AGCTCCAAGC CCGGCCGGAA TTATGGCATT
ACCACGGGCA TGGTGGAGTT TGTTAACCAG CTCAAACAGC AGAATAAAAA AGTGATCCTC
TGCCTGTTCG GAACGCCTTA TAGCATTCAG TTTTTCCCTG ACACCGATGT GCTCATTTGC
GCCAACCAGG ACGGCAAAGA CCAGCAGGAG ATCGTGCCGC AGATCATTTT CGGCGCGCTG
GGCTCGAAGG GCCGCCTGCC GGTGTCGGTA CTGGCCCACA AAAGTGGCGG CGGTGTAAGC
ACCACGTCCA TCAATCGCAT TGCATTCGGC ACGCCGGAGA GTGTGGGAAT GGATGGTATT
TCTCTTAAAA AAATCGATGA AATCGCCACT GCGGCCGTGA ATGACCATGT TTTCCCCGGC
TGCGAAGTGC TGGTGGCCAG ACAGGGTAAG ATCATTTACG AAAAACAATT CGGTGGGTTA
AGCTATCGCA CCAACGAACG CGTAACGCCC GAGACGATCT ACGACCTGGC GTCGCTTACC
AAGGTTTCGG CTACATTGCA GGCTGTGATG CTCCTTTATG ACCGCAAGCA GATCAGTTTG
GATGAAAAGG CATCCAAATA CCTGCCTGAA CTGGCGGGTA CCAATAAGCA AAACTTCACC
GTCCGTGATT TGCTGCTGCA TCGTTCGGGA CTGGTTTCGT TCTACCCGTC GCTGTGGGAC
CGTACCAAAA CCAGCGCAGG CGGTCTTCTG CCGGAATATT ACAGTTCGAA ACAAGACACG
GCCTACTACC TGCAGGTAGC CCCGAAACTC TTCGCCAAAG GTGCATTGCG CGATTCGGTA
TGGAAGTGGG TGGTGGAATC GCCCATGAAC AACCGCCGCG ACCGCGCCGG CAGCTACGGT
TACCTGTACA GTGACCTCGG CTTTTTGACA CTCCAAAAAA TTGTCGAAAG AGTGGCCGGC
CAGTCGCTCG ACAATTTTGT GGCCGCCAAC ATTTACGAGC CGCTGGGGCT GCCCTACCTG
GGTTTCAACC CGCTGCGTCG TTTTCCTGAA AAGCAGATTG CGCCTACCGA GCAGGATTAC
CGGTTCCGCG GGCAGCTGCT ACAAGGAACA GTCCATGATC AGATGGCCGC CATTGTTGGC
GGCGTATCGG GCCACGCCGG CCTTTTTGGC ACCGCACGCG ACCTGGCGGT TTTACTTCAA
ATGAACCTCT GGAAGGGCAA CTACGCAGGA AAACGCTATT ATGAGCAAGC CACGGTGCCA
TTCTTCTCGC GGATGTACGA CGAATCCCAT CACCGCGGAC TGGGCTGGGA CAAGGCCCCT
GCGGACGGCA ACAGCTCATT CGTATCACCG CTGGCGTCCG TCAACTCATT CGGCCACACG
GGTTTCACCG GTACAATGGT GTGGGTAGAT CCCGAGGAAG ACCTGGTATT CATATTCCTT
TCCAACCGCG TGAACCCCGA CCCGGAGAAT ACGGCCATTA CGACCCATCG TACACGAAGA
AAAATCCAGG ATGTAGTGTA TGGTTCTTTG ATTCAACGGA AAAGCCAGCT TCCTTAA
 
Protein sequence
MKNWLFGRFR VVVIVVALVV LTSVASVAWK MMSVKTTAVN KPAVAAPEPK HKAPKKRRDP 
VLVLPENEWV DSTLKSLSIE QKIGQLFMVA TFSNRDESHY RYIDKLINDY QIGGLIFFQG
GPVSQAQLTN RYQAHSNIPL FIGIDGEWGL GMRLDSTISF PKQMVLGAIQ DDRLIYRMGD
DIARQCKRLG IHINFAPVSD VNSNPANPVI GIRSFGEDKE NVARKSIAYM KGLQHNRIIA
TAKHFPGHGD TDADSHFTLP VLNHSLEQLN SDLYPYREMI ADSLMGVLNA HLYIPALDNT
PNQASSLSDK VVNGLLRKDL GFRGLVFTDA MNMRGVLKNG KPADVNLKAL VAGNDVLLYP
ESIAETVSRI KDAINSKLIS EKIIDDKVKR ILQAKYWAGL NQYKPIDINN LYNDLNSEKS
KELNRELSEA SVTVVKNDNN LIPIGSVMDN NIASVSMGEG SGVAFQKMLS TYKPMRSYSF
YDAPSSQDQI TEMLGYLQPY NTVIVDVHGI SSKPGRNYGI TTGMVEFVNQ LKQQNKKVIL
CLFGTPYSIQ FFPDTDVLIC ANQDGKDQQE IVPQIIFGAL GSKGRLPVSV LAHKSGGGVS
TTSINRIAFG TPESVGMDGI SLKKIDEIAT AAVNDHVFPG CEVLVARQGK IIYEKQFGGL
SYRTNERVTP ETIYDLASLT KVSATLQAVM LLYDRKQISL DEKASKYLPE LAGTNKQNFT
VRDLLLHRSG LVSFYPSLWD RTKTSAGGLL PEYYSSKQDT AYYLQVAPKL FAKGALRDSV
WKWVVESPMN NRRDRAGSYG YLYSDLGFLT LQKIVERVAG QSLDNFVAAN IYEPLGLPYL
GFNPLRRFPE KQIAPTEQDY RFRGQLLQGT VHDQMAAIVG GVSGHAGLFG TARDLAVLLQ
MNLWKGNYAG KRYYEQATVP FFSRMYDESH HRGLGWDKAP ADGNSSFVSP LASVNSFGHT
GFTGTMVWVD PEEDLVFIFL SNRVNPDPEN TAITTHRTRR KIQDVVYGSL IQRKSQLP