Gene Dfer_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_3042 
Symbol 
ID8226616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp3719265 
End bp3722207 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content55% 
IMG OID644930873 
ProductCellulose synthase (UDP-forming) 
Protein accessionYP_003087422 
Protein GI255036801 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0324719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAG CGCTGTACGT AAAGCCGCCC ACCCGCAAGC AGCTGATCAT GCTGCGGCTG 
ATGATCTTCT TCGGCTTGAT TTCGATGGGG TTTTTCCTGT TCAGCGTGCT GTCACCGGCG
GTCCGCGGCT ATGCGCCGTT GTACTGGATG CTGGTGGTGA CGTTCGTTTT CACCTGCCTG
AAAGTGCTGC ACGAATGGTA CCATTACCTC TACATAACCG TTCCGCCCAC GCCACCACCC
ACGCGCAGTT ACACCGTGGA TATTTTCACA ACCTTCTGCG CCGGAGAGCC TTACGAGATG
ATTATCGAAA CGCTTACCGC AATGCAGGCC ATTACCTATC CGCACGAGAG TTACCTGTGC
GACGAGGCCG ATGATCCTTA CCTGCGCGAC GTTTGCGCAC GCCTGGGCGT GCATCACGTC
ACGCGCATTG AGAAAACGAA TGCCAAAGCC GGGAACATCA ATAACGCATT GCGCATCTCG
AATGGAGAAC TTTGCGTAGT CCTCGACCCC GACCACGTAC CTTTCCCCGA TTTCCTGGAC
CCGATTGTTT CGCATTTCGA TAATCCGGAA ATCGGCTACG TCCAGATTGT ACAGGCTTAT
AAAAATCACG ACGAAGGACT GATCGCCAAA GGTGCCGCCC AGCAAACATA CCAGTTTTAC
GGCCCGATGA TGATGACCAT GAATCATTAT GGCACCGTGC TGGCCATTGG TGCAAACTGC
ACGTTCCGGC GCACGGCACT GGACTCGATC GGCGGGCATG CGGCGGGGCT GGCCGAGGAC
ATGCACACCT CCATGCAGCT GCACGCGAAA GGATGGAAAT CGGTGTATGT GCCGGCGGTG
CTGGCACGCG GGCTGGTTCC GTCTACGCTT TCGGCTTACT ACAAGCAACA GCTCAAATGG
TCGCGCGGGG TGTTTGACCT TTTCGTTCAT GTTTATCCGA GGCTGTTCAC GAAGTTCACT
TGGAGTCAGC GCATTCATTA TGGCACAATT CCGCTGCATT ACCTGTCGGG CTTCATTTTC
CTGATCAACT TCCTGATTCC CGCAATAGCG CTCGTGCTGG GCGTAAGTCC CATGCATTTC
GACCTCGCCG ATTTCGGGCT CGTTATCCTT CCGATGGTTT CCTGCATCGT TTTGATCCGG
CATTTCGTGC AATGGTGGGT GATGGAGGAT GAAGAACGCG GATTTCATGT GGTGGGTGGC
TTGCTGATGA TCGGGACCTG GTGGATATTC ATTCTGGGCG TGCTGTACAC GATTTCAGGC
AAAAAAATCC CGTATGTACC TACGCCTAAG GACGGCAACG AAGCCAACAA CTGGCCGTTG
AATGTACCCA ATCTGGTAGT ATTGGGTATT TCAATGCTCG CGATCGTGTA TGGACTCTAT
CAGGATCTTA ACCCCTATAA CCTTATCATG GCCGGTTTCG CGGGGTTGAA CTGCTTTTTT
ATGTGCTTCA ACATCGCCGC CAGCCGTCAG CAGCAAATCC GTGAGCTTTC CGTCACATCG
CCCCTGATGA ATACCGTTTT CAGGGCTATT AAAGAGTTGA AAGGCAATTT CTGGATTCTG
CGCCGGCGGG TTTACAGCGG CGTAAGGACG TCGGCTTTCC TGCTCACCGT TCTCGTTATC
AGCACTATTA TCTATTTCCG GCGTTTTAAT CCCCAATTGG AACACCAGCT CGCGGTTGCA
CGCGAAAACG AGCAGTATGC ACGCAGCCTC GCCGGCATTA AACCCGCTAA ACGTCCTGAT
ATGCCCGCAC TTTTCCGTGC GATGGGCATC CAGGAACCGC GGCAGGCCGA TGCCAAACAA
GCTGGCGTAC CATTTTTCCC GGGCGAGCGC GGGGTTAATT ACACGAAAGG CCACAACTGG
TCGCGCCGGT ATCCTGCATT CACGAAGAGG GAACTCGAAG CTGATATCAC GCTTATGCAG
CAAACCGGCA TCAATGCGAT CCGGCATTTT GGACCGGGTA TTTACGATTA CAATGTGCTG
AAAGCGACGC GGCAGGCGGG TATCCGCGTG CATTACACAT TCTGGATACC CGAAGCGCTG
GATTTCATCC GGGATAAAGA AGAAGCGGAC GACCTGGCGG CCAAAATACT CGCCACCGTC
CGACGGCTGA ACCATCATGC GCACATTGTT TCCTGGAACA TCGGCAATGC GGCCATCCAG
CGGCATCGCC GCGCCGAAAG CACCGGGGAG CAGCGGCAGT TTTTGTATTG GCTCAAAAAC
CTCAGCGCGG CAATCAAGAA GGTTGACGAC AAGCGGCCGC TCACGACGGA TATCGAGTTA
ACCGATGAAG CATTCCGCAT CGCTTACCTG ATCAAGCGCG TCGCTCCGGC GATCGATGCA
TTCGGGCTGG TGGTGGAAGA CCCGCATCAG CAGCCCGACG CACCGGCACT GCGCAGGCTC
GGCATGCCAT TCTATTTTTC CTACATAAGT GTGTCTGCAT TCTCCCAAAT GCAGCAACCG
CTTGCAGGAA CGTTCATTTC CAACTGGCAG GACGAAAAGA TCTTCGCACA CGTGAGCCTC
GACGGCCTGT ACGACTACGC CGGGCGGCCC AAGCGCCCAT TGCAGGTGTT GCAGTCGATT
TGGGGAAAAG GGAAACCGCC GGAGCCGGTG GCAAGTTTCA GGATACTGCG CCCTGCACTG
GGCACTTTTG AGGGCACAAC ACTCGACTAT CACGCCATAA AATGGCAGAA CGAAAAGTGG
GAAATGGCGG CCGCATCGCA AAAAGGGCCG CGATTTGAGT GGAAACTTGT GAGAACGGAC
GGATTCGACA ACCTCGTGGA GATGACCGAT GCAGGCACCG GGCCGCGACT CGCTTTGACC
ATTCCGAGGC ATCCCTCGTT ATACCGGCTG TATTTGTATG TAATCCAGGG TAATGTGATT
ACTGAAATCG TTGATTCCCC ACTGAATACG CCGCTAGAAC CGGTAGTAGG ACCTGCTCAC
TGA
 
Protein sequence
MKQALYVKPP TRKQLIMLRL MIFFGLISMG FFLFSVLSPA VRGYAPLYWM LVVTFVFTCL 
KVLHEWYHYL YITVPPTPPP TRSYTVDIFT TFCAGEPYEM IIETLTAMQA ITYPHESYLC
DEADDPYLRD VCARLGVHHV TRIEKTNAKA GNINNALRIS NGELCVVLDP DHVPFPDFLD
PIVSHFDNPE IGYVQIVQAY KNHDEGLIAK GAAQQTYQFY GPMMMTMNHY GTVLAIGANC
TFRRTALDSI GGHAAGLAED MHTSMQLHAK GWKSVYVPAV LARGLVPSTL SAYYKQQLKW
SRGVFDLFVH VYPRLFTKFT WSQRIHYGTI PLHYLSGFIF LINFLIPAIA LVLGVSPMHF
DLADFGLVIL PMVSCIVLIR HFVQWWVMED EERGFHVVGG LLMIGTWWIF ILGVLYTISG
KKIPYVPTPK DGNEANNWPL NVPNLVVLGI SMLAIVYGLY QDLNPYNLIM AGFAGLNCFF
MCFNIAASRQ QQIRELSVTS PLMNTVFRAI KELKGNFWIL RRRVYSGVRT SAFLLTVLVI
STIIYFRRFN PQLEHQLAVA RENEQYARSL AGIKPAKRPD MPALFRAMGI QEPRQADAKQ
AGVPFFPGER GVNYTKGHNW SRRYPAFTKR ELEADITLMQ QTGINAIRHF GPGIYDYNVL
KATRQAGIRV HYTFWIPEAL DFIRDKEEAD DLAAKILATV RRLNHHAHIV SWNIGNAAIQ
RHRRAESTGE QRQFLYWLKN LSAAIKKVDD KRPLTTDIEL TDEAFRIAYL IKRVAPAIDA
FGLVVEDPHQ QPDAPALRRL GMPFYFSYIS VSAFSQMQQP LAGTFISNWQ DEKIFAHVSL
DGLYDYAGRP KRPLQVLQSI WGKGKPPEPV ASFRILRPAL GTFEGTTLDY HAIKWQNEKW
EMAAASQKGP RFEWKLVRTD GFDNLVEMTD AGTGPRLALT IPRHPSLYRL YLYVIQGNVI
TEIVDSPLNT PLEPVVGPAH