Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_3042 |
Symbol | |
ID | 8226616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 3719265 |
End bp | 3722207 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644930873 |
Product | Cellulose synthase (UDP-forming) |
Protein accession | YP_003087422 |
Protein GI | 255036801 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0324719 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAG CGCTGTACGT AAAGCCGCCC ACCCGCAAGC AGCTGATCAT GCTGCGGCTG ATGATCTTCT TCGGCTTGAT TTCGATGGGG TTTTTCCTGT TCAGCGTGCT GTCACCGGCG GTCCGCGGCT ATGCGCCGTT GTACTGGATG CTGGTGGTGA CGTTCGTTTT CACCTGCCTG AAAGTGCTGC ACGAATGGTA CCATTACCTC TACATAACCG TTCCGCCCAC GCCACCACCC ACGCGCAGTT ACACCGTGGA TATTTTCACA ACCTTCTGCG CCGGAGAGCC TTACGAGATG ATTATCGAAA CGCTTACCGC AATGCAGGCC ATTACCTATC CGCACGAGAG TTACCTGTGC GACGAGGCCG ATGATCCTTA CCTGCGCGAC GTTTGCGCAC GCCTGGGCGT GCATCACGTC ACGCGCATTG AGAAAACGAA TGCCAAAGCC GGGAACATCA ATAACGCATT GCGCATCTCG AATGGAGAAC TTTGCGTAGT CCTCGACCCC GACCACGTAC CTTTCCCCGA TTTCCTGGAC CCGATTGTTT CGCATTTCGA TAATCCGGAA ATCGGCTACG TCCAGATTGT ACAGGCTTAT AAAAATCACG ACGAAGGACT GATCGCCAAA GGTGCCGCCC AGCAAACATA CCAGTTTTAC GGCCCGATGA TGATGACCAT GAATCATTAT GGCACCGTGC TGGCCATTGG TGCAAACTGC ACGTTCCGGC GCACGGCACT GGACTCGATC GGCGGGCATG CGGCGGGGCT GGCCGAGGAC ATGCACACCT CCATGCAGCT GCACGCGAAA GGATGGAAAT CGGTGTATGT GCCGGCGGTG CTGGCACGCG GGCTGGTTCC GTCTACGCTT TCGGCTTACT ACAAGCAACA GCTCAAATGG TCGCGCGGGG TGTTTGACCT TTTCGTTCAT GTTTATCCGA GGCTGTTCAC GAAGTTCACT TGGAGTCAGC GCATTCATTA TGGCACAATT CCGCTGCATT ACCTGTCGGG CTTCATTTTC CTGATCAACT TCCTGATTCC CGCAATAGCG CTCGTGCTGG GCGTAAGTCC CATGCATTTC GACCTCGCCG ATTTCGGGCT CGTTATCCTT CCGATGGTTT CCTGCATCGT TTTGATCCGG CATTTCGTGC AATGGTGGGT GATGGAGGAT GAAGAACGCG GATTTCATGT GGTGGGTGGC TTGCTGATGA TCGGGACCTG GTGGATATTC ATTCTGGGCG TGCTGTACAC GATTTCAGGC AAAAAAATCC CGTATGTACC TACGCCTAAG GACGGCAACG AAGCCAACAA CTGGCCGTTG AATGTACCCA ATCTGGTAGT ATTGGGTATT TCAATGCTCG CGATCGTGTA TGGACTCTAT CAGGATCTTA ACCCCTATAA CCTTATCATG GCCGGTTTCG CGGGGTTGAA CTGCTTTTTT ATGTGCTTCA ACATCGCCGC CAGCCGTCAG CAGCAAATCC GTGAGCTTTC CGTCACATCG CCCCTGATGA ATACCGTTTT CAGGGCTATT AAAGAGTTGA AAGGCAATTT CTGGATTCTG CGCCGGCGGG TTTACAGCGG CGTAAGGACG TCGGCTTTCC TGCTCACCGT TCTCGTTATC AGCACTATTA TCTATTTCCG GCGTTTTAAT CCCCAATTGG AACACCAGCT CGCGGTTGCA CGCGAAAACG AGCAGTATGC ACGCAGCCTC GCCGGCATTA AACCCGCTAA ACGTCCTGAT ATGCCCGCAC TTTTCCGTGC GATGGGCATC CAGGAACCGC GGCAGGCCGA TGCCAAACAA GCTGGCGTAC CATTTTTCCC GGGCGAGCGC GGGGTTAATT ACACGAAAGG CCACAACTGG TCGCGCCGGT ATCCTGCATT CACGAAGAGG GAACTCGAAG CTGATATCAC GCTTATGCAG CAAACCGGCA TCAATGCGAT CCGGCATTTT GGACCGGGTA TTTACGATTA CAATGTGCTG AAAGCGACGC GGCAGGCGGG TATCCGCGTG CATTACACAT TCTGGATACC CGAAGCGCTG GATTTCATCC GGGATAAAGA AGAAGCGGAC GACCTGGCGG CCAAAATACT CGCCACCGTC CGACGGCTGA ACCATCATGC GCACATTGTT TCCTGGAACA TCGGCAATGC GGCCATCCAG CGGCATCGCC GCGCCGAAAG CACCGGGGAG CAGCGGCAGT TTTTGTATTG GCTCAAAAAC CTCAGCGCGG CAATCAAGAA GGTTGACGAC AAGCGGCCGC TCACGACGGA TATCGAGTTA ACCGATGAAG CATTCCGCAT CGCTTACCTG ATCAAGCGCG TCGCTCCGGC GATCGATGCA TTCGGGCTGG TGGTGGAAGA CCCGCATCAG CAGCCCGACG CACCGGCACT GCGCAGGCTC GGCATGCCAT TCTATTTTTC CTACATAAGT GTGTCTGCAT TCTCCCAAAT GCAGCAACCG CTTGCAGGAA CGTTCATTTC CAACTGGCAG GACGAAAAGA TCTTCGCACA CGTGAGCCTC GACGGCCTGT ACGACTACGC CGGGCGGCCC AAGCGCCCAT TGCAGGTGTT GCAGTCGATT TGGGGAAAAG GGAAACCGCC GGAGCCGGTG GCAAGTTTCA GGATACTGCG CCCTGCACTG GGCACTTTTG AGGGCACAAC ACTCGACTAT CACGCCATAA AATGGCAGAA CGAAAAGTGG GAAATGGCGG CCGCATCGCA AAAAGGGCCG CGATTTGAGT GGAAACTTGT GAGAACGGAC GGATTCGACA ACCTCGTGGA GATGACCGAT GCAGGCACCG GGCCGCGACT CGCTTTGACC ATTCCGAGGC ATCCCTCGTT ATACCGGCTG TATTTGTATG TAATCCAGGG TAATGTGATT ACTGAAATCG TTGATTCCCC ACTGAATACG CCGCTAGAAC CGGTAGTAGG ACCTGCTCAC TGA
|
Protein sequence | MKQALYVKPP TRKQLIMLRL MIFFGLISMG FFLFSVLSPA VRGYAPLYWM LVVTFVFTCL KVLHEWYHYL YITVPPTPPP TRSYTVDIFT TFCAGEPYEM IIETLTAMQA ITYPHESYLC DEADDPYLRD VCARLGVHHV TRIEKTNAKA GNINNALRIS NGELCVVLDP DHVPFPDFLD PIVSHFDNPE IGYVQIVQAY KNHDEGLIAK GAAQQTYQFY GPMMMTMNHY GTVLAIGANC TFRRTALDSI GGHAAGLAED MHTSMQLHAK GWKSVYVPAV LARGLVPSTL SAYYKQQLKW SRGVFDLFVH VYPRLFTKFT WSQRIHYGTI PLHYLSGFIF LINFLIPAIA LVLGVSPMHF DLADFGLVIL PMVSCIVLIR HFVQWWVMED EERGFHVVGG LLMIGTWWIF ILGVLYTISG KKIPYVPTPK DGNEANNWPL NVPNLVVLGI SMLAIVYGLY QDLNPYNLIM AGFAGLNCFF MCFNIAASRQ QQIRELSVTS PLMNTVFRAI KELKGNFWIL RRRVYSGVRT SAFLLTVLVI STIIYFRRFN PQLEHQLAVA RENEQYARSL AGIKPAKRPD MPALFRAMGI QEPRQADAKQ AGVPFFPGER GVNYTKGHNW SRRYPAFTKR ELEADITLMQ QTGINAIRHF GPGIYDYNVL KATRQAGIRV HYTFWIPEAL DFIRDKEEAD DLAAKILATV RRLNHHAHIV SWNIGNAAIQ RHRRAESTGE QRQFLYWLKN LSAAIKKVDD KRPLTTDIEL TDEAFRIAYL IKRVAPAIDA FGLVVEDPHQ QPDAPALRRL GMPFYFSYIS VSAFSQMQQP LAGTFISNWQ DEKIFAHVSL DGLYDYAGRP KRPLQVLQSI WGKGKPPEPV ASFRILRPAL GTFEGTTLDY HAIKWQNEKW EMAAASQKGP RFEWKLVRTD GFDNLVEMTD AGTGPRLALT IPRHPSLYRL YLYVIQGNVI TEIVDSPLNT PLEPVVGPAH
|
| |