Gene Information |
Locus tag | Dfer_4456 |
Symbol | |
ID | 8228059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 5374550 |
End bp | 5377582 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644932302 |
Product | TonB-dependent receptor plug |
Protein accession | YP_003088822 |
Protein GI | 255038201 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
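The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence does end in TAA). The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record for Dfer_4456.
length_bp = gene_length(5374550, 5377582)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3033 1010
```

Both results agree with the Gene Length (3033 bp) and Protein Length (1010 aa) fields in the table.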
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000450006 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.634618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACTA TTGTACGATT ACTGTACATG CTCGTTTTCC TGGCGGTAGG GAATATCGCC TTCGCACAGG AAATAAATGT GACTGGGAAA GTCACATCGA GTGCAGATAA TTCGGCGCTG CCGGGCGCCA GTGTGCTCAT CAAAGGCACC ACCACCGGTG TTCCTACCGA TGCCGAGGGT AACTACGCCA TAAAAGTCCC GTCCACGCAA TCCACCCTCG TATTCTCGAT GATCGGAATG ACACCCCAGG AAATCACAGT GGGCAACCAA ACCACTGTGA ATGTTGCGCT GGCCGAAGAC GCCAAAGCAT TGAATGAAGT GCTGGTGGTG GGTTACGGTT CCCAAAAGAA ACTTGACGTA ACGGGCGCTA TCACGCAAAT CAAGGGTGCC GAAATCGCGA AACAGCCTTC TATGAATGCC GTCAGCGGTT TGCAGGGTAA GGTTGCCGGT GTGCAGATCA ACAACTCCGG TAAGCCGGGC GAGGCGCCGC AGATCCGCAT CCGCGGTGTA GGAACAGCTT ACGGCAGCGC CAACCCGCTA TATGTGGTGG ACGGTGTGTG GTTCGACGAT ATCAGCTTCC TCAACTCAGG CGACATCGAG AGCATGAACA TCCTGAAAGA TGCTTCCAGC CAATCCATTT ACGGGGTGCG CGCAGCAAAC GGGGTGGTAC TGATTACAAC CAAAAAGGGC AAATCAGGCC AGGCTGTCAT CGATTACAAT GGTTTTGTGG GTTACCAGAA AGTGACCAAC CAGATCGAAA TGGCGAATGG AAACGAGTAC GCCACGATGA TCAACGAGCT GAGCCGCATT AACGGTAAAC CGGACATTCT CGACCCCGCG CAATTCGGCG AAGGCACCGA CTGGTACCGC CAGATCCTGC GTAACGGATT CCTGACCAAC CACCAGCTTT CGATCAGCGG TGGCGGTGAA AAATCAACCT ACAACTTCTC ACTTGGTTAC CTGGATCAGA ACGGTATCGT GGAGAGAAAC AATTTCAAGC GCTACACGGC ACGTTTGCAA AATGATTTCC AGGTGCTGAA AAACCTGAAA GTCGGCTATT CTGCAACCGC TTCTTACAGC AAATCGAAAG ATGAAGCAGG CGGCATTTTC CGTCAGTTGT ACGCGGCTGG CCCGGTTGTT CCTGTATACT ACGCCGATGG TACTTATGGT GATCCCAACG ATTTCAGCCT TGGGGACGGT AATAACTTCA ACCCACAGGT AACAGTCGAT TTTTATAATC AAAACACCAA AAAAACGCTG CTTACCGGTA ATGCCTACGC AGAACTTTCG CTTTTCAAAG GCCTGACATT CCGCAGCAGC CTCGGCGGCC GTTACGGACA GGACGAAACC CGCACTTATG TGCCGCAATA CGTGGCGACA TTCAAGCAGC GCAATACGAC CAGCTTCCTG GAATTCGTTC GTCCGCAGAC CCGCTACTGG ATTTTTGAAA ATACATTGAC CTACACCAAA GATTTCGGTC AGCACAGCAT TACTGCATTG CTCGGTCAAT CCGCACAGCG CGACCAGTCG TACAAGGTTA CGGCCAATGC ATTAAACGTG CCATACAGCA GCGAAGGCGA TTTATACCTG GCATTAGGTA GCGCCGACAG CCGTAACATC ACCGACGAAG GCGATCTCGG AACCTATGCT TCATATTTCG GTCGGGTAAA CTACTCTTTC GGTGAGCGCT ACTTGCTGAA CGCGTCACTT CGTGCCGATG GTTCTTCGAA GTTCTTCCAG GGCGGCAATG CGTGGGGTTA TTTCCCTTCG GTGGGTGCAG GCTGGGTGAT CAGCAATGAA 
GACTTCATGA AAGGCCAGAC GATTTTCGAC AACCTGAAAA TCCGCGGTAG CTGGGGTAAG ATCGGTAATG CATCCGTGCC TTCCAACCTT TCTACACTGA CCGTAGCCAC TGGCGGCGGC CTGGCGGCGA TCTTCGGCGG GCAGCTGAAT ACCGGTGCGA GCATTAACAC GATCATCCCT CCTACCACTT ACTGGGAGCG CGGTGTAGGT ACCGACGCGG GTATCGAAGC TTCATTCCTG AAATCGCGGT TGACGGTTGA ATTGGATTAT TATATCAAGA AAACCGAACG TGCGATTTTC GACATTCCGG TACTGACTTC CATCGGTACA AGCTCTGGCC GCATCGTGGG TAACCAGGCC GACTTCCAGA ATAAAGGTTT TGAATTCGCA TTGAACTGGC GTGACGACAT CGGCGACGGT CTTTCTTATA GCGTCGGAGT GAATGGTGCG ATGAACAACA ACAAAGTGCT TTCTGTAACC TCCGGAGCCA ACCCGATTTA TGACGGTGGC GTGGGCCTCA CGAGCGGCGC GCTCGCCACA CGTACCCGCG TAGGCGACCC GATCGGCTCA TTCTACGGCT ATGTGGTGGA CGGTATTTTC CAGAATGAAG AAGAAATCAG AAACTCGGCA CAGCCAAGCG CGAAACCGGG CGATTTCAGA TACCGCGACA TCAGCGGAAC GAACGGAACC CCGGACGGCA ACATCTCCGG TCTGGACCGC CAGGTAATCG GTAACCCGAA CCCGAAATTC ACGTACGGTA TCAACACCAG CTGGAATTAC AAAAATGTGG ACCTGATGCT CGACTTCCAG GGCGTTGCCA AAGTGGATAT CTACAATGCC AACCTCGGCT GGCGCTATGG TAACGAGAAC TTTACCAAGG ATTTCTACGA AAACAGATGG CACGGCGAAG GTACTTCGAA CACCTATCCT TCGGCTAACA TCGGAGGCGG CCAGAACTAC CTGCCGAACA CCTTCTTCGT ACAAAGCGGC AGCTACTTCC GGGTAAGAAA TGCGCAAATC GGCTACACTT TCCCGCAGGC ATTCAACGAA AAGCTGAAAA TCCGTAAGCT GCGTTTGTAC GCGAATGCGC AAAACCCTTT GAATTTCTTT AAATACAAAG GCCTGTCGCC GGAGGTGAGA GCCAACGAAA ACAAACCGAC ACAAGCCAAT ATCGACGCCA ATGTGTACCC GCTTTCAGCG ACTTACAACT TTGGTATTAA TGTCACTTTC TAA
|
Protein sequence | MKTIVRLLYM LVFLAVGNIA FAQEINVTGK VTSSADNSAL PGASVLIKGT TTGVPTDAEG NYAIKVPSTQ STLVFSMIGM TPQEITVGNQ TTVNVALAED AKALNEVLVV GYGSQKKLDV TGAITQIKGA EIAKQPSMNA VSGLQGKVAG VQINNSGKPG EAPQIRIRGV GTAYGSANPL YVVDGVWFDD ISFLNSGDIE SMNILKDASS QSIYGVRAAN GVVLITTKKG KSGQAVIDYN GFVGYQKVTN QIEMANGNEY ATMINELSRI NGKPDILDPA QFGEGTDWYR QILRNGFLTN HQLSISGGGE KSTYNFSLGY LDQNGIVERN NFKRYTARLQ NDFQVLKNLK VGYSATASYS KSKDEAGGIF RQLYAAGPVV PVYYADGTYG DPNDFSLGDG NNFNPQVTVD FYNQNTKKTL LTGNAYAELS LFKGLTFRSS LGGRYGQDET RTYVPQYVAT FKQRNTTSFL EFVRPQTRYW IFENTLTYTK DFGQHSITAL LGQSAQRDQS YKVTANALNV PYSSEGDLYL ALGSADSRNI TDEGDLGTYA SYFGRVNYSF GERYLLNASL RADGSSKFFQ GGNAWGYFPS VGAGWVISNE DFMKGQTIFD NLKIRGSWGK IGNASVPSNL STLTVATGGG LAAIFGGQLN TGASINTIIP PTTYWERGVG TDAGIEASFL KSRLTVELDY YIKKTERAIF DIPVLTSIGT SSGRIVGNQA DFQNKGFEFA LNWRDDIGDG LSYSVGVNGA MNNNKVLSVT SGANPIYDGG VGLTSGALAT RTRVGDPIGS FYGYVVDGIF QNEEEIRNSA QPSAKPGDFR YRDISGTNGT PDGNISGLDR QVIGNPNPKF TYGINTSWNY KNVDLMLDFQ GVAKVDIYNA NLGWRYGNEN FTKDFYENRW HGEGTSNTYP SANIGGGQNY LPNTFFVQSG SYFRVRNAQI GYTFPQAFNE KLKIRKLRLY ANAQNPLNFF KYKGLSPEVR ANENKPTQAN IDANVYPLSA TYNFGINVTF
|
| |
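The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own method. It uses only the standard library and a hand-built codon table. The amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). Although the gene lies on the minus strand, the sequence as displayed starts with ATG, so the sketch assumes it is already the coding strand and does no reverse complementing. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAAACTA TTGTACGATT ACTGTACATG"
print(translate(prefix))  # MKTIVRLLYM
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein sequence and the reported GC content.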