Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_4223 |
Symbol | |
ID | 8227826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 5107095 |
End bp | 5110133 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644932071 |
Product | TonB-dependent receptor plug |
Protein accession | YP_003088591 |
Protein GI | 255037970 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000201846 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.958019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAT CAATTCTACG TTTTGATCGT CCCTACGAGA AGAACGTGCA AGGACCCGCG TCCCTGAGGA TGTGCGGATT GGGCATATTG CTACTCTTAT GCACCCTGTT TTTCAATAAT GCACATGCCC AGGACGTCGC CCTCAAAGGC AAAGTGACCG ACTCCAAGGG AAATGCCCTG CCGGGTGTTT CCATCACCAT TAAGGGAACT ACCAAAGGAA CTTCCACCGA CATGGAAGGT GGCTACGCGA TCTCCGCGCC GCCTGCATCC ACATTATCAT TCAGCTTCAT CGGTTTTTCA TCCAAAGACG TCGTAGTAGG CAATGAAACG ACGATCGACG TGGTGCTCAG CGACGACGTG AAAGCGCTCG AAGAGGTGGT CGTGATTGGC TACGGCGCCG TGAAGAAGAA GGATGTGACG GGTTCCGTGG TTTCGCTGCA ACCGAAAGAC TTCAACAAAG GGGTGATCAC AGCGCCGGAC CAGCTCATTC AGGGCCGGAC GCCGGGTGTG ATGGTGATCA ACAACACGGG CCAGCCCGGC GGTGCAACCA CCGTCCGCAT CCGTGGTAAC TCCTCGATCC GCGCCGGTAA CAACCCGCTG TTCGTACTCG ACGGAATCCC GCTTTCGGGT AGCTCGGCCC GTCCTGGCGG AAGCGGCGGC TTCGGCTCCG ACGGCGGTAA CCCGCTCGCT TACCTCAACC CGAACGACAT CGCGAGCATG GACATCCTCA AAGATGCCTC GGCAACGGCC ATTTACGGGT CACGTGGTGC CAACGGTGTG GTGATCATCA CCACCAAAAG AGGAGCTTCC GGAGCGCCTA CCGTGAGTTT CAATGCTTCC ACCGGCGTAT CCAGCCTGCT TAGAAAGCCG GAGGTGCTCA ATGCCGACGA GTTCAGACAA GCCGTTCAGT TCTACGCACC CAATGATCTG AAAGGCGCTG ATTTCGGTGG AAATGTGAAT GCATTCGATG AAATCACACG CGTAGCACCG GTTCAGAACT ACTCGATGGC GATCACCGGC GGTACGGATA ATGGTAAATA CCGGGTATCG GCCGGCTATA TGGACCAGAA AGGGATCATC GAAACATCTC GCCTGAAAAA GTACACAGCC AGCCTGAACA CGAGTTTCAA GTTCCTTGAA AGCAAAAAGC TGGGACTGGA CTTCAATGTA TTGTTTACGC AAACGGACGA AAACCTTGCG CCGATTGACG TAGGTGTCGG TTTTGAAGGA AACGTAATTT CTCAGGCGCT TACCTGGAAC CCCACCCGCC CACTGCGCGT GAATTCCGAC TCGCTGACTT ACGTGAGCCC TTCGACGATC AACCCGCTCG CATCGCTGCA AGCCTACAAG GACATTGCGG TGGTGAATAC CACGGTGGCC AGCATTGCGC CTTCGTACAA GATCACCAAC GACCTCGAGT ACAAACTCGT GTACAGCATT ACCCGCCAGA CAGGTACGAG AAAGGGACGT TACATCCGCG GCCTCATCGA CCCGGCACAG ATCCGCAACG GCTTTGCATT CATCGGCAAC AACTCCGAAA CGAACCAGCA GCTCACCCAT ACGCTGAGCT ACAACAAGGA TATCAGCACC GACCTGAATA TCAATGCAGT AGCAGGTTAC GAATTCCTGG ATTATGCCAA TAGCTGGAAC GCCCTGAGCG GTAGCGGATT TACGGATGTC GGCCTGGATT TTTACGATTA CCTCAACTAT TCGGTGGCCA GCAACCGGGG TGTAAACTCT TTCAAATCGC CTACCAACCA GTTGCAATCG TTGTTCCTAC GGGTTGGTTT GAATTATATG GATAAATACC TCTTCACCGG AACGGTCCGG AGAGATGGTT CGACCAAATT CGGGGAGAAT AACAAATACG CAAACTTCCC TGCGGTGGCG CTTGCCTGGA ATGTGTCGAA CGAAGCATTC CTGAAAGGCA ACAGCACGCT CAACAACCTG AAAGTGCGTT TGGGCTGGGG TAAAACCGGT AACCGTGAGT TCGACTCAGG TGCATCGAGA AACCGCTATA TCTTCGACGT GCAGAGTGTG AGCCAGGCGA ACTTCGGTAA CCCCGACCTG AGATGGGAAA GCTCAGAGAC ATTCAACGGA GGTATCGATT TCGGCTTGTT CCAATCGCGG ATCGTCGGTT CGATCGACGT TTTCCGCAAG AAAACTACCG ACGGTTTGTT CGAGAGAACA CTGGCCCAGC CTGCGCCAAG CGGTAAAATC TGGGTAAACC TGGACGGCGA GGTGATCAAC AAAGGTGTGG AAATTGCCCT GACTGGTACG ATCCTGGAAA ACAAGGACTT TACCTGGAAC CTCTCGGCCA ATGCGACTTA CCTGAAAAAC AGCGTTTCCG GCCTGCCCGG CTTCTACGAA ACTGCCCAGC TGAGAGGACA AGGCTTCTCG GGTGTGCTTG GACAAAGAAT GGTGAGCGGA CAGCCCCTCA ACGTGTGGTA CCTGGCCAAA TACGCCGGTC TTGATCCGGC AACCGGAACG AGCATGTACG AAGGAGCGGA CGGTAACGCC AGCACCACCG TGGACCCTGC TCAGAACAAG TTTTACGTAA ACAGCCCTAA CCCTAAATAC CTGCTGGGCG TGTCTACGGA CGTGACTTAC AAGAAGTTCT CGGCAGTGGT GAACATGAAT GGCGCATTCG GACATTATTT GTTCAACAAC ACTTTCGCGA CTGTACTGGG TATCAACAAC CTGTCGACGA AGAACATCGC CAAGGAATTT TTCAACACGG ATATCAAAGA AGCAACGTCC AACTCCGCAG CGCCATCGAC GCGCTACCTG GAAAAGGGCG ATTTCATGAA AATGTCTAAC GTTACGCTAA GCTACCGCGT CGGAAACATC GGAAAAGCCT TCAAAAACCT GAACATCTCA CTGACGGGCC AGAACCTGTT TGTGCTCACA GGCTACAAAG GTTTCGACCC CGAGGTGAAC ACCGACGGTT CAAGCGGCGG AATTCCTTCC CTTGGTATCG AGTACATTCC TTATCCATCG GCAAGAACAT TCCTTTTGGG CGTAAACTTT TCGTTGTAA
|
Protein sequence | MMKSILRFDR PYEKNVQGPA SLRMCGLGIL LLLCTLFFNN AHAQDVALKG KVTDSKGNAL PGVSITIKGT TKGTSTDMEG GYAISAPPAS TLSFSFIGFS SKDVVVGNET TIDVVLSDDV KALEEVVVIG YGAVKKKDVT GSVVSLQPKD FNKGVITAPD QLIQGRTPGV MVINNTGQPG GATTVRIRGN SSIRAGNNPL FVLDGIPLSG SSARPGGSGG FGSDGGNPLA YLNPNDIASM DILKDASATA IYGSRGANGV VIITTKRGAS GAPTVSFNAS TGVSSLLRKP EVLNADEFRQ AVQFYAPNDL KGADFGGNVN AFDEITRVAP VQNYSMAITG GTDNGKYRVS AGYMDQKGII ETSRLKKYTA SLNTSFKFLE SKKLGLDFNV LFTQTDENLA PIDVGVGFEG NVISQALTWN PTRPLRVNSD SLTYVSPSTI NPLASLQAYK DIAVVNTTVA SIAPSYKITN DLEYKLVYSI TRQTGTRKGR YIRGLIDPAQ IRNGFAFIGN NSETNQQLTH TLSYNKDIST DLNINAVAGY EFLDYANSWN ALSGSGFTDV GLDFYDYLNY SVASNRGVNS FKSPTNQLQS LFLRVGLNYM DKYLFTGTVR RDGSTKFGEN NKYANFPAVA LAWNVSNEAF LKGNSTLNNL KVRLGWGKTG NREFDSGASR NRYIFDVQSV SQANFGNPDL RWESSETFNG GIDFGLFQSR IVGSIDVFRK KTTDGLFERT LAQPAPSGKI WVNLDGEVIN KGVEIALTGT ILENKDFTWN LSANATYLKN SVSGLPGFYE TAQLRGQGFS GVLGQRMVSG QPLNVWYLAK YAGLDPATGT SMYEGADGNA STTVDPAQNK FYVNSPNPKY LLGVSTDVTY KKFSAVVNMN GAFGHYLFNN TFATVLGINN LSTKNIAKEF FNTDIKEATS NSAAPSTRYL EKGDFMKMSN VTLSYRVGNI GKAFKNLNIS LTGQNLFVLT GYKGFDPEVN TDGSSGGIPS LGIEYIPYPS ARTFLLGVNF SL
|
| |