Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_5226 |
Symbol | |
ID | 8228837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 6300484 |
End bp | 6303540 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644933076 |
Product | type III restriction protein res subunit |
Protein accession | YP_003089589 |
Protein GI | 255038968 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.310365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTAC AATTTAAAGA GCAAAGCTTT CAGTTAGATG CTGTTCAGGC CATCACCGAC TGCTTTCTGG GGCAACCCAG AGAAACGAAC CGTTTTACAC TGGAAAAAAG CAAGGAACTG ATTGCCAAAG CAAGGTCGGC AAGCAAAGGT CAGTTTGGTA TGGATCTGGA GGTAGAGGAG CTGATCGGGT ACCGAAACCG GCAGATCCAG ATTACCGAAG ATCAGTTACT AGAGAACATT CAGCAGGTTC AGCGGCAAAA TGATATCACT CAAAGTAAAT CCCTTGAAAA ACCCAAAGGT GTAAAAAAGG GCTATCAGTT CACCGTTGAA ATGGAGACTG GAACGGGTAA AACGTACACC TATATCCGCA CCATGTATGA ACTGCACCAG CTTTATGGCT GGAGTAAGTT TATTGTGGTT GTTCCCAGTA TTGCAATCCG TGAAGGGGTC TATAAGTCGT TCCAGGTGAT GCAAGACCAC TTCCAGGAGC GATACGGACA CCGCATTAGC CCATTTATTT ATAACTCATC TCGTCCACAG GACATTGAGA GTTTTGCGTC GGATAGCCGC ATTAGTGTGA TGATCATCAA TACGCAGGCA TTTAATGCTA AAGGTAAAGA TGCACGTCGT ATTTATATGG AACTTGATCA GTTCGGCACT CGAAAGCCCA TCGAGATTAT CGCTCAGACT AATCCCATTC TGATTATTGA TGAACCACAA TCGGTAGAAG GCGATAAAAC CTTGGAGAGT ATGCAGGATT TCAACCCGCT TTTTACACTC CGTTATTCCG CTACCCATAA ATTTGAGTAT AACAAGGTAT ATCGTCTCGA TGCACTGGAT GCCTATAACA AGAAGCTCGT AAAGAAAATC CAGGTAAAAG GTATCAACAT CAAAGGTACT ACGGGTACCA GTGGATATCT ATATCTGGAA CAAATCCAGC TTTCTACCTC ACGTCCCCCA CTTGCTGTCT TGGAGTACGA AAAGCGTAAT GGGACGGGCG TCAGGCGGGT ACGCGAGAAA CTTGAAAAAG GAGCCAACCT GTTTGAACTT TCCGGCGAAA TGCCCCAGTA TAAAAACTGG CTGTTAGAGG AGGTTGATGG CTATTTTAAC CGGGTGGTGA TCAATGGGAA AATAATTGAA GCCGGAGAGG CTATTGGCGA TCTGGACGAA AAGGCATTCC GGCGAATACA AATCCGCGAA ACGATCAGTT CTCATTTAAA AAAAGAACGA GAGCTTTTTG ACAAGGGAAT CAAGGTACTA TCGCTCTTCT TCATAGATAC AGTAGATAAA TACCGTATCT ATGATAAGGA AGGTAATCCT GGTTTAGGTG AATATGCCCA GATGTTTGAG GAGGAATACG TTCAACTAAG GAACGAATAT CTAGATCTGT TCTATCCCGA TTACAATCAA TATCTGCAAC GTGATCCGGC AGAGAGGGTG CACAACGGGT ATTTTTCCAT TGATAAGCAA CGGAAAATGA TTGACCCCTT AGTAAAGCGG GGAAGTGAAG AGACGGATGA TATCAGCGCT TATGACTTGA TTATGAAGGA TAAGGAGCGA TTGCTAAGTT TTGACGAACC CACCCGCTTC ATTTTTTCGC ATTCTGCCTT GAAAGAAGGA TGGGACAATC CGAACGTCTT TCAAATTTGT ACCCTCAAGC ATTCGGATGC GTCCATTCGC CGCCGTCAGG AAGTAGGGCG CGGTATGCGT CTTTCGGTCA ATAAACATGG CATACGGCAG GACGAAGAGG CCATTGGCGA GCAGGTACAT GAAATCAATA AACTGACGAT TATCGCCTCT GAAAGCTATG AAGAATTTGC CCGAGGCCTG CAATCAGAAA TTGCCGCGAC CTTAAAAGAT CGACCTCAAA AAGCGACCGT TGAATTTTTG ACTGGTAAAC TTTTGACTGA CGAGCATGGA AATCAAAAAC GCCTGACCTT CGAAGAAGCG AAAAAGTTGA ACAAGTATCT TTATAAAGAG GATGTTTTGG ACGATGACGA TAAAATTACG GATGATGGAC GGAGGTTGGT TGAAGAAAAC AATATCCCCC TTCCTGACCA ATTGGAAGCA TTTCGGGATA GCATAAATCA ATTGCTACGA TCCGTTTATA TGGGCGAAGC CATCAAACCT GAGAATGACC GGCAAAGCAT TACAATTCAG ACGAACAGTA ACTTTCATAA GAAGGAATTT CAGAAGCTTT GGAATAAGAT CAACCTGAAA ACCATTTATG AAGTACAGTT TAACTCGGAA AAGCTGGTTT CTGATGCCAA AATTCGGATA AACGCAGATT TGAATATTTC GGAGCGTACG TATGAAATCC GCAGCGGAGA GCTGGAGGAG AGTACGAAAG AGCAGTTGCA GGAAAAGAAT GCCTTTCAGG AGACTTCCCG CCAACACAAG AAGCTCAATT CCGATGTCTA TACAAATACA AGATATGATG TGGTTGGAGA GATTGTCAAA CATACCAACC TTACCCGAAA AACCATCGTC GAGATATTAA AGAGCATTGA CACGAGCAAG TTCCTGATGA TCCGGAAAAA TCCAGAGGAG TTTATAGCCC GAACCAGCAA GCTAATCAAT GAAGTTAAGG CCAGTCTGAT CATCAATAAC ATCGTCTATC ACAAAGTTGA TGACAGGCAT GATGCTAAAA CCGTATTTGT GAATGACAAA TCCGTTATTC GGCAGTCAGA AATATTGAAA AAGCATGTTT ATGACTTTCT GACTACTGAT TCACAAACCG AAGCACGTTT TGCCGAAGCA TTGGAAAACA GCAACCATGT TCAGGTATAC GCTAAGCTTC CGAAAAGTTT TTACATTACC ACGCCTGTCG CTAATTATAG TCCTGACTGG GCGATTGTGT TTGATAAGGA CACTATCCGC CACATCTATT TTGTAGCAGA AACCAAGGGT ACGGATTCAG ACCTAGAGCT ACGCGAGATA GAAAAGCTGA AAATCCATTG TGCCGGGGAA CATTTCAAAG CAATCAGTGG ACAAGAGATG AAGTTTTCAA AAGTCAGCAA CTACCAGCAA ATGCTGGAAA TAGTGCAAGT GAAATAA
|
Protein sequence | MKLQFKEQSF QLDAVQAITD CFLGQPRETN RFTLEKSKEL IAKARSASKG QFGMDLEVEE LIGYRNRQIQ ITEDQLLENI QQVQRQNDIT QSKSLEKPKG VKKGYQFTVE METGTGKTYT YIRTMYELHQ LYGWSKFIVV VPSIAIREGV YKSFQVMQDH FQERYGHRIS PFIYNSSRPQ DIESFASDSR ISVMIINTQA FNAKGKDARR IYMELDQFGT RKPIEIIAQT NPILIIDEPQ SVEGDKTLES MQDFNPLFTL RYSATHKFEY NKVYRLDALD AYNKKLVKKI QVKGINIKGT TGTSGYLYLE QIQLSTSRPP LAVLEYEKRN GTGVRRVREK LEKGANLFEL SGEMPQYKNW LLEEVDGYFN RVVINGKIIE AGEAIGDLDE KAFRRIQIRE TISSHLKKER ELFDKGIKVL SLFFIDTVDK YRIYDKEGNP GLGEYAQMFE EEYVQLRNEY LDLFYPDYNQ YLQRDPAERV HNGYFSIDKQ RKMIDPLVKR GSEETDDISA YDLIMKDKER LLSFDEPTRF IFSHSALKEG WDNPNVFQIC TLKHSDASIR RRQEVGRGMR LSVNKHGIRQ DEEAIGEQVH EINKLTIIAS ESYEEFARGL QSEIAATLKD RPQKATVEFL TGKLLTDEHG NQKRLTFEEA KKLNKYLYKE DVLDDDDKIT DDGRRLVEEN NIPLPDQLEA FRDSINQLLR SVYMGEAIKP ENDRQSITIQ TNSNFHKKEF QKLWNKINLK TIYEVQFNSE KLVSDAKIRI NADLNISERT YEIRSGELEE STKEQLQEKN AFQETSRQHK KLNSDVYTNT RYDVVGEIVK HTNLTRKTIV EILKSIDTSK FLMIRKNPEE FIARTSKLIN EVKASLIINN IVYHKVDDRH DAKTVFVNDK SVIRQSEILK KHVYDFLTTD SQTEARFAEA LENSNHVQVY AKLPKSFYIT TPVANYSPDW AIVFDKDTIR HIYFVAETKG TDSDLELREI EKLKIHCAGE HFKAISGQEM KFSKVSNYQQ MLEIVQVK
|
| |