Gene Dfer_5226 details

Gene Information

Locus tag: Dfer_5226
Symbol:
ID: 8228837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 6300484
End bp: 6303540
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 43%
IMG OID: 644933076
Product: type III restriction protein res subunit
Protein accession: YP_003089589
Protein GI: 255038968
COG category: [V] Defense mechanisms
COG ID: [COG3587] Restriction endonuclease
TIGRFAM ID:
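
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(6300484, 6303540)
print(length)                  # 3057
print(protein_length(length))  # 1018
```

Both values agree with the Gene Length and Protein Length fields of this record.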


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.310365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAC AATTTAAAGA GCAAAGCTTT CAGTTAGATG CTGTTCAGGC CATCACCGAC 
TGCTTTCTGG GGCAACCCAG AGAAACGAAC CGTTTTACAC TGGAAAAAAG CAAGGAACTG
ATTGCCAAAG CAAGGTCGGC AAGCAAAGGT CAGTTTGGTA TGGATCTGGA GGTAGAGGAG
CTGATCGGGT ACCGAAACCG GCAGATCCAG ATTACCGAAG ATCAGTTACT AGAGAACATT
CAGCAGGTTC AGCGGCAAAA TGATATCACT CAAAGTAAAT CCCTTGAAAA ACCCAAAGGT
GTAAAAAAGG GCTATCAGTT CACCGTTGAA ATGGAGACTG GAACGGGTAA AACGTACACC
TATATCCGCA CCATGTATGA ACTGCACCAG CTTTATGGCT GGAGTAAGTT TATTGTGGTT
GTTCCCAGTA TTGCAATCCG TGAAGGGGTC TATAAGTCGT TCCAGGTGAT GCAAGACCAC
TTCCAGGAGC GATACGGACA CCGCATTAGC CCATTTATTT ATAACTCATC TCGTCCACAG
GACATTGAGA GTTTTGCGTC GGATAGCCGC ATTAGTGTGA TGATCATCAA TACGCAGGCA
TTTAATGCTA AAGGTAAAGA TGCACGTCGT ATTTATATGG AACTTGATCA GTTCGGCACT
CGAAAGCCCA TCGAGATTAT CGCTCAGACT AATCCCATTC TGATTATTGA TGAACCACAA
TCGGTAGAAG GCGATAAAAC CTTGGAGAGT ATGCAGGATT TCAACCCGCT TTTTACACTC
CGTTATTCCG CTACCCATAA ATTTGAGTAT AACAAGGTAT ATCGTCTCGA TGCACTGGAT
GCCTATAACA AGAAGCTCGT AAAGAAAATC CAGGTAAAAG GTATCAACAT CAAAGGTACT
ACGGGTACCA GTGGATATCT ATATCTGGAA CAAATCCAGC TTTCTACCTC ACGTCCCCCA
CTTGCTGTCT TGGAGTACGA AAAGCGTAAT GGGACGGGCG TCAGGCGGGT ACGCGAGAAA
CTTGAAAAAG GAGCCAACCT GTTTGAACTT TCCGGCGAAA TGCCCCAGTA TAAAAACTGG
CTGTTAGAGG AGGTTGATGG CTATTTTAAC CGGGTGGTGA TCAATGGGAA AATAATTGAA
GCCGGAGAGG CTATTGGCGA TCTGGACGAA AAGGCATTCC GGCGAATACA AATCCGCGAA
ACGATCAGTT CTCATTTAAA AAAAGAACGA GAGCTTTTTG ACAAGGGAAT CAAGGTACTA
TCGCTCTTCT TCATAGATAC AGTAGATAAA TACCGTATCT ATGATAAGGA AGGTAATCCT
GGTTTAGGTG AATATGCCCA GATGTTTGAG GAGGAATACG TTCAACTAAG GAACGAATAT
CTAGATCTGT TCTATCCCGA TTACAATCAA TATCTGCAAC GTGATCCGGC AGAGAGGGTG
CACAACGGGT ATTTTTCCAT TGATAAGCAA CGGAAAATGA TTGACCCCTT AGTAAAGCGG
GGAAGTGAAG AGACGGATGA TATCAGCGCT TATGACTTGA TTATGAAGGA TAAGGAGCGA
TTGCTAAGTT TTGACGAACC CACCCGCTTC ATTTTTTCGC ATTCTGCCTT GAAAGAAGGA
TGGGACAATC CGAACGTCTT TCAAATTTGT ACCCTCAAGC ATTCGGATGC GTCCATTCGC
CGCCGTCAGG AAGTAGGGCG CGGTATGCGT CTTTCGGTCA ATAAACATGG CATACGGCAG
GACGAAGAGG CCATTGGCGA GCAGGTACAT GAAATCAATA AACTGACGAT TATCGCCTCT
GAAAGCTATG AAGAATTTGC CCGAGGCCTG CAATCAGAAA TTGCCGCGAC CTTAAAAGAT
CGACCTCAAA AAGCGACCGT TGAATTTTTG ACTGGTAAAC TTTTGACTGA CGAGCATGGA
AATCAAAAAC GCCTGACCTT CGAAGAAGCG AAAAAGTTGA ACAAGTATCT TTATAAAGAG
GATGTTTTGG ACGATGACGA TAAAATTACG GATGATGGAC GGAGGTTGGT TGAAGAAAAC
AATATCCCCC TTCCTGACCA ATTGGAAGCA TTTCGGGATA GCATAAATCA ATTGCTACGA
TCCGTTTATA TGGGCGAAGC CATCAAACCT GAGAATGACC GGCAAAGCAT TACAATTCAG
ACGAACAGTA ACTTTCATAA GAAGGAATTT CAGAAGCTTT GGAATAAGAT CAACCTGAAA
ACCATTTATG AAGTACAGTT TAACTCGGAA AAGCTGGTTT CTGATGCCAA AATTCGGATA
AACGCAGATT TGAATATTTC GGAGCGTACG TATGAAATCC GCAGCGGAGA GCTGGAGGAG
AGTACGAAAG AGCAGTTGCA GGAAAAGAAT GCCTTTCAGG AGACTTCCCG CCAACACAAG
AAGCTCAATT CCGATGTCTA TACAAATACA AGATATGATG TGGTTGGAGA GATTGTCAAA
CATACCAACC TTACCCGAAA AACCATCGTC GAGATATTAA AGAGCATTGA CACGAGCAAG
TTCCTGATGA TCCGGAAAAA TCCAGAGGAG TTTATAGCCC GAACCAGCAA GCTAATCAAT
GAAGTTAAGG CCAGTCTGAT CATCAATAAC ATCGTCTATC ACAAAGTTGA TGACAGGCAT
GATGCTAAAA CCGTATTTGT GAATGACAAA TCCGTTATTC GGCAGTCAGA AATATTGAAA
AAGCATGTTT ATGACTTTCT GACTACTGAT TCACAAACCG AAGCACGTTT TGCCGAAGCA
TTGGAAAACA GCAACCATGT TCAGGTATAC GCTAAGCTTC CGAAAAGTTT TTACATTACC
ACGCCTGTCG CTAATTATAG TCCTGACTGG GCGATTGTGT TTGATAAGGA CACTATCCGC
CACATCTATT TTGTAGCAGA AACCAAGGGT ACGGATTCAG ACCTAGAGCT ACGCGAGATA
GAAAAGCTGA AAATCCATTG TGCCGGGGAA CATTTCAAAG CAATCAGTGG ACAAGAGATG
AAGTTTTCAA AAGTCAGCAA CTACCAGCAA ATGCTGGAAA TAGTGCAAGT GAAATAA
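
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of stripping that layout and computing a GC fraction; the helper names are hypothetical, and how the database derives or rounds its reported GC content is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence listing, used here only as sample input.
first_line = "ATGAAACTAC AATTTAAAGA GCAAAGCTTT CAGTTAGATG CTGTTCAGGC CATCACCGAC"
seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Passing the full listing through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.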
 
Protein sequence
MKLQFKEQSF QLDAVQAITD CFLGQPRETN RFTLEKSKEL IAKARSASKG QFGMDLEVEE 
LIGYRNRQIQ ITEDQLLENI QQVQRQNDIT QSKSLEKPKG VKKGYQFTVE METGTGKTYT
YIRTMYELHQ LYGWSKFIVV VPSIAIREGV YKSFQVMQDH FQERYGHRIS PFIYNSSRPQ
DIESFASDSR ISVMIINTQA FNAKGKDARR IYMELDQFGT RKPIEIIAQT NPILIIDEPQ
SVEGDKTLES MQDFNPLFTL RYSATHKFEY NKVYRLDALD AYNKKLVKKI QVKGINIKGT
TGTSGYLYLE QIQLSTSRPP LAVLEYEKRN GTGVRRVREK LEKGANLFEL SGEMPQYKNW
LLEEVDGYFN RVVINGKIIE AGEAIGDLDE KAFRRIQIRE TISSHLKKER ELFDKGIKVL
SLFFIDTVDK YRIYDKEGNP GLGEYAQMFE EEYVQLRNEY LDLFYPDYNQ YLQRDPAERV
HNGYFSIDKQ RKMIDPLVKR GSEETDDISA YDLIMKDKER LLSFDEPTRF IFSHSALKEG
WDNPNVFQIC TLKHSDASIR RRQEVGRGMR LSVNKHGIRQ DEEAIGEQVH EINKLTIIAS
ESYEEFARGL QSEIAATLKD RPQKATVEFL TGKLLTDEHG NQKRLTFEEA KKLNKYLYKE
DVLDDDDKIT DDGRRLVEEN NIPLPDQLEA FRDSINQLLR SVYMGEAIKP ENDRQSITIQ
TNSNFHKKEF QKLWNKINLK TIYEVQFNSE KLVSDAKIRI NADLNISERT YEIRSGELEE
STKEQLQEKN AFQETSRQHK KLNSDVYTNT RYDVVGEIVK HTNLTRKTIV EILKSIDTSK
FLMIRKNPEE FIARTSKLIN EVKASLIINN IVYHKVDDRH DAKTVFVNDK SVIRQSEILK
KHVYDFLTTD SQTEARFAEA LENSNHVQVY AKLPKSFYIT TPVANYSPDW AIVFDKDTIR
HIYFVAETKG TDSDLELREI EKLKIHCAGE HFKAISGQEM KFSKVSNYQQ MLEIVQVK