Gene Dfer_5242 details

Gene Information

Locus tag: Dfer_5242
Symbol:
ID: 8228853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 6318483
End bp: 6321407
Gene Length: 2925 bp
Protein Length: 974 aa
Translation table: 11
GC content: 46%
IMG OID: 644933089
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_003089602
Protein GI: 255038981
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
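
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with one stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(6318483, 6321407)
print(length)                     # 2925
print(protein_length_aa(length))  # 974
```

Both results agree with the Gene Length and Protein Length fields in the record.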


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGC AACCAGAACA GATCCTTGAA GAACAATTAG TTGATCAACT TCAAAAGTTG 
GGTTATGCTT TTGTGGCCAT CAAAGACGAA AAGCAGCTGA TTTCTAACCT CAAATCCCAG
CTTGAAAAAC ATAATAGTAT TGTTTTCTCA GATAAGGAAT TCGATAAAGT CCTCAACATC
CTCAATAAGG GTTCCGTATT TGAAAAAGCT AAAACACTAC GGGAGAAGCA GCATATCGTG
CGGGATAATG GGGAGAGCCT GTATTTTGAA TTTCTGAACC AGGAACATTG GTGCCAGAAC
CAGTTTCAGG TGACGCAGCA GGTGAGCATG GAGGGCGTTT ATAAGAACCG TTATGATGTG
ACGCTGCTCA TCAATGGTTT GCCACTGGTG CAGATCGAGC TCAAACGGCG CGGGCTGGAA
CTCAAAGAAG CATTCAACCA GATCAACCGA TATCAGCGGC ATTCTTTCGG CGTCAATTCG
GCGCTGTTTC AGTATGTGCA GATTTTCGTG ATCAGCAATG GTGTGAATAC CAAGTATTAT
GCCAATAACA AGAACCAGAC TTTTAAGCAG ACTTTTTACT GGAGTGATGT CGCTAATAAA
CGGCTGACGA ATATTCTCAA TGGCTTCACG AGCCAGTTTC TGGAACCGTG CCATATCTCC
AAAATGATAT GCAAATACAT TGTCGTCAGC GAAACGAATA AGGTTTTGAT GGTGCTTCGT
CCGTATCAGT TTTATGCAGT GGAAGCATTG GTGGAGCGGG TGAAGAATTC AGGAATCAAA
GGCCCGCATA AAAACGGGTA TATCTGGCAT ACCACCGGCT CGGGGAAGAC ATTGACGTCC
TTTAAAGCCA GCCAGATCCT GATGCATTTG CCGCAAGTGA AGAAAGTCGT TTTCGTGGTC
GACCGGAAAG ATCTGGATTA TCAGACTAAT AAGGAGTTTA ACATTTTCAG CAAAGGCAGC
GTCGACGGGA CGGACGATAC CCGGGCGCTG GTGAAGCAGT TTGCCGACGA TACCAAGCTC
ATCGTTACCA CGATCCAGAA ACTCAATTCG GCGATCGGCA AAAAGCAATA TCTCGATCAG
ATGGCGCGGA TGAAAGATGA GCGCATTATC TTCATTTTCG ACGAATGCCA CCGTTCGCAA
TTTGGTGAAA CGCACACGCG GATCAAATCA TTTTTCAATA ACATACAGCT TTTTGGCTTT
ACTGGGACGC CCATTTTTGC AGATAATGCA GTCAAGAATG AATTGGGAAA ACGGACCACC
ACGGAGCTGT TTGGCGAGTG CCTGCACAAG TATGTGATTA CGGACGCGAT CCGCGACGAA
AATGTTTTAA AATTCTCCGT CGAATATGTG GGCAGGTACA AGCAAAAAGA CGAAAGCCGT
ACAAATATAG ACATTGAGGT AGAAGACATT GACACCAAAG AACTGATGGA TTCGCCGGTT
CGTCTGGAAA AGATCGCGGA TTACATCCTC GCCAACCATG CACGGAAAAC ACACAGCCGC
GAGTTCACGG GCATGTTTTG CGTGAGCAGT GTGGAAACAT TAATCCGCTA TTACGAAATT
CTGCAGGCTA AAAAACTCGC CGGGCAGCAT AACCTCAAAA TTGCCACGAT TTTCAGCTAT
ACAGCCAATG AAGATGATCA GGATGCAAAC GGATCGATGA GCTATGAATC CGATCCGCCT
CCACTGCGAA TGGTGGCCGA TCCTCCGGCC CCTTACCTGA AAAAAGGCGA AGAGGCTGCG
CCGACTGCAC CTGTGATCAT CAATAAGCAC AGCCGGGAAA AGCTGGATGA ATACATCGGG
CATTACAACA AAATGTTTGG AACGGCTTAC TCCACCAAAG ACAGCGTTTC ATTTTACAAT
TATTACAACG ACGTTTCCAA AAAGGTTAAG GAAAAGCAGA TCGACATTCT GCTGGTTGTA
AACATGTTTC TGACAGGCTT CGACAGCAAG AAGCTGAACA CGCTTTACGT CGATAAAAAC
CTGAAATACC ACGGACTGAT CCAGGCTTAT TCGCGGACCA ACCGGATCGA TAATGATGTG
AAATCACAGG GGAACATCAT TGTCTTTCGC AACCTGAAAA AGGCCACCGA CGAAGCCATT
GCATTGTTCA GTAATAAGGA TGCGAAGGAC ATCATCATCA TGGAGCCGTA TGAGGATTAT
GCCGAGAAGT TTGGCGCTGC ATTCGATAAG TTACTCACGG TTGCTCCGAC TGTCGAAAGC
GTAAATCAGC TGACAACCGA ATACGACGAG CTGGAATTCA TCAAGGCATT CCGTGAGCTG
ATGCGGATCA AGAACGTGCT GACCACATTT GCCGATTTCA AATGGGAAGA CCTGGCGATG
AGCGAGCAGC TCTTTGAGGA TTACAAAAGC AAATACCTCG ACCTGCATGA CAAGGTGAAG
TTCGATACGC AGAAAGAAAA GGTATCGATC CTGGAAGATG TGGATTTTGA ACTGGAACTC
ATCCACCGCG ACGAGATAAA TGTCGCCTAC ATTCTGAAAT TGCTGGCAAA CTTGAACAAT
GGCAGCAAGG CAGACCAGGA AAAGAAACGC AAAGAAATCA TCGATCTCCT CACCGGGGAA
GCCAACCTGC GCAGTAAACG CGAACTGATC GAAAAATTCA TTGCCGAAAA CCTGCCCCGG
ATAGAAGACA ATGAAACACT GCCCGATGAA TTTGAGAAAT TCTGGAATGA AGAGCAGGTG
AAGGCATTCA AACAATTGGT GGAGGAAGAA AAACTTTCCG AACCCAAAAC CGAGAAACTG
ATTGAAAGCA TCCTCTTCTC AGAACGCGAG CCATTACGGG AGGAAGTACT TGGCTTATTG
GAAGGAAATA TGCCAACGGT TTTACAAAGG AAGAAAGTTG GGGAAAGAGT TTTGAGTAAG
ATCATGGGGT TTATTGAAAC TTTTGTTGAG GGGATTGTTG GGTGA
 
Protein sequence
MAKQPEQILE EQLVDQLQKL GYAFVAIKDE KQLISNLKSQ LEKHNSIVFS DKEFDKVLNI 
LNKGSVFEKA KTLREKQHIV RDNGESLYFE FLNQEHWCQN QFQVTQQVSM EGVYKNRYDV
TLLINGLPLV QIELKRRGLE LKEAFNQINR YQRHSFGVNS ALFQYVQIFV ISNGVNTKYY
ANNKNQTFKQ TFYWSDVANK RLTNILNGFT SQFLEPCHIS KMICKYIVVS ETNKVLMVLR
PYQFYAVEAL VERVKNSGIK GPHKNGYIWH TTGSGKTLTS FKASQILMHL PQVKKVVFVV
DRKDLDYQTN KEFNIFSKGS VDGTDDTRAL VKQFADDTKL IVTTIQKLNS AIGKKQYLDQ
MARMKDERII FIFDECHRSQ FGETHTRIKS FFNNIQLFGF TGTPIFADNA VKNELGKRTT
TELFGECLHK YVITDAIRDE NVLKFSVEYV GRYKQKDESR TNIDIEVEDI DTKELMDSPV
RLEKIADYIL ANHARKTHSR EFTGMFCVSS VETLIRYYEI LQAKKLAGQH NLKIATIFSY
TANEDDQDAN GSMSYESDPP PLRMVADPPA PYLKKGEEAA PTAPVIINKH SREKLDEYIG
HYNKMFGTAY STKDSVSFYN YYNDVSKKVK EKQIDILLVV NMFLTGFDSK KLNTLYVDKN
LKYHGLIQAY SRTNRIDNDV KSQGNIIVFR NLKKATDEAI ALFSNKDAKD IIIMEPYEDY
AEKFGAAFDK LLTVAPTVES VNQLTTEYDE LEFIKAFREL MRIKNVLTTF ADFKWEDLAM
SEQLFEDYKS KYLDLHDKVK FDTQKEKVSI LEDVDFELEL IHRDEINVAY ILKLLANLNN
GSKADQEKKR KEIIDLLTGE ANLRSKRELI EKFIAENLPR IEDNETLPDE FEKFWNEEQV
KAFKQLVEEE KLSEPKTEKL IESILFSERE PLREEVLGLL EGNMPTVLQR KKVGERVLSK
IMGFIETFVE GIVG
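
The gene sequence and the protein sequence above can be checked against each other by translating the DNA codon by codon. The following is a minimal sketch, assuming the sequence is given 5' to 3' in the coding frame. It uses the amino-acid assignments of translation table 11, which are the same as those of the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; the gene here starts with ATG, so that does not matter. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest),
# paired with the amino-acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record) is
    removed first. Any trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record.
first_line = "ATGGCTAAGC AACCAGAACA GATCCTTGAA GAACAATTAG TTGATCAACT TCAAAAGTTG"
last_line = "ATCATGGGGT TTATTGAAAC TTTTGTTGAG GGGATTGTTG GGTGA"

print(translate(first_line))  # MAKQPEQILEEQLVDQLQKL
print(translate(last_line))   # IMGFIETFVEGIVG
```

The two outputs match the first 20 and last 14 residues of the protein sequence listed above. Translating the last line alone gives the right residues only because the preceding 2880 bases are a whole number of codons, so the line starts in frame.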