Gene Daci_4204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_4204 
Symbol 
ID5749792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp4602520 
End bp4605525 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content59% 
IMG OID641299307 
Producttype III restriction protein res subunit 
Protein accessionYP_001565220 
Protein GI160899638 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.416822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0251436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGC ACTTCGAGCC CAACCTCGAC TACCAGATGC AGGCCATCGA GGCTGTATGC 
GATCTTTTCC GTGGTCAGGA GGTCTGCCGC ACCGAATTCA CGGTGACCAT GAAATTGCCC
GATGACGTGC AGATGTCACT GGGCGTGGCG CAGTCCGACC TTGGCGTTGG CAACCGCTTG
ACCCTGCTGG ACGATGAACT GCTCAAGAAC CTCGCGGACA TCCAGTTGCG CGGTGGCTTG
CCGCCTTCCA GTTCGCTGAC TTCGGGCGAC TTCACTGTGG AAATGGAGAC CGGCACCGGC
AAGACCTATG TGTATCTGCG CTCGATTTTC GAGCTGAACA AACGCTACGG CTTCACCAAG
TTCGTGATCG TGGTGCCTTC AGTGGCGATC AAGGAGGGTG TTTATAAAAC CCTGCAGATC
ACCGAGGAAC ACTTCAAGGG GCTCTACGCG GGCGTACCCT TCGATTACTT CCTATACGAC
TCCGGTAAGC CGGGGCCGGT GCGCAATTTC GCCACGAGCT CCAACATCCA GATCATGGTG
GTGACGGTGG GCGCCATCAA CAAGAAGGAT GTGAACAACC TCTACAAAGA GAGCGAGAAA
ACCGGCGGCG AGAAGCCCAT CGACCTGATC AAGGCCACCC GGCCGATAAT CATCGTGGAT
GAGCCGCAAA GCGTGGACGG CGGCATGGAA GGCCGTGGCA AGGAAGCACT GGACGCCATG
AACCCGCTCT GCACGCTGCG CTACTCCGCT ACCCATGTGG ACAAGCACCA CATGGTATTT
CGCCTCGATG CCGTCGATGC CTACGAGCGC AAACTGGTCA AGCAGATCGA GGTGGCGTCG
GCCACGGTAG AGGACGCGCA CAACAGGCCC TTTGTGCGCC TGGTGAAGGT GGAAAACAAG
CGCGGCCGCA TCAGCGCCAA GGTCGAGCTA GATAAACAGA CCGCCACTGG TGTGCAGCGG
GCTGAAGTGA CGGTCAGCGA CGGCGACGAC CTCCAGCAGA GCGCCGATGG CCGCGCGATC
TATGCCGATT TTCGCGTCGG CGAGATCAAC ACGGCCAAGG GCGAAGCGTT CATGGAGCTG
CGCTACCCCG GTGGCGAGGT GTTTTTGCAA CCTGGCCAAG CCCACGGTGA TGTGGATGCG
CTTGCCGTGC AACGCGAGAT GATCCGCCGC ACGATCAAGG AACACCTGGA CAAGGAGAAG
CACCTGCGCC CGCTGGGCAT CAAGGTGTTG AGCCTGTTCT TCATCGACGC GGTGGACAAA
TATCGTCAGT ACGATGCGGA CGGCCAGCCG GTCAAGGGTG TGTATGCGCA GATGTTCGAG
GAGGAATATC GCCGTGCCGC CAAGTTGCCG GCTTACCAGA GTTTGTTTGC CGAGATCGAC
CTGGAGTCCG CCGCCGAAGA AGTGCACAAC GGTTATTTCT CCATCGACAA GAAAGGCGGC
TGGACTGACA CCGCCGAGAA CAATGCGGGT AACCGGGAGA ATGCCGAACG CGCCTACAAC
CTGATCATGA AGGAGAAGGA GAAGCTGCTG TCCTTCGGTA CGCCGCTGAA GTTCATCTTC
TCCCACTCCG CCCTCAAGGA AGGCTGGGAC AACCCCAACG TGTTCCAGAT TTGCACCTTG
CGCGACATCC AGACCGAGCG CGAGCGCCGC CAGACCATTG GCCGTGGCCT GCGTCTGTGC
GTCAACCAGG ATGGCGAGCG GGTACGCGGC TTTGAGGTCA ACACCCTGAC CGTGGTGGCC
ACGGAAAACT ACGAACAGTT TGCCGAAAAC CTGCAGAAGG AAATCGAGAA AGACACAGGC
ATCCGCTTTG GCATCGTGGA GCAGCACCAA TTTGCCGCCA TTGCCGTGAC TGGCGCTGAT
GGGCACGCCG CACCGCTGGG CATCGAGCAA TCAAAGGCAC TGTGGGAGCA CCTGAAAGCC
GCCGGCCATA TAGATGCCAA AGGCAAGGTG CAGGATTCAC TGAAAACGGC GCTGAAGAAC
GGCACCTTGG AACTGCCGGA CGAGTTTGAT GCGCAAAAGG CCCAGATTGC TGAAGTGCTG
CGCAAGGTGT CGGGCCGGCT CGATATCAAA AATGCCGATG AACGCAGGCA AGTGCCGCTG
CGCAAGGGCA AGGATGGCAA GGCCGTTTAT CTGAGTGACG AGTTCAAGGC ACTGTGGGAC
CGCATCAAGC ACCAAACAAC GTACCGCGTG CAGTTCGATA ACGCCAAGTT GGTGACGGAT
TGCATCGCAG CGTTGCAGAA GGCCCCGGTG ATTGCCAAAG CACGACTGCA ATGGCGCAAG
GCCGACATCT CTATCGGCAA GGCGGGTGTC GCCGCGACGG AGAAAGCGGG CGCGGCGACC
GTGGTGCTGG ACGAGGCGGA TATTGAGCTG CCGGATTTGC TGACCGACCT TCAGGATCGC
ACCCAGCTCA CCCGGCGCAC CATCGTCAGC ATCCTGACGG GAAGCGGTCG CCTGAACGAC
TTCAAACGCA ATCCGCAGCA GTTCATCGAA TTGACTGCCG AAACCATCAA CCGCTGCAAG
CGCTTGGCCC TGGTCGATGG CATCAAGTAC CAGAAGCTGG GTGACCAGCA TGTCTATGCG
CAGGAGCTGT TCGAGAAGGA AGAGCTCACC GGCTATCTCA AGAACATGCT GCTGGATACC
CAGAAGTCGA TCTACGAGCA CGTGGTGTAC GACTCGACCA CTGAGCGGGA TTTCGCCGAT
GGGCTGGAGA AGAACGACGC CATCAAGCTC TACGCCAAGT TGCCAGGCTG GTTCAAAGTG
CCCACGCCGC TGGGCACCTA CAACCCCGAC TGGGCCGTGT TGGTGGAAGA AGACGGCACT
CAGCACCTGT ATTTTGTGGT GGAAACCAAG AGCAGCCTGT TCACCGACGA TATGCGCGAC
AAGGAAAGCG CCAAGATCGA ATGCGGCAAG GCGCATTTCA CTGCGCTGGA GGGCGGCGAG
AACCCAGCCC GGTATGTGGT TGCGCGCTCG GTTGGTGATC TTTTGACCGA GGCGGCAAAG
GGGTAG
 
Protein sequence
MKLHFEPNLD YQMQAIEAVC DLFRGQEVCR TEFTVTMKLP DDVQMSLGVA QSDLGVGNRL 
TLLDDELLKN LADIQLRGGL PPSSSLTSGD FTVEMETGTG KTYVYLRSIF ELNKRYGFTK
FVIVVPSVAI KEGVYKTLQI TEEHFKGLYA GVPFDYFLYD SGKPGPVRNF ATSSNIQIMV
VTVGAINKKD VNNLYKESEK TGGEKPIDLI KATRPIIIVD EPQSVDGGME GRGKEALDAM
NPLCTLRYSA THVDKHHMVF RLDAVDAYER KLVKQIEVAS ATVEDAHNRP FVRLVKVENK
RGRISAKVEL DKQTATGVQR AEVTVSDGDD LQQSADGRAI YADFRVGEIN TAKGEAFMEL
RYPGGEVFLQ PGQAHGDVDA LAVQREMIRR TIKEHLDKEK HLRPLGIKVL SLFFIDAVDK
YRQYDADGQP VKGVYAQMFE EEYRRAAKLP AYQSLFAEID LESAAEEVHN GYFSIDKKGG
WTDTAENNAG NRENAERAYN LIMKEKEKLL SFGTPLKFIF SHSALKEGWD NPNVFQICTL
RDIQTERERR QTIGRGLRLC VNQDGERVRG FEVNTLTVVA TENYEQFAEN LQKEIEKDTG
IRFGIVEQHQ FAAIAVTGAD GHAAPLGIEQ SKALWEHLKA AGHIDAKGKV QDSLKTALKN
GTLELPDEFD AQKAQIAEVL RKVSGRLDIK NADERRQVPL RKGKDGKAVY LSDEFKALWD
RIKHQTTYRV QFDNAKLVTD CIAALQKAPV IAKARLQWRK ADISIGKAGV AATEKAGAAT
VVLDEADIEL PDLLTDLQDR TQLTRRTIVS ILTGSGRLND FKRNPQQFIE LTAETINRCK
RLALVDGIKY QKLGDQHVYA QELFEKEELT GYLKNMLLDT QKSIYEHVVY DSTTERDFAD
GLEKNDAIKL YAKLPGWFKV PTPLGTYNPD WAVLVEEDGT QHLYFVVETK SSLFTDDMRD
KESAKIECGK AHFTALEGGE NPARYVVARS VGDLLTEAAK G