Gene Daro_1308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1308 
Symbol 
ID3569171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1419442 
End bp1422675 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content59% 
IMG OID637679771 
ProductType I site-specific deoxyribonuclease HsdR 
Protein accessionYP_284529 
Protein GI71906942 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.181512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG TCCTCACCGC TAAGGAAGTC ACCGCCGCGT ACGGAGAACT CGTCCTCGTC 
GAGTTACCCG CAATGGATCG CTTTGCTTCC CTCGGCTGGG CGGTCGCCAA CCTATACAAC
GAAACCTTCG GGCAGGATGG CACCGAGGGG CGTACCTCAG AAGCAGAGGT CATTCTCACC
CGGCGACTCC GCAAGGCGCT GGAGCGCATC AATCCGGGCT ACTCGGCATC GGCTTATGAT
CAGGCCATTG AACAACTGAC CGAGGACCGC TCGAAGCAGT TGCCGGTCAA TGCTAACCAA
GCGTTCTACA AGTTGCTGCG TGATCGGGTA AAAGTAGAGA TCACCGACAA CGAGGGCAAC
CCACAGACCG TTGAACTCAC GGTCATCGAC TGGCGAAATC CTGAGAACAA CGACTTCTTT
CTGACCCAGC AGATGTGGGT TTCGGGTGAG ATGTACAAAC GCCGCTGCGA CCTTGTAGGC
TTCGTGAATG GCATCCCTCT GGTATTCATC GAACTGAAGG CTCCCCACGT CCCCCTGAAG
TCTGCCTACG ACGACAACCT GAAGGACTAC AAGGGCCAGA GCATCCCGCA ACTGTTCCAC
CCCAACGCCT TCATCCTGCT GTCGAATGGA TCGCAGACCC GCATCGGCAC CCTGACCAGC
CCCTGGGAAC ACTTCTTCGA GTGGCGGCGC ATTGATGACG AGACAGAGGC TGGCTCCACC
TCACTGGATA CGGCCATCCG GGGGATGTGC GACAAGCGGC GACTGCTGGA CATAGTCGAG
AACTTCACCA TCTTTGAGGA GGCCCGTGGT GGCCTGATCA AAAAGGTTGC CAAGAACCAC
CAGTACCTCG GGGTCAATAA AGCCATCGCC CAGATGATCA AGCTGCGCGA ATCGGGGGAT
GTCGAGGCCG CCAAGCGTCT GGGCGTCTTC TGGCACACTC AGGGCAGCGG CAAGAGCCTG
TCGATGGTCT TCTTCACCCA GAAGATACTC CGCACCCTCC CCGGCAAATG GACCTTCCTG
ATCGTCACTG ATCGGGAGGA ACTGGACGAC CAGATCAGCA AGACGTTCAA GGCCACGGGG
GCGATCCCCA ATGCCAAGCT CGGGGGCACC AAGGACAACA TCCTGGTCAG GGCTGCCAGT
AGCAACCACC TGAAGCAGTT GCTGAAAACT GACGAGCGGT ACATCTTCAC GCTGATCCAG
AAGTTCAGGA CCGAGGCCGG GGCGCAGTAC GAGAAGCTCT CGGATCGTAG CGACATCATC
GTCATCACCG ACGAAGCCCA CCGCAGCCAG TACGACATCT TCGCCCTCAA CATGAGGAAC
GCCCTGCCCC ATGCGGCCTT TCTAGGGTTC ACCGGCACCC CGCTGATCGC TGGGGAGGAG
GAGCGCACCC GTGAGGTCTT CGGCGATTAC GTCTCAGTCT ATGACTTCGC CCGGTCTATC
GAGGACGGGG CCACCGTACC GCTCTACTAC GAGAACCGCA CCCCGGAACT CCAGATCACC
AACCAGAACC TCAACAAGGA CATCGAGAAG CTCCTCGAAG AAGCCGAATT GGATGAGGCT
CAGGAACAAA AACTGGAGCG TGAGTTTGCT AGGGAATACC ACCTGATCAC TCGGGATGAT
CGGCTGGAGA CCATTGCCGA GGACTTGGTG AAGCATTTCC TTGGGCGTGG CTATCGTGGC
AAAGCAATGA TGGTCTGTAT CGACAAGGCC ACCGCAGTGC GGATGTACGA CAAGGTGCAA
GCCTACTGGC AACAGCATCT GGCTGAACTC CATGAGAAAC TCACCAAGGC CACCGGGGAC
GACAAGGAGA TACTGGCCGA GAAGATCATG GTGATGGACA CCACCGACAT GGCCGTGGTC
GTCTCCCAGT CGCAGAATGA GATCGAAGAC CTCAAGGCCA AGGGCTTGGA CATCGTCCCC
CACCGGCAGC GGATGCTGAA GGAAGACCTC GACGAGCAGT TCAAAGACCC CGACGGCAAT
CTGCGACTGG TCTTCGTCTG CGCCATGTGG ATCACCGGCT TCGATGTGCC GACCTGTTCC
ACCCTTTACC TCGACAAGCC GATGCGGAAC CACACCCTGA TGCAGACCAT CGCACGGGCC
AACCGCGTAT CGCCGGGCAA GAGTGCTGGC CTGATCGTCG ATTACGTGGG CATCTTCCGT
AACCTTCAGG ACGCCCTCCG CATCTACGCC AAGCCCAACC AGCCGGGACA ACTGCCGATC
AACGACAAAG CTGCCCTGGT CGCTCAACTG GAAGCCCTGC TGGAACGTGC AGAATCGTTC
TGTACCGACC TTGGCATCAA TCTGAAGGGT GTGGTGAACA CGCCCCCGGA AAAACGCCTG
GAAGCCCTTC AGGAGGCTAT GAACACCATC CTCGGCGAGG GCGAGAACAC GACCAAGGCG
TACCTGCTGC TGGCGGGTCA GGTTGCCCGA ACCTTCAAGT CCATCCTGCC TGATGCGGCT
GCCAACGCCT ATGCACCGAT GTCTGTTCTG GTGGCGTACC TGGGGGCGAT GATCAAGGCC
CTGCGTCCGC CCCCGGACAT CTCCGACGTG ATGAACGATC TGGATGCCTT GCTGGATGAT
TCCATCGCCA CCGAGGGTTA CCGCATCGGC CACAAGCCAG AAGCCGAGGC CCTGATCGAC
CTCTCGCAGA TCGACTTCGC AGCCCTTCAG GAAAAGTTCG CCGAGGGCAA GAAGGCTACC
GAGACTGAGA AGCTCAAGGG GCAGATCGAA CAGAAGCTGG AGAAGATGGT CCGCGAGAAC
AAGGGGCGCA CTGATTTCCT GGAGAAGTTC CAGAAGCTGA TCGATGCCTA CAACTCGTCA
AGCCACAACC TCGAAGCCTT CTTCAAGGAA CTGCTAGCCT TTGCCCAGAA CCTCACCGAA
GAGGAGCAAC GGGCGCACCG TGAGAACCTC TCCGAGGAGG AACTGGCAAT CTTCGACCTG
CTGACCCAGC CAGAACCGGC ACTGAGCGAG AAGGAGAAGG ACGAGGTCAA GAAGGTCGCC
AAGGAACTTC TCGCCAAGCT GAAGGCCGAG AAGCTGGTCC TCGACTGGAA GCTGAAGACC
CAGACCAAGG CCGATGTCGA GCGCACCATC CGGGACTTCT TCATCAAGCT ACCGTCTGCC
TACACCCCGG CACTCAAGAA GGACAAGCGG GTGAAGACCT ATGCGCATGT CTATGAGAAT
TATTTCGGGG CAGGGCAGAG TGTCTACCAG CGTGGTGCCG CAGGTGCCCA CTGA
 
Protein sequence
MTTVLTAKEV TAAYGELVLV ELPAMDRFAS LGWAVANLYN ETFGQDGTEG RTSEAEVILT 
RRLRKALERI NPGYSASAYD QAIEQLTEDR SKQLPVNANQ AFYKLLRDRV KVEITDNEGN
PQTVELTVID WRNPENNDFF LTQQMWVSGE MYKRRCDLVG FVNGIPLVFI ELKAPHVPLK
SAYDDNLKDY KGQSIPQLFH PNAFILLSNG SQTRIGTLTS PWEHFFEWRR IDDETEAGST
SLDTAIRGMC DKRRLLDIVE NFTIFEEARG GLIKKVAKNH QYLGVNKAIA QMIKLRESGD
VEAAKRLGVF WHTQGSGKSL SMVFFTQKIL RTLPGKWTFL IVTDREELDD QISKTFKATG
AIPNAKLGGT KDNILVRAAS SNHLKQLLKT DERYIFTLIQ KFRTEAGAQY EKLSDRSDII
VITDEAHRSQ YDIFALNMRN ALPHAAFLGF TGTPLIAGEE ERTREVFGDY VSVYDFARSI
EDGATVPLYY ENRTPELQIT NQNLNKDIEK LLEEAELDEA QEQKLEREFA REYHLITRDD
RLETIAEDLV KHFLGRGYRG KAMMVCIDKA TAVRMYDKVQ AYWQQHLAEL HEKLTKATGD
DKEILAEKIM VMDTTDMAVV VSQSQNEIED LKAKGLDIVP HRQRMLKEDL DEQFKDPDGN
LRLVFVCAMW ITGFDVPTCS TLYLDKPMRN HTLMQTIARA NRVSPGKSAG LIVDYVGIFR
NLQDALRIYA KPNQPGQLPI NDKAALVAQL EALLERAESF CTDLGINLKG VVNTPPEKRL
EALQEAMNTI LGEGENTTKA YLLLAGQVAR TFKSILPDAA ANAYAPMSVL VAYLGAMIKA
LRPPPDISDV MNDLDALLDD SIATEGYRIG HKPEAEALID LSQIDFAALQ EKFAEGKKAT
ETEKLKGQIE QKLEKMVREN KGRTDFLEKF QKLIDAYNSS SHNLEAFFKE LLAFAQNLTE
EEQRAHRENL SEEELAIFDL LTQPEPALSE KEKDEVKKVA KELLAKLKAE KLVLDWKLKT
QTKADVERTI RDFFIKLPSA YTPALKKDKR VKTYAHVYEN YFGAGQSVYQ RGAAGAH