Gene Daro_3525 details

Gene Information

Locus tag: Daro_3525
Symbol:
ID: 3567390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3774763
End bp: 3776478
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 50%
IMG OID: 637681997
Product: C-5 cytosine-specific DNA methylase
Protein accession: YP_286724
Protein GI: 71909137
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
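
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3774763
end_bp = 3776478

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1716, matching "Gene Length"
print(protein_length)  # 571, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1716 bp and protein length of 571 aa.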


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000124097
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00602561
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGACAG AAGTAGCTTG CCAACACTTT GAGGTTGGTT TTGAATGCCG AGTTACTCAG 
AAAATGGCAG AAAAAAATAC GATGAAATCC ACCAAGGCAA AACCACGCGC GATTGATCTT
TACTCAGGGA TCGGCGGGTG GTCACTCGGC CTTGAAATGG CGGGCATTGA GGTCGTTGCG
TCCTATGAAT GGTGGGACAA GGCCAATCGA ACCAATCACA AAAATAATCA ACATCTCGCT
ACTGAGATCG ATATCCGACA GTTGCGACTC GAAGATTTGC CCAAGAACAT CGATATTGTG
GTTGGAAGCC CTCCTTGCAC ACAGTTTTCC TTCGCAAATC GTGGAGGTAG TGGCGATATT
GAGGATGGTC TGAAAGATAT AGCCAAGTTT TTGGCGGTGG TCGATTACGT CCGCCCCAAG
CATTGGGCAA TGGAAAACGT CCCTCGTGTT GCGAGCATCA TCGAGAAGGA AATGCAGATT
GGCGGTCGTC TAGCCCATTT TTCCCACCTG AAGCCAGTAA TCAAGATTGT TGATACTTGC
GAGTGGGGAG TGCCACAGCG TCGTCAGCGC TGCATCGTTG GCAACTTCGA TTTCGGCCTA
CTCCAGTCGT ACAAAGAGCA TACAAAGCCG CGAACACTTG GTGACGTTGT TTCCGTACTC
TCTCAAGCAA CCATAACCGA TCCGATTTAC GGTATCGAAG TGGCTAGAGA AGAGCTTACC
GATCACGTGA TTGAGGAGTT CCTCTCAGCA GAAGAGGAGC GGATGAATCG GGAAATGAAG
ACCTACCACC CGGTTTATAA CAACATGGCC TTCCCCGATC CCCTAAACCG AACCGCAAGG
ACGATTACGG CGACCTGTAC GCGGGTATCC AGAGAGAGCG TTGTGATTGC AGCGCCTGAA
CAGAAGGGAC GCTTCAGGCG TTTGACCGTG CGTGAGCGAG GGTGTCTTCA GGGATTCCCT
ATTACTTACC AGTTTTTTGG TGATAGCTAT GCTCAAAAAC TGAAGATGAT TGGAAATGCA
GTTCCTCCGC TCTTTACGTT TTACGTTGCA CAAGCGATGT TGGGGATTTC GCCAGAGACT
CTCATTCAAC CCAATGAGGG TATCGGTCGG TTTGTTGGGA CAGAGGAGCG CCCCAAAGTA
ACTCGCCCTG ATACTGCCGG TGACTCCTAC TCTTCCACCA GACGGTTCAG AGCAGCACTT
CCTCATCTAA GGTTCAAGAG TGGGGTGCGT TTTGAGCTGG CAAACTCGTT TGTCGGTAGC
GTACCTGAAT GGAGGGTCAA GTTCTTCTTC GGTAATTCGA AGAACATTAC AGAACTCCCT
CTTTCAGATG GCCTTCTGAA GAAGTTGAAA GCGTCCAAAG AGGTTGAGGG TGTCTTGCCT
CAAGTTATGA CATCAATCGC CCACGTAGAT CAGGTGATTT CTTCAACTGA TGCCCAGACC
CTTCAACGTG TCTGGACTCA TACGCTAGAG CATCGCTTCC ATCCGTACGA TGTTGTTGAT
GTTCTGGGTC AGGCAACAGA GGAAACGATC ACTCTCCTTT GCAAGCACAA CCGGGGGTCT
GAAGGCGTAA TCGCGAAAAT CCTCGAAGAT ATGGGTTACC CACAAGGTTC CGACAAGGTA
CTCAAATACT CGGATGCGGT ACTTGCAGGG TTGCTGGTTG GAGCGTTTGC AAATCTGCGT
TTCTGCGACC CCTCGTTCTC AAACTCTAAG GGCTAA
 
Protein sequence
MTTEVACQHF EVGFECRVTQ KMAEKNTMKS TKAKPRAIDL YSGIGGWSLG LEMAGIEVVA 
SYEWWDKANR TNHKNNQHLA TEIDIRQLRL EDLPKNIDIV VGSPPCTQFS FANRGGSGDI
EDGLKDIAKF LAVVDYVRPK HWAMENVPRV ASIIEKEMQI GGRLAHFSHL KPVIKIVDTC
EWGVPQRRQR CIVGNFDFGL LQSYKEHTKP RTLGDVVSVL SQATITDPIY GIEVAREELT
DHVIEEFLSA EEERMNREMK TYHPVYNNMA FPDPLNRTAR TITATCTRVS RESVVIAAPE
QKGRFRRLTV RERGCLQGFP ITYQFFGDSY AQKLKMIGNA VPPLFTFYVA QAMLGISPET
LIQPNEGIGR FVGTEERPKV TRPDTAGDSY SSTRRFRAAL PHLRFKSGVR FELANSFVGS
VPEWRVKFFF GNSKNITELP LSDGLLKKLK ASKEVEGVLP QVMTSIAHVD QVISSTDAQT
LQRVWTHTLE HRFHPYDVVD VLGQATEETI TLLCKHNRGS EGVIAKILED MGYPQGSDKV
LKYSDAVLAG LLVGAFANLR FCDPSFSNSK G
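
The sequence blocks above are laid out in space-separated groups of ten characters. A minimal sketch for stripping that layout and computing GC content follows; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence block, copied from the record.
first_line = "ATGACGACAG AAGTAGCTTG CCAACACTTT GAGGTTGGTT TTGAATGCCG AGTTACTCAG"
sample = clean_sequence(first_line)

print(len(sample))         # six groups of ten bases
print(gc_percent(sample))  # GC percentage of this line only, not the whole gene
```

Passing the full gene sequence block through `clean_sequence` before calling `gc_percent` would give a figure comparable to the "GC content" field, which the record reports rounded to a whole percent.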