Gene Strop_0568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0568 
Symbol 
ID5057008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp638553 
End bp640511 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content67% 
IMG OID640472838 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_001157427 
Protein GI145593130 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase
[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.905473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAA ATACCGTCGA ACGGCGTCGG GCAGGCTACG GCGTCAAACT CGTACGTGGC 
CCGTTCGTAC GCCTGGCGCC ACACCCCGAG GCATGCGAGA ACGACGACGA GTTCCTCCGC
TTCGCAAAAG CGTGCCGTGA GCGGGGCGAG CGGCTGGCAG CGGACCTGTT CAGCGGCGCG
GGTGGTCTGA GCCTGGGTCT CACCGAGGCC GGTTTCCGTG TGGTCCTGGC GGCCGACCGC
GACCCAGAGT CCGTCGAGAC ACATCGTCAT CACTACCCGG GCCTCACCCT CGACTACGAC
CTGGGAGAGT CCGCCAACAT CCGGCGAATC GCCGCTCTGG TCAAGGAGGC GGGGATCGAG
CTGCTGACCG GCGGCCCACC ATGCCAACCC TTCTCCCGAG CGGGCCGGTC ACTGATCCGT
CACCAGGTCC GTCACGGTCT CCGCCCGGCA CATGACGAAC GCCGGGACCT CTGGCACTCG
TTCCTGGAGG TCATTCAGCT GGCCACACCC GCCGCCGTGA TCATGGAAAA CGTCCCCGAC
ATGGCACTCG ATCGGGAGAT GTTCATCCTC CGCACCATGG TGCACGAACT GGAGTCGATC
GGTTACGCCG TCGAGGAACA GGTCGTCGAC ACCCTCCGTT ACGGCGTACC CCAGTTCCGC
CAGCGATTGA TCCTGGTTGC ACTGCGAGAC GGAGTCGCCT TCGACTGGCC GCCGGAGGTA
CCGGACCGGG TGTCGGTGTG GAACGCGATC GGCGACCTAC CCGAGGTCGA GGGCGGCTGG
AGGCCCGAGG GTGGTGCCGA CGGCTGGTCA GACTACACCG GCCCCGCTTC CGCCTTTCAG
AGGCAGATGC GTCAGGGTGT TGCACCGGTG GACACGGGCA AGGTCTTCGA CCACATCACC
CGGCCCGTAC GCGAGGACGA CCAGCGCGCC TTCGACATGA TGGACGCCGG CACCCGATAC
TCAGAACTGC CGGAGGACGT CAAGCGCTAC CGCGACGACA TCTTCGACGA CAAGTACAAG
CGCCTGAGCG AGGACACCTA CTCCCGCACT ATCACCGCGC ACATCGCCAA GGATGGCTAC
TGGTACATCC ATCCCCGACA GGACCGCACA CTGACGGTGC GTGAGGCCGC GCGGCTGCAG
ACCTTCCCCG ACTGGTTCCG CTTCGCCGGC CCACCGTCGG CCGCGTTCCG TCAGATCGGC
AACGCCGTAC CCCCGGCGCT GGGCACCCAG CTCGGACGCG CCGTGATGGC AGCGCTGGAC
GCCCTCAGGC CAACTCCGTA CCGCAGCCGC GACATAGCCC ACGCCCTCGC CACCTGGTTC
GACGACCTCA GCGAGCCGGC CCTCCCGTGG CTGCGCGCTC GGACACGGTG GCAGGTCATC
TCCGCGGAGA TGCTTCTCGA CCGGACAGCA CCCGAGCAGG TTCGAATCCT CTGGTCACTG
CTCGAACGAT GGGAGCAGCC ACAGGACACC GTCGACGCCG GCGACGAACT CGTCGAGATC
GGGCGGTGGA TCAACCGCGA GCACCGCGCG GAGCGGCTGC TCGAACTGGC GCGCACGCTC
ACCTCGCAAC CAGACCTGCT CGACGACTAC AAGATCCACA GCCTTCGCGG GGTCGACGCC
TCGGTGATCG ACCTTGCTGT CCTCGCGATC CCAACCCGCG ACGAAGACAA CGCGGAGGAA
CCCGTCCTCA TCACCAAGGG CACTCACCGC GTCGCCGCCC GCTTCACCGG CGAGCACGTC
GAACGCAGCC ACAGAATGAC AGCAGGGCGT CTGGCCGTTG CCCGAATGAT CGGCGATGAC
GCCGACGCGC GGCGAGCACA CCTCGGCCTG ATCGAGCTGG CCACCTCCGT CTGCCGCCCG
ACCGACCCTG CCTGCCCACG CTGCCCCCTC AACAGGGCAT GCAGCGAGGC TGCCAAACGG
GGCAGTCGGA TCGAACGGCA GCCGGTGGTC AGTCCCTGA
 
Protein sequence
MTENTVERRR AGYGVKLVRG PFVRLAPHPE ACENDDEFLR FAKACRERGE RLAADLFSGA 
GGLSLGLTEA GFRVVLAADR DPESVETHRH HYPGLTLDYD LGESANIRRI AALVKEAGIE
LLTGGPPCQP FSRAGRSLIR HQVRHGLRPA HDERRDLWHS FLEVIQLATP AAVIMENVPD
MALDREMFIL RTMVHELESI GYAVEEQVVD TLRYGVPQFR QRLILVALRD GVAFDWPPEV
PDRVSVWNAI GDLPEVEGGW RPEGGADGWS DYTGPASAFQ RQMRQGVAPV DTGKVFDHIT
RPVREDDQRA FDMMDAGTRY SELPEDVKRY RDDIFDDKYK RLSEDTYSRT ITAHIAKDGY
WYIHPRQDRT LTVREAARLQ TFPDWFRFAG PPSAAFRQIG NAVPPALGTQ LGRAVMAALD
ALRPTPYRSR DIAHALATWF DDLSEPALPW LRARTRWQVI SAEMLLDRTA PEQVRILWSL
LERWEQPQDT VDAGDELVEI GRWINREHRA ERLLELARTL TSQPDLLDDY KIHSLRGVDA
SVIDLAVLAI PTRDEDNAEE PVLITKGTHR VAARFTGEHV ERSHRMTAGR LAVARMIGDD
ADARRAHLGL IELATSVCRP TDPACPRCPL NRACSEAAKR GSRIERQPVV SP