Gene Caci_3037 details

Gene Information

Locus tag: Caci_3037
Symbol:
ID: 8334388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3351919
End bp: 3353370
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 67%
IMG OID: 644956183
Product: C-5 cytosine-specific DNA methylase
Protein accession: YP_003113787
Protein GI: 256392223
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID:
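
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 3351919
end_bp = 3353370
gene_length_bp = 1452
protein_length_aa = 483

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the final codon is the stop codon and is not counted as a residue.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)  # 1452 483
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.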


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0129697
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.245214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCA CTTTTACCGA CATTTTCTGC GGCGCCGGCG GAAGCTCAAC CGGCCTTGTC 
GCTGCGGGCT TCGAGCTGAA GCTGGCGGCA AACCACAGCA AGGTCGCGAT CTCCACGCAC
GCCGCCAACC ACGGCAACGC CGAGCACGTC TGCGCCGACG TCAACAACTA CGACATGCGG
CGCCTGCCCA CAACCGACGT GCTGTGGGCA TCGCCGATCT GCACCGAGAT CTCACCCGCC
GGCGGGCGGG GACGTTCCCG CAAGCTGCTG CCCGGAGAGG AGGCGCTGCT GGAGTACGGC
CCGGTGGAGA ACGCAGCCTG GGAACGGACC CGCGCCACCG CCTACGACGT CATCCGCGCC
GCCGAGGTCC ACCGCTACAA GGTCGTGATG TGCGAGAACG TCATGGAGTT CGCCACCGAC
TGGGAATTGT TCGACTGGTG GTTCAGCGGC ATGGAACGCC TCGGCTACCA GGGGCAGATC
GTGTCGGTGT CCGCAGCCCA CATCGGCGGC GACGGCAACG AAGCCGCGCC GCAGTGGCGG
GACCGGATCT ACATCGTGTT CACCCTCAAG GGCATTCCGC TGCCGGACCT GAAGCCGCGT
CCGCTCGCCT GGTGCCCCGA GTGCGGAACC GATGTCCGAG CCGTACAGGC ATGGCGCAAT
GGCCGCAAGA TCGGCAAGTA CAAGCAGCAG TACGACTACC GTTGTGAGAA CTCGTCATGC
CGCCACAGCA TCGTCGAGCC CTACATCAAC CCGGCCGCGT CCATCATCGA CTGGGACAAC
CTCGGCGAGC GCATCGGCGA CCGCACCAAG CCGCTGGCCG CGTCCACGAT GAAGCGGATC
GCCGCCGGGC TGGTGAAGTT CCCCGACCGG CGCAGCGTCA TCACCGTCAA CCACTCCGGG
CACGACGGGC GCGCGTTCCC CGCCGACGAG GGGCCGCTGC CGGTCCGCAG CACGAAGATC
GGCGAGGGGC TGTTGATCCC GTGCGGCGGC GGCTGGAACA CGACCGCCTC GCCGACGAAC
GTTCCGATGC GGACCCGGAC GGCCAACCCG AAGGGCTTCG AGGCGCTGGT CGCAACGTCC
ACGCCGTTCA TCGTCGAGTA CCGCAACCAC GCCGATGCCT CGGCCGTGAC TCAGCCGTTG
GCGACTGTCA CGTCCGGCGG GAACCACCAC GCGCTGGTGG TGCCGTGCCG CAATGCCTCG
ACGAAGACGA CGAGCGAGCC GTTCCACACG ATGTCCACGG TGGACTCGGC CGCGCTGGTT
GGGCCTGCGG TCGACATCAA CGACTGCTGG TTCCGGATGG TGCAGCCGCG CGAGCAGCTG
TACTCGCAGC GATTCCCGCG CGACTACATC GTCCACGGCA CCAAGGGTGA GCAGACGATG
CAGGCCGGAA ACGCCGTCGC CTGCAACGTT GCCCAGTGGG TCGGCGAGCG CGTTATGGCG
GTGCTGTCGT GA
 
Protein sequence
MTITFTDIFC GAGGSSTGLV AAGFELKLAA NHSKVAISTH AANHGNAEHV CADVNNYDMR 
RLPTTDVLWA SPICTEISPA GGRGRSRKLL PGEEALLEYG PVENAAWERT RATAYDVIRA
AEVHRYKVVM CENVMEFATD WELFDWWFSG MERLGYQGQI VSVSAAHIGG DGNEAAPQWR
DRIYIVFTLK GIPLPDLKPR PLAWCPECGT DVRAVQAWRN GRKIGKYKQQ YDYRCENSSC
RHSIVEPYIN PAASIIDWDN LGERIGDRTK PLAASTMKRI AAGLVKFPDR RSVITVNHSG
HDGRAFPADE GPLPVRSTKI GEGLLIPCGG GWNTTASPTN VPMRTRTANP KGFEALVATS
TPFIVEYRNH ADASAVTQPL ATVTSGGNHH ALVVPCRNAS TKTTSEPFHT MSTVDSAALV
GPAVDINDCW FRMVQPREQL YSQRFPRDYI VHGTKGEQTM QAGNAVACNV AQWVGERVMA
VLS
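
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a minimal sketch, not the method used to produce the record: it builds the codon-to-amino-acid mapping of translation table 11 (whose amino acid assignments match the standard code), strips the spacing between the 10-base blocks, and translates until the first stop codon. The helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first 60 bases of the gene are used here; the same functions could be applied to the full sequence to compare against the Protein sequence and GC content fields.

```python
# Codon table built from the NCBI-style amino acid string for table 11
# (base order T, C, A, G for each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record.
gene_prefix = "ATGACCATCA CTTTTACCGA CATTTTCTGC GGCGCCGGCG GAAGCTCAAC CGGCCTTGTC"
print(translate(gene_prefix))  # MTITFTDIFCGAGGSSTGLV
```

The printed residues match the first two blocks of the protein sequence. Note that this sketch does not handle the alternative start codons permitted by table 11; the gene here starts with ATG, so that does not matter for this record.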