Gene Cagg_3300 details


Gene Information

Locus tag: Cagg_3300
Symbol:
ID: 7267774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3997446
End bp: 3999059
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 68%
IMG OID: 643568112
Product: N-6 DNA methylase
Protein accession: YP_002464585
Protein GI: 219850152
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
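
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 3997446
end_bp = 3999059

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop.
assert gene_length % 3 == 0, "CDS length is not a multiple of 3"
codon_count = gene_length // 3
protein_length = codon_count - 1  # drop the terminal stop codon

print(gene_length)     # 1614
print(protein_length)  # 537
```

Under these assumptions the results agree with the listed Gene Length (1614 bp) and Protein Length (537 aa).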


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000500405
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGGGAGA CCTTGGCCAA GCGAAAGAAC AACGCAAGCA GCCCGAACAC GACCGGCGCC 
ACCGTCGGCT ACGAGGCCGA ACTCTGGCAG ATGGCCAACG CGCTCCGCGG TAGCATGGAC
GCCGCCGAGT ACAAGCACGT TGTCCTCGGA CTCATCTTCC TCAAGTACAT CTCCGACGCC
TTCGAGGAGC AGCACGCGCG GCTCGAAGCC GAGCGCGCGC AGGGTGCGGA CCCCGAAGAC
CCCGACGAGT ACCGCGCCGT GAACGTCTTC TGGGTGCCAC CCGAGGCGCG CCGGGCGCAC
CGCAACGCCC GGGTCAAGCA GCCGACGATC GGTCGGCTCG TGGACGACGC GATGGCCGGC
ATCGAACGCG ACAACCCGGC GCTCACGGGC GTGGTGCCGA AGAACGACGA CCGACCGGTG
CTCGACAGGC AGCACCTGGG CCGGCTCATC GACCTCATCG GCACCATCCG AGTCGGCGAC
GAGGAGGCAC GAGCCAAGGA CGTGCTCGGC CGCGTTGACG AGGAGGGTCG CTCGCAGTTC
GCGAGCGCCG AGGGCACGCA ACGCGGCGCG TTGACCACGC CGCGTTGCGT GGTCAAGCTG
CCGGTCGAGA GGCTCGATCC CTACCGCGGC CGCGTCGATG ACCCCTGCTG TGGATCGGCC
GGCATGTTCG TGCAGTCGGT GGAATTCATC CGCGCGCATG CCAACGGAAA TGGCAATGGC
GGCAAGACCG GGGCCGACAT CTCGATCTAC GGCCAGGAGT CGAACTACAC CACCTGGCGG
CTGGCCAAGA TGAACCTCGC CATCCGCGGC ATTGACGGCC AGATCGCCCA CGGCGACACG
TTCCACAACG ACCGCTTCCC GGACCTCAAA GCCGACTTCA TCCTCGCCAA TCCGCCGTTC
AACGTGAAGG ACTGGGGCGG CGAGCGCCTG CGCGACGACA AGCGCTGGAA GTACGGCGTG
CCGCCCGTGG GTAACGCCAA CTTCGCCTGG GTGCAGCGCA TCATTCACCA CCTCGCACCC
ACCGGCTACG CCGGCTTCGT GCTCGCCAAC GGCTCGATGT CGTCGAACCG GTCCGGCGAG
GGCGAGATCC GCAAGCATAT CATCGAAGCC GACCTCGTGG ACTGCATGGT CGCGCTGCCG
GGCCGGCGCT GCTCCGCGAC CCAGATCCCC GCGTGCCTGT GGTTCTCGGC GCGCGACACA
TCGGGGAGGG GTGGGTTCGG ACCCCACCCC TCCCGGAACC GGCGCGGCCA CGTGCGCTTC
ATCGACGCGC GCACGATGGG CTGCATGGTG GACCGCACCC ACTGCGATCT GACCGACGAG
GACATCACGA AGATCGCCGA CACCTCCCAC GCCTGGCGCG GGGAGCAGGA GGCCGGCGAT
GACGCCGACG TGCCCGGCTT CGGCACGAGC GCCACGCGCG AAGCGATCCG CACGCACGGC
GATGTGCTCA CGCCCGGCCG CTCCGTCGGC GCCGAGGCGG TCGAGGACGA TGGCGAGCCG
TTCGACGAGA AGATGAAGCG GCTGGTGACG CAACTGCGCG AGCAGCAGGC CGAAGCTCGG
CGGTTGGATG AAGCCATCTG GAAGAACCTG AGGGAACTGG GCTATGACGC TTGA
 
Protein sequence
MRETLAKRKN NASSPNTTGA TVGYEAELWQ MANALRGSMD AAEYKHVVLG LIFLKYISDA 
FEEQHARLEA ERAQGADPED PDEYRAVNVF WVPPEARRAH RNARVKQPTI GRLVDDAMAG
IERDNPALTG VVPKNDDRPV LDRQHLGRLI DLIGTIRVGD EEARAKDVLG RVDEEGRSQF
ASAEGTQRGA LTTPRCVVKL PVERLDPYRG RVDDPCCGSA GMFVQSVEFI RAHANGNGNG
GKTGADISIY GQESNYTTWR LAKMNLAIRG IDGQIAHGDT FHNDRFPDLK ADFILANPPF
NVKDWGGERL RDDKRWKYGV PPVGNANFAW VQRIIHHLAP TGYAGFVLAN GSMSSNRSGE
GEIRKHIIEA DLVDCMVALP GRRCSATQIP ACLWFSARDT SGRGGFGPHP SRNRRGHVRF
IDARTMGCMV DRTHCDLTDE DITKIADTSH AWRGEQEAGD DADVPGFGTS ATREAIRTHG
DVLTPGRSVG AEAVEDDGEP FDEKMKRLVT QLREQQAEAR RLDEAIWKNL RELGYDA
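
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming plain Python with no bioinformatics library; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence (groups separated by spaces or newlines) into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAGGGAGA CCTTGGCCAA GCGAAAGAAC AACGCAAGCA GCCCGAACAC GACCGGCGCC"

seq = clean_sequence(first_row)
print(len(seq))               # 60 bases in one row (6 blocks of 10)
print(seq.startswith("ATG"))  # True: the row begins with the start codon
print(f"{gc_fraction(seq):.2f}")
```

Applying `clean_sequence` to all rows of the gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content listed in the record.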