Gene Clim_0646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0646 
Symbol 
ID6354094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp721905 
End bp723011 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content45% 
IMG OID642668277 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_001942712 
Protein GI189346183 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID[TIGR01764] DNA binding domain, excisionase family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000621493 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC ATTATACAAC AGAAGAGGCC GCTCACTATC TGGGCGTATC TTCAGCAAGA 
ATACGTCAAT ATATTCTTGA AGAGCGCCTC CAGACAGATA AATCCGGCAG AGACCACTTG
ATTGCCGAGT CTGTTCTTGC TGAATTTGCC AGGTTTGGCA GAAAAAAGGT AGGACGTCCC
TTCCATGAAT TGTGCAATAC GAATACGGTC ACAGTCGGGT CAGAACGAGC ATCCGCATCA
AACACTCTTA TCAACAGAGA ACTGCTTGAT GAAGAGGGAG TGCAGGTGAT CAACGGAGAT
ACCAGGGATA GTATCAAAAG CCTTCCTGAC AACACGTTCA GATGTGTTGT TACATCTCCA
CCCTATTGGG GTGTGCGAGA TTATGGCGTT GAGAATCAGA TTGGTGCAGA GCCTGACCTT
AAGGATTATG TAAATGCTCT TGTCGAAATA TTTTCCGAGG TGCGACGAGT GCTCAAATCT
GACGGAACAT TCTGGCTCAA TATCGGCAAT ACCTATACTT CAGGCGGAAG AAAATGGCGA
CAGGAAGACT CTAAAAATAA AGGTCGAGCA ATGTCGTACC GGCCGCCTAC GCCTGATGGT
CTGAAAAAAA AAGACCTTAT CGGCGTAGCA TGGATGGTGG CAATGGCTTG CCAGCTTGAC
GGATGGTATT TAAGAAATGA CATTATCTGG CACAAGCCGA ATTGCCAACC GGAAAGCGTA
AAAGACCGCT TAACGGTATC TCATGAGTAC CTCTTCATGT TCTCAAAATC TGAACAGTAC
TATTTTAATC AGGAGGCAAT CAAGGAGTCG TATACAAACG GAAACGGCTT CAAAAACAAG
CGGACCGTCT GGTCAATCAA TACCGAACCT TGTGCAGAAG CCCATTTTGC GGTTTTCCCT
AAAAATCTTG TACGTCCATG CATATTAGCC GGGTCAGAGG AAAACGACCT GATTCTTGAC
CCTTTCTATG GATCCGGGAC GGTTGGAATT GTATCGATGG AACTCAACAG AAAATGTGTC
GGTATTGAAA TAAATCAGGA TTATGTTGAC ATAGCAAGCA AACGCAACGC ACGGGTACAA
GGTGCACTTA TACTGCAGGA ATCGTAA
 
Protein sequence
MSKHYTTEEA AHYLGVSSAR IRQYILEERL QTDKSGRDHL IAESVLAEFA RFGRKKVGRP 
FHELCNTNTV TVGSERASAS NTLINRELLD EEGVQVINGD TRDSIKSLPD NTFRCVVTSP
PYWGVRDYGV ENQIGAEPDL KDYVNALVEI FSEVRRVLKS DGTFWLNIGN TYTSGGRKWR
QEDSKNKGRA MSYRPPTPDG LKKKDLIGVA WMVAMACQLD GWYLRNDIIW HKPNCQPESV
KDRLTVSHEY LFMFSKSEQY YFNQEAIKES YTNGNGFKNK RTVWSINTEP CAEAHFAVFP
KNLVRPCILA GSEENDLILD PFYGSGTVGI VSMELNRKCV GIEINQDYVD IASKRNARVQ
GALILQES