Gene Cpha266_0080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0080 
Symbol 
ID4570647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp94386 
End bp95501 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content44% 
IMG OID639764682 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_910574 
Protein GI119355930 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.35686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAAC AATCGGAGCC TACTTGCTCA AAAATTCGTG TTTATGACTT TTTCTCAGGT 
TGTGGGGGAA CAAGTGTCGG TTTTGGGCGA GCAGGTATAC AGCACGCATT GGCTGTTGAT
TCTTGTTCTG ACGCCATAAG TACCTATCAA AAAAATTTTA TTGGTGTTCC CGTCATAACT
GATCCAATAG AAACACTTAA TGTTGACCGA ATACAAAATT ATTTCAGTCA TAACCCGGAA
GTGAAGTTAT TCTGTGGCTG TGCACCATGT CAGCCGTTTA CCAAGCAGAA AACAAATACA
AAAAAAGATG CGGCTTCGGA TGATAGACGC GGATTACTTA TATATTTCTC AGATATTGTT
CATGCGTGTT TGCCTGAGCT TGTTTTTGTT GAGAACGTGC CTGGTTTGCA AAAATTTTCT
CTTGAAGATG GCGGGCCCTT GGCTATGTTT ATAAGCCGAT TAAAGCAAAA CGACTACTTT
GTCGATTTTG ATGTGATAGC AGCTCAGGAC TATGGTTCAC CTCAGGTTCG TAGACGGTTC
GTGTTAATCG CAAGTAGGTT GGGAAAGATT ACTTTACCTG CACCAACGCA TGGCCCAAAT
ACAAAGAATT CGTATGTAAC TGTTCATGAT GCCATTGGCA ACTTACCGTC CGTCAAACAT
GGCACAGAAC ATCCAGACAA TCAAAATTAC CCTAATCACC GGGCCGCAAT GCTGTCAGCA
TTAAATCTTG AGCGCATTAG ACACACTGGC GCGAACGGAC GGCGAGATTG GCCTGAAAGA
CTATTGCCAA AATGTTATGC ACAAAAGAAA GACGGAAAAC GCTATGAAGG GCATTCGGAC
TGTTATACTC GATTAGCATG GGGCGAACCC GCACCAGGGT TGACAACTCG TTGCATCAGT
TATTCAAACG GTCGATTCGG ACACCCAGAA CAGGATCGTG CCATTACGAT CAGGGAGGCA
GCAAAGCTGC AAGGATTTCC TGATGATTTC ATTTTTACTG GTTCACTTAA CTCTATGGCT
CGCCAGATTG GCAATGCAGT TCCTGTGTCT GTCGCGGAGG TATTCGGGAG ACATTTTCTG
AATCACGTTA AAGCCATGGA GAGTACAAAT GGCTAA
 
Protein sequence
MNQQSEPTCS KIRVYDFFSG CGGTSVGFGR AGIQHALAVD SCSDAISTYQ KNFIGVPVIT 
DPIETLNVDR IQNYFSHNPE VKLFCGCAPC QPFTKQKTNT KKDAASDDRR GLLIYFSDIV
HACLPELVFV ENVPGLQKFS LEDGGPLAMF ISRLKQNDYF VDFDVIAAQD YGSPQVRRRF
VLIASRLGKI TLPAPTHGPN TKNSYVTVHD AIGNLPSVKH GTEHPDNQNY PNHRAAMLSA
LNLERIRHTG ANGRRDWPER LLPKCYAQKK DGKRYEGHSD CYTRLAWGEP APGLTTRCIS
YSNGRFGHPE QDRAITIREA AKLQGFPDDF IFTGSLNSMA RQIGNAVPVS VAEVFGRHFL
NHVKAMESTN G