Gene Cphamn1_0333 details

Gene Information

Locus tag: Cphamn1_0333
Symbol:
ID: 6373993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 337284
End bp: 338927
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 55%
IMG OID: 642682852
Product: N-6 DNA methylase
Protein accession: YP_001958783
Protein GI: 189499313
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
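
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(337284, 338927)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length reported in the record.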


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.175232
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAACA ACAACGGTAA CGGAAAGTCG CTCGAATCCT GGATCTGGGA TGCGGCATGT 
TCCATCCGTG GCGCAAAAGA TGCACCGAAG TATAAGGAAT TCATCCTGCC GCTGATCTTT
ACCAAGCGCC TGTGCGATGT CTTCGATGAT GAACTGAACC GCATTGCCGC CGAGGTTGGT
TCACGCAAGA AGGCCTTCCA GCTGGTCAGA GCTGATCACA AGCTGGTCCG TTTCTATCTC
CCCCTTGTTC CCTTCGATCC CGAGGAGCCC GTCTGGTCTG TTATCCGCAA GTTCTCCGAC
AGGATCGGTG AAGGCGTTAC CACTCACATG CGAGCCATCG CCAGAGAAAA TCCGCTCCTG
CAGGGCATCA TCGACCGCGT CGACTTCAAC GCCACCACGC ACGGCCAGCG GGATATCGAC
GATGATCGCC TCTCCAATCT CATCGAAGCC ATCAGTACCA AGTGTCTCGG GCTTGATGAC
GTCGAGGCTG ACATCATCGG CAAGAGCTAC GAATACCTCA TCCGCAAGTT CGCCGAAGGC
GGCGGTCAGA GCGCGGGCGA GTTCTACACT CCCCCAGAAG TCGGCACCAT CATGTCCAGG
GTGCTCGCGC CCGAACCGGG CATGGATATC TACGATCCCT GCTGCGGTTC CGGTGGCCTT
CTTGTGAAAT GCGAGATCGC CATGGAAGAA AAACGGAGGG AGATAAAGGA GGGCGGACAT
TCCTGTCCGC CATTACATTC TGAACTCAAT GGTTATCCTT CATGTAATGG CGGGTTGGAA
AACCCGCCCT CCATTGCCCC TCTCAAGCTC TATGGTCAGG AATACATAGC CGACACCTGG
GCCATGGCCA ATATGAACAT GATCATTCAC GACATGGAAG GTCAGATCGA GATCGGTGAT
ACCTTCAAGA ATCCCAAATT CCGCAACAAA CAGGGTAAAC TCCGCACCTT CGACCGCGTC
GTCGCCAATC CCATGTGGAA CCAGGACTGG TTCACCGAGG CCGACTACGA TAACGATGAA
CTCGACCGCT TCCCCGCCGG GGCCGGATTC CCCGGCAAAT CTTCCGCCGA CTGGGGCTGG
ATTCAGCACA TACACGCAAG CCTGAATAAC TCGGGCCGCG CCGCCATCGT CCTCGACACC
GGGGCAGTCT CCCGCGGATC GGGCAATGCC GGCACCAACA AGGAAAAAAG CGTTCGCAAG
TGGTTTGTCG ATAACGACAT CATCGAAAGT GTGCTCTACC TTCCCGAAAA CCTTTTCTAC
AACACTACTG CTCCGGGTAT TGTCTTGTTC CTGAACAGGG ATAAAGAAAT AGAAAGAGAG
GGTTGTGTGC TGCTCGTCAA CGCCAGCCGG ATCTTCGAAA AAGGTGACCC CAAGAACTTC
ATTCCGGACG AGGGCATCAA GCGCATCGTC GATACCCTCA TCGGGTGGAA GGAGGAGGAA
AAGCTCAGCC GCATCGTCAA TCTCGCCGAG CTGAAGAAGA ACGACTACAA CATTTCCCCC
AGCCGCTACA TCCACACCGG AGAAGCCGAA ACCTACCGGC CCATTGAGGA TATCGTCAAG
GATCTGAACG CCATCGAGCT GGAGGCAAGA GAAACCGATG AAGCCCTGCG GAAGATCCTC
AAGCAACTGG GGGTGGATGT ATGA
 
Protein sequence
MANNNGNGKS LESWIWDAAC SIRGAKDAPK YKEFILPLIF TKRLCDVFDD ELNRIAAEVG 
SRKKAFQLVR ADHKLVRFYL PLVPFDPEEP VWSVIRKFSD RIGEGVTTHM RAIARENPLL
QGIIDRVDFN ATTHGQRDID DDRLSNLIEA ISTKCLGLDD VEADIIGKSY EYLIRKFAEG
GGQSAGEFYT PPEVGTIMSR VLAPEPGMDI YDPCCGSGGL LVKCEIAMEE KRREIKEGGH
SCPPLHSELN GYPSCNGGLE NPPSIAPLKL YGQEYIADTW AMANMNMIIH DMEGQIEIGD
TFKNPKFRNK QGKLRTFDRV VANPMWNQDW FTEADYDNDE LDRFPAGAGF PGKSSADWGW
IQHIHASLNN SGRAAIVLDT GAVSRGSGNA GTNKEKSVRK WFVDNDIIES VLYLPENLFY
NTTAPGIVLF LNRDKEIERE GCVLLVNASR IFEKGDPKNF IPDEGIKRIV DTLIGWKEEE
KLSRIVNLAE LKKNDYNISP SRYIHTGEAE TYRPIEDIVK DLNAIELEAR ETDEALRKIL
KQLGVDV
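
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the nucleotide sequence, and compute GC content. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in the start codons it permits). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna_text)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna_text)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence from the record above.
print(translate("ATGGCAAACA ACAACGGTAA"))
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is a simple way to check that the two listings agree.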