Gene Cphamn1_2550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2550 
Symbol 
ID6376248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2722605 
End bp2724188 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content50% 
IMG OID642685028 
Producttype I restriction-modification system, M subunit 
Protein accessionYP_001960925 
Protein GI189501455 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.19461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGAA AAATCGATCA GAAAGACATC AACAGCGCAG CGTGGTCGGC GTGCGACACC 
TTTCGGGGTG TGGTCGATCC GGCGCAGTAC AAAGACTACA TCCTCGTGAT GCTGTTTCTG
AAATACATCT CTGATGTATG GCAGGACCAC TACGAAGAAT ATCAGAAGCA GTATGGCGAT
GATGATATTC GTATCCGCCG CAAGCTCGAG CGTGAACGTT TTGTTCTCCC GGTGGTGAAA
CTCACCGAAA AGAATGACGA AACTGGCGAG GAGGCGGTTC TGGATGAATT TCCCGCCACC
TATTACAGTC TCTATGAACG CAGGTCCGCC GCCAACATCG GTGAATTGAT CAATATCGTT
CTCGATCATA TTGAAGACAG TAACAAGGTC AAGCTCGAAG GTGTTTTCCG GAACATCGAT
TTCAATAGCG AAGCAAACCT TGGCAAGACC AAGGACCGTA ACCGCCGTCT GAAACAACTG
CTGGAAGATT TCCACAAGCC ACAGCTCAAC ATGAAGCCCA GCCTCGTGTC CGAGGATGTG
ATCGGAAACA CCTATATCTA TCTTATCGAG CGATTCGCTT CCGATTCGGG CAAAAAAGCA
GGGGAGTTCT TTACGCCTTT CAAGGTCAGC GAACTGGTCG CAAAGCTGGC CGATCCCAGA
CCGGGTGACC GCATCTGTGA TCCGGCCTGT GGTTCCGGCG GTCTGTTGAT CAAGGCCGCG
AAGGAAGTGG GTGATCGAAA TTTCGCTCTG TTCGGCCAGG AATCGAATGG TAGCACATGG
GCACTGTGTC GCATGAACAT GTTTCTGCAC AGTTTCGACA GCGCGCGAAT CGAGTGGTGC
GATACGCTGA ACAGTCCGTT GCTGGTTGAA AATGACCGCT TGATGAAATT CAATTGCGTC
GTAGCCAATC CGCCCTTCTC ATTAGATAAA TGGGGTGCTG AAAATGCCGA AAGCGATCAA
TACAACCGCT TCTGGCGCGG CGTTCCTCCG AAGAGCAAGG GGGACTGGTC TTTTATCAGT
CATATGGTGG AAATTGCCCT CGAAAAAGAG GGCCGGGTTG CCGTTGTTGT TCCGCATGGT
GTTCTGTTCA GAGGCGCTGC AGAGGGGCGT ATCAGACAGA AAATGATCGA AGAAAATCTG
CTCGATGCAG TGATCGGTCT GCCCGGCAAC CTGTTTCAGA CCACTAACAT CCCTGTGGCG
ATTCTGGTAT TCGACAGGAG CAGAGAAGGA ACCACGAAAG ACACGAAAAG CACGAAAGGT
GAAAACAGGG ATGTTTTGTT CGTTGATGCA AGCCGGGAGT TTGTTTCAGG GAAAAACCAG
AATACCCTTT CCGATGAGCA GATCGCGAAA ATTATGCGCA CCTACAGAGA GCGTACTGAG
GTTGAAAAAT ATGCGCATGT CGCTGATGTT GCGGAGATAA AGGAGAACGA CTTCAATCTC
AATATTCCTC GCTACGTCGA TACTTTTGAA GAGGAAGAGG AGATTGATAT CGACGCGGTG
CAAGAGGAAA TTGATAATCT GGAAAAAGAG CTGGTGGAAG TCCGAAAGCA GATGGCGGAA
AAACTTCAGC AGATTCAGAG GTAG
 
Protein sequence
MSGKIDQKDI NSAAWSACDT FRGVVDPAQY KDYILVMLFL KYISDVWQDH YEEYQKQYGD 
DDIRIRRKLE RERFVLPVVK LTEKNDETGE EAVLDEFPAT YYSLYERRSA ANIGELINIV
LDHIEDSNKV KLEGVFRNID FNSEANLGKT KDRNRRLKQL LEDFHKPQLN MKPSLVSEDV
IGNTYIYLIE RFASDSGKKA GEFFTPFKVS ELVAKLADPR PGDRICDPAC GSGGLLIKAA
KEVGDRNFAL FGQESNGSTW ALCRMNMFLH SFDSARIEWC DTLNSPLLVE NDRLMKFNCV
VANPPFSLDK WGAENAESDQ YNRFWRGVPP KSKGDWSFIS HMVEIALEKE GRVAVVVPHG
VLFRGAAEGR IRQKMIEENL LDAVIGLPGN LFQTTNIPVA ILVFDRSREG TTKDTKSTKG
ENRDVLFVDA SREFVSGKNQ NTLSDEQIAK IMRTYRERTE VEKYAHVADV AEIKENDFNL
NIPRYVDTFE EEEEIDIDAV QEEIDNLEKE LVEVRKQMAE KLQQIQR