Gene Cpha266_1712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1712 
Symbol 
ID4571072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1940022 
End bp1942010 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content59% 
IMG OID639766295 
ProductN-6 DNA methylase 
Protein accessionYP_912154 
Protein GI119357510 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.857909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTGGA TCGCACCTTC CGAAAAAGAT ACCGCTACCG CCGCACTCGA AAAGCGCCTG 
TGGGATGCCG CCGACCAGCT TCGGGCGAAC TCCGGCCTCA AGGCCCAGGA GTATTCCGCA
CCCGTTCTCG GGCTTATTTT CCTGCTTTTT GCCGACGTGC GGTTCGCCGC CCGGAGGGCT
GAGCTTGAAT CGGCAAAGAG CAGTACTCGC CGGGGGAGCC GGGTGGACGA TCCGGCGGCC
TATCATGCCG AAGGCGTGCT CTACCTTTCG CCAGAAGCGC GGTTTGTCTA CCTGCTCAAC
CGTCCTGAAG CCGAGAACAT CGGCGTGATG GTCAACGAAG CCATGCGCGC TATCGAGAAG
CACAATCCGC AGCTTGCCGG TGTGCTGCCG AAGACCTACT ACCTGTTCGA CAGCCCGCTT
CTGAAGCAGT TGCTGAAAAA GGTGTCGGAG ATTCCCTCTT CGATGGATTA CGATGCGTTC
GGACGCATCT ACGAGTACTT TCTCGGCGAA TTCGCCATGA GCGAAGGGCA GGGCGGGGGA
GAGTTCTATA CGCCCGTCAG CATCGTTCGT CTCCTGACCG AAGTGATCGA GCCATATCAC
GGGCGCATTC TCGACCCCGC ATGCGGTTCC GGCGGCATGT TCGTCTCCTC GGCCCGGTTC
GTTGCCCAGC ACAAGCAGAA CCCTTCGGCA GAACTCTCCA TTCACGGCAT CGAGAAGACC
GACGAGACGG GAAGGCTTTG TCGCCTGAAC CTTGCGGTGC ACGGGCTTGA AGGCCGTATC
ATGCATGGCG GCAACGTGAA CAGCTACTAC GACGATCCGC ATGATGCAAC GGGGAATTTT
GATTTCGTGC TGGCCAATCC GCCGTTCAAT GTTAACGCCG TTGACAAGGA ACGCCTGAAA
GATTCGGTCG GTCCGGGACG ACGCTTTCCT TTCGGTCTTC CGCGAACCGA CAACGCGAAC
TATCTCTGGA TACAGCTTTT CTACTCGGCA CTGAACGAAA GGGGGAGAGC CGGTTTCGTC
ATGGCGAACT CGGCTTCCGA CGCCCGCTCC TCGGAGCAGG AAATCCGTCG CCAGCTTATC
GAAAGCCGTA CGGTGGACGT AATGGTCGCA GTCGGGCCGA ACATGTTCTA CACCGTCACG
CTGCCCTGCA CGCTGTGGTT TTTCGACAAG GCGAAAGCAA GGCTTTCGCC ACCCTCATCC
CCGGCCCTTC TCCCAAAGGT AGAAGGGGGA GAAGAAGATT TACCATTATC CAGACGCATT
CTCACAGAGC GCGACGGGGA AGGGAATGTA CCGAACAGGG CGGATACGGT GCTGTTTATC
GATGCACGGC ACATCTACCG GCAGGTTGAC AGGGCTCATC GCGACTGGAC GCCCGCCCAG
ATCGGCTTTA TGGCCAACCT TGTCCGTCTC TGGCGCGGCG AAGCGCTCGA CTACACGCTG
GGTGGCGACG AAGCTCGCGA AAAGATCGAA GAGGTTTTCG CTGCCAAAAG TTCTGACCCG
GCAGGCTTGA ACGGGCAGGA GGGGCATTCT GCCCACGCAC TGGCCGCCGA ATCCCCCGCT
CCATACGGAT CAAGTGATGA AATTGAAAAA CTCCCTTCTA CCTTTGGGAG AAGGGCCGGG
GATGAGGGTG CTGTCAAGCA TGAGGGTGCC GGGAACGTCG CCTATCGCGA TGAAGCAAAA
GAACACCCTT CGCCCTCTGG GAGAAGGGCC GGGGATGAGG GTGCTGTCAA GCATGAGGGT
GCCGGGAACG TCGCCTATAG CGACATTCCC GGCTTATGCA AGGCCGCCAC ATTAAAGGAA
ATAGAGGCGC AGGGCTGGTC GCTCAATCCC GGCAGATATG TCGGCGTTGC TCCCGGCGAG
GCAATCAGCG ACGAGGATTT CAAGGTCCAG CTCGAAACGC TGAACGAAGA ACTGGAACTT
CTGAATGCGC AGGCGCGTGA ACTGGAGGCA ACGATTGCCG GAAATGTGGC GCAAATTTTG
GAGACCTGA
 
Protein sequence
MHWIAPSEKD TATAALEKRL WDAADQLRAN SGLKAQEYSA PVLGLIFLLF ADVRFAARRA 
ELESAKSSTR RGSRVDDPAA YHAEGVLYLS PEARFVYLLN RPEAENIGVM VNEAMRAIEK
HNPQLAGVLP KTYYLFDSPL LKQLLKKVSE IPSSMDYDAF GRIYEYFLGE FAMSEGQGGG
EFYTPVSIVR LLTEVIEPYH GRILDPACGS GGMFVSSARF VAQHKQNPSA ELSIHGIEKT
DETGRLCRLN LAVHGLEGRI MHGGNVNSYY DDPHDATGNF DFVLANPPFN VNAVDKERLK
DSVGPGRRFP FGLPRTDNAN YLWIQLFYSA LNERGRAGFV MANSASDARS SEQEIRRQLI
ESRTVDVMVA VGPNMFYTVT LPCTLWFFDK AKARLSPPSS PALLPKVEGG EEDLPLSRRI
LTERDGEGNV PNRADTVLFI DARHIYRQVD RAHRDWTPAQ IGFMANLVRL WRGEALDYTL
GGDEAREKIE EVFAAKSSDP AGLNGQEGHS AHALAAESPA PYGSSDEIEK LPSTFGRRAG
DEGAVKHEGA GNVAYRDEAK EHPSPSGRRA GDEGAVKHEG AGNVAYSDIP GLCKAATLKE
IEAQGWSLNP GRYVGVAPGE AISDEDFKVQ LETLNEELEL LNAQARELEA TIAGNVAQIL
ET